Gene Cagg_3624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3624 
Symbol 
ID7269768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4403783 
End bp4405060 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content56% 
IMG OID643568431 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002464897 
Protein GI219850464 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.541774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAAAAC GGTTCGTATT CCGTCTTCTA CTATTGGGCT GGTTGCTGGC CTTCACGGCA 
TGTGCTGCAT CACCGGCGAG CGTACCGCCG GCTCAAGCAG GCCGAACGGT GTTGCGCCTG
TGGCATGCGT GGCCTTCAAC CGAAGGACGT GTGCTGCAAA CGCTTGTCGA ACAGTTTAAC
CAAGCCCATC CGCAGTGGCA AATTGTCGTC CAAGCTCGTC CGGCAGTGTC TCTACCTGCC
GATCTGATGA CGGCCGTGCA AGAAGGTGGT GGCCCGCATT TGGCGATTGT CCAAAGCCAT
ACCCTTGGCA CTTTGGTCGA TGCCGGGGTT GTTCGCCCGC TCGATGATGT GATCGCAGCC
GGTGAATTGT CTAGCCTGTT GCGGGCTGCC GTCGGGTCGG CCCAAGTCAC CGTTGCCGGT
CAACCAACAC TCGTTGGCGT ACCTATCAGC TTTGATACAT TGGCTCTCTA CTACAACCGT
GCTAACGTCT TGCAACCACC AACTACGATC GAAGAGCTGT TGCAGACCGG GCGAGCTTTG
ACCGATCGCA ATCGGGTGCC ACCGGTGTGG GGATTGGCCT ACAATCTGTC ATTAGATCGC
ACGATTGGTT ATCTCTACGC CTTCGGTGGG CGTGTTTTTG ATGAAAATGG CACGTTAGTG
CTTGGCGATA GTGGGCGGGA AGGCACAGAG CGTTGGCTGG CATGGCTCGG GCAGTTATAC
CGTGATGAAC AATTGTTAGC CACACTCGAT GGTGTGGTGG TGGATCGGGT ACTCCAATCA
CGTGACGCAA TTATGGCGAT CGATTGGGCG CATGCCCAAG CTGAATATCG TGCAATTTGG
AACGATCAAC TAGGTGTCGT GCCTTTACCA CGGTTAGGGG CAACCGATCG TCTTCCGCAA
CCTTACGTGC AAGCCGATGT TATTGTGATG AACGCCCGGC TTACCGATCA GGCCGAACAA
ACGGCCGCTC AAGCGTTTAT GCGTTTTATG ATTGAGCCAT CTAGCCAACG GGTGTTGCTG
GCTGTCAACC GCCAGCCTAC CCAACTTGCG CTTCTGCTTA GTGATACCGA TCTCGATGAT
CAAATCCAGT TGGCTGCGGC ACGAGCGTTT CGGGCACAGG CTCAGCACGG TTTGCCGATG
CCATCTGATC GACTTGCCAA CGAATTCGTC TGGACAACCC TGGCCGATAT GCATCTCAGT
GCGGTGCGTG GGTTGCTTAC TCCTGAACAG GCAGTCTCAC AGGCCGTCGA GATCTTGCAT
AGTCGCTTCA CACCCTAG
 
Protein sequence
MQKRFVFRLL LLGWLLAFTA CAASPASVPP AQAGRTVLRL WHAWPSTEGR VLQTLVEQFN 
QAHPQWQIVV QARPAVSLPA DLMTAVQEGG GPHLAIVQSH TLGTLVDAGV VRPLDDVIAA
GELSSLLRAA VGSAQVTVAG QPTLVGVPIS FDTLALYYNR ANVLQPPTTI EELLQTGRAL
TDRNRVPPVW GLAYNLSLDR TIGYLYAFGG RVFDENGTLV LGDSGREGTE RWLAWLGQLY
RDEQLLATLD GVVVDRVLQS RDAIMAIDWA HAQAEYRAIW NDQLGVVPLP RLGATDRLPQ
PYVQADVIVM NARLTDQAEQ TAAQAFMRFM IEPSSQRVLL AVNRQPTQLA LLLSDTDLDD
QIQLAAARAF RAQAQHGLPM PSDRLANEFV WTTLADMHLS AVRGLLTPEQ AVSQAVEILH
SRFTP