Gene Cagg_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0089 
Symbol 
ID7266827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp124799 
End bp127654 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content58% 
IMG OID643564962 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002461478 
Protein GI219847045 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTGGC ACCGGCAGAT CACAACCGCT ATCCGCCTCC TCATGGTACT TTGGCTGGCG 
CTCGCTAGCC TGCTCTCCGG CGCACTGCCG GTGCGAGCCG AGGGCGGTAA CACGCCGGTC
ATTGAAGTGC GCGCCGATAT GCTCGGTCTG CCGGTGGGGA CCCATCCGTT GGCCGCTAGT
TATCCCAAGC CGAGTGTCAT CATGCGTGCC GGCGATCAGG TCTCGTTTAC CGTCACCGTG
CCGCAAGCCG GCGCGTATGC GTTAGCGATT GACGCCGCCG TGCCGGAAGC GGTCACCTTG
ATCCCGCCTG AACTGCTCGT GCAGTTCGAG GGCCAAACTG AGCGCTGGCG GCTGATTGTG
CCGCTTTTCT ATCACGATAC TGCCGATCAG TTCCCGCTTG ACCGCTACGG CAACGAAACC
CTGATCCCGC AGGCCCGGTT GGCCCGCTGG AGCCGCGTCT TCTTGCGCGA CCCCAACTTT
AGCCGGCCGT ACCCGATCGC GCTCACGCTG CCGGCGGGCC ACCAGCGCAT TTCACTCAGC
CTCGGCGAAG GCGAGCTGGC GCTGGGCAGC TTCTATCTGT TGCCGGCGAT CGACTCGTAC
CCATCTTACG CTGTCGACCA CGCTAGCCGT CAAGCACCGA GCAGCAGCGG AGTGCTGATC
GAACTCGAAG CCGAGCGGCC AAGTTTTAAG AATGATACCA GTGTGCGCCC GATCAGCCGC
CGTAATCTGG AAGTAACGCC GTATGATCCG TACCATCTGC GGATGAATGC CCTCGGCGGC
GAGAGCTGGA GCCGCAGCGG TACTGCGGTC TACTACGAGT TTACCGTGCC GCAGAGCGGC
TGGTATGCGA TCACCCTGCG CGCCATCCAA GATTACAAGA ATAATTTTAC CGTGTATCGC
CGCATTCTAC TTGATGATCA CGTGCTCTTT GCTGAGTTGA ATGCAGTGCC GTTTGGCTAC
ACTACGAATT GGCGCAACTA CACGCTGGGC GGATCCACCC CATACCAGAT TTATCTTGAG
CAAGGTCGGC ACGTCTTGGG CATCGAAGCC ACAACCGCGC CTTATCACGA CTCAATCGAG
CGGGTGCGAG TCGGTTTAAA AGCGATTGCC GATCTGGCCT TCGCCATTAA GCGCCTGACC
GGTAACCAGA TCGATATTTA CAAAGAGTGG GAGATCGCCG ATTACATTCC CGATATTCGC
GAACGACTGG CGGCGCTGGT CAGCCAACTG CGCGCCGACC AACAGGCCTT GCTGGCCGTC
AACCAGACCC CGGCCTCACC AGAGGTGCTC GCTTACCAGA TGGCGATCGA CAACCTCGAG
GTGCTGGCCC AAGACCCCAA CCGGATTCCA ACCCGTATGA GCCGGTTATC AGAAGGGGCC
GGTTCGGCGG CGCACCTGCT CGGTAGCATT TTGCCCTCGT TGCAGAGCCA ACCGCTGGCG
CTTGATAAAA TCTATATCCA TTCGCCGGAC GTCATTCCTC CTGAACCGAA CATCACCACC
GGCGCAATCG TGACCGATTG GTTCCAGCGC TTCCTCGGCT CATTCCGCAG CAATCCCTAC
CAGAGCATCG GCGCGGCGCC GGATGAGCTG GAAGTGTGGG TGAATCGCCC GCAACAATAT
GTCAATTTAC TTCAGCGCAT GACCGACGAG CGCTTCACAC CACAAAGCGG GATCAAGGTC
AAGTTTTCGA TCATGCCCAA CGAATCGAAG CTGATCTTGG CCTGCGCCGC CGGCACCCAA
CCCGATATTG CGCTCGGTGT CAGCACCAAC ATCCCGTATG AGCTGGCGAT CCGCAATGCG
CTCTACGATT TGCGCAGCTT CCCTGACTTT GACCACTTCA TCCGCATCTA TGCCCCCGGC
TCGCTCTTGA GCTACATCAT TAACGACTCG GTATACGCCA TTCCTGAAAC GCAAGATTTT
TGGGTGACGT TCTATCGCAA AGATATTCTG GAAACACTGA ATTTGCCGGT GCCGCAAACG
TGGAATGAAG TATTAGAGAT TTTGCCCGAA TTACAGCGTT TCGGCATGAA CTACAACACG
CCGCTCTCAA GCGGCGGCGG CATGAAGGGC TATCTGGTCA CGGCCCCTTA CCTCTTCAAC
TACGGCGCGT CGCTCTACAC ACCCGATGGC ATGTCGGGCT TGGGGTCGGA CGAAGCGATT
CAGGCGATCC GGTTTATGGC CGAGAGTTTC ACCATCTACG GCATGCCGCT GACCACGGCC
AGTTTCTACG ACAGTTTTCG CGCCGGTGAA ACGCCGGTCG GCATCTCGAA CTTTGAAACC
TACCTCAAAT TGCTCACTGC GGCGCCGGAG ATCGATGGGT TGTGGGATAT CGCCCTCTAC
CCAGCCACCG TCTTGCCTGA TGGCAGGCAG TTGCGCTATG CGACCGGCTC GGCGCAGGCG
GCGATGATGT TTGCCAACAC CGATAAGCCG CAACAGGGCT GGGCCTTCCT CAAGTGGTGG
ATGTCAACCG AGACTCAGGT CGCCTTCCAA CAAGAATTAA TTATGAATTT TGGGTTGGAA
TATTTGTGGA ATTCAGCGAA CCTCGAGGCG TTTCGCTTTA CCCCGATTTC GGCGACGCAC
CGCGACGTGA TTTTGCAGCA GTGGCAATGG CTGCAAGAGC CGATCAAGCT GCCGGGCAGC
TATATGCAAG AGCGCGAGTT GAGCAACGTC TGGAACCGGA TCGTCTTTCA GGGAGCTAAC
CCGCGCGTCG CGATCGACAA TGCGGTGACG GTGATCAACC GCGAGATCGT GCGCAAGATG
ACCGAGTTCG GCTACATCCG CAATGGCGAG CGGGTGCGCA CGATGACGAT CCCGACTATC
GAGACGGTGA AGGAGTGGAT GGCGCATGCA AACTGA
 
Protein sequence
MGWHRQITTA IRLLMVLWLA LASLLSGALP VRAEGGNTPV IEVRADMLGL PVGTHPLAAS 
YPKPSVIMRA GDQVSFTVTV PQAGAYALAI DAAVPEAVTL IPPELLVQFE GQTERWRLIV
PLFYHDTADQ FPLDRYGNET LIPQARLARW SRVFLRDPNF SRPYPIALTL PAGHQRISLS
LGEGELALGS FYLLPAIDSY PSYAVDHASR QAPSSSGVLI ELEAERPSFK NDTSVRPISR
RNLEVTPYDP YHLRMNALGG ESWSRSGTAV YYEFTVPQSG WYAITLRAIQ DYKNNFTVYR
RILLDDHVLF AELNAVPFGY TTNWRNYTLG GSTPYQIYLE QGRHVLGIEA TTAPYHDSIE
RVRVGLKAIA DLAFAIKRLT GNQIDIYKEW EIADYIPDIR ERLAALVSQL RADQQALLAV
NQTPASPEVL AYQMAIDNLE VLAQDPNRIP TRMSRLSEGA GSAAHLLGSI LPSLQSQPLA
LDKIYIHSPD VIPPEPNITT GAIVTDWFQR FLGSFRSNPY QSIGAAPDEL EVWVNRPQQY
VNLLQRMTDE RFTPQSGIKV KFSIMPNESK LILACAAGTQ PDIALGVSTN IPYELAIRNA
LYDLRSFPDF DHFIRIYAPG SLLSYIINDS VYAIPETQDF WVTFYRKDIL ETLNLPVPQT
WNEVLEILPE LQRFGMNYNT PLSSGGGMKG YLVTAPYLFN YGASLYTPDG MSGLGSDEAI
QAIRFMAESF TIYGMPLTTA SFYDSFRAGE TPVGISNFET YLKLLTAAPE IDGLWDIALY
PATVLPDGRQ LRYATGSAQA AMMFANTDKP QQGWAFLKWW MSTETQVAFQ QELIMNFGLE
YLWNSANLEA FRFTPISATH RDVILQQWQW LQEPIKLPGS YMQERELSNV WNRIVFQGAN
PRVAIDNAVT VINREIVRKM TEFGYIRNGE RVRTMTIPTI ETVKEWMAHA N