Gene Athe_0849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0849 
Symbol 
ID7407424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp942006 
End bp943682 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content39% 
IMG OID643715227 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002572737 
Protein GI222528855 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.206456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAATTTT TAAGAAAAAT TTCGATTGTG GTAGCGCTTG TTTTCATTAT TTCCGCGGTG 
CTGGGTGGGA TTGTACCTGT ATCTTCCCAG AAGGTTGAAG GTGCATCAAA AAAGGTTGTA
ACTTTTACAA TGTTTAGTGC AGATGCGACA GTACAGTATC ACCCAGATAT TTTCAGTACT
GCTATTGGGC AAGAGATTAC AAAAAGGACA GGCGTAAGAT TGAAAATCGA ACACTTTGTA
GGAATGGACC AGGCAACAAA GATATCACTT ATGCTTGCAT CTGGTGATTT ACCAGACTTG
GTTTATGGCA GTGGTGAGCA CAAACAATTT ATTCAGAACA AAGCTTTAGT TCCGCTTGAT
AACTACATTC AAAAATATGG TCAGTGGACA AAGAAGGCGT ACTCTCAGGC AGATTTGAGG
AAACTTCGCC AAGCTGATGG ACATATTTAT TTCTTGAGCT ACACAAGAGG TGAAGTGTCA
CCAAGTGCAA GTGGTGAAGG TTTATATGTA ATGATTGACA TGTTACAAAA GAATAACTGG
CCAAGATTAA AGTACTGGGA AGATTTGATG CCAATGTTAA GAAATTATGT TAAGAAATAT
CCAAAGTACA AAGGTATGCC TGTAATAGGC ATGTCAGCAA TTACAGAAGG CGCAAGATTC
TATGTAATTC AAGATCCTGC AACAGGTTTG AATGGTCTTA TAGCAGATAC TGTACAAGTT
GATCCGAAAA CATATAAAGC AAGCTATGAC CCTGCAGGAA TAGGAATGTA CAAAGCTTAC
AAGGCACTAA ATGCTCTGTG GAATGAAGGT TTGTTTGATA AAGAAGCATT TGTTCAGACA
TATGACCAAT GGGCTGCGAA AGTTGCTCAA GGTAGAGTTG TGACAAGCTG GGGAAGGTCA
TGGCACTTTA ACACAGCATT CAATACACTA CGAAAGAATG GCGAAGATGA TAGAATCCTT
GTTCCATTTG GCATTGTATT TAAAGGGGTT AAAAAATCAA GATATGTGAT GCTTCAGTCA
ATTGGAACAA GAGATGGCAT AAGCATTACA AAGAAGTGTA AAGATCCTGT AAGAGCATTC
CAGTTCTTAG ACCAAATGCT CAATCCAGAT ATTCAGAAAC TTATGTTCTG GGGTATTAAA
GGAAGAGATT ATCTTGTTGA CAATAAAGGT AAGATGTACA GAACACAAGC TATGATTGAC
AAAGCAAGAG ACCCTGTTTA CCAGAAACAA GAAGGGCTTG GCTACTGGAA CATCTGGCCA
AGATGGCAGC TCAAGCTTCC AGATGGAAAT TATGTAAAAC CTGAACTTGA TCCAGATATT
GCATATATGC AATGGGCACC AGCACAAAAG AAAGTGCTTG AAGCATACAA AGCAAAAACA
TTTGTTGAAC CACCATTTGC TGATGAACCT GAATGTCCAC CTTGGGGATA TGCATGGGAG
ATCAACGTTC CACCTGAAAA GCAAAAAGAA ATCCAGGTTC CACTCAACAT TGCTAACGAC
CTTGCAAGGA AGTATATACC AATGCTTATA ATGGCTCCAA AGGGCAAGTA TGATGAGGTA
TGGAACAAAT ACAAAGCAGA GGTTAGATCA AAGATTAACA CAAAACCAAT TGAAGAGTTC
TATACACAGG AAATGAGACA GAGAATGGCA GATTGGTACG GGATTAAAGT TAAGTAA
 
Protein sequence
MKFLRKISIV VALVFIISAV LGGIVPVSSQ KVEGASKKVV TFTMFSADAT VQYHPDIFST 
AIGQEITKRT GVRLKIEHFV GMDQATKISL MLASGDLPDL VYGSGEHKQF IQNKALVPLD
NYIQKYGQWT KKAYSQADLR KLRQADGHIY FLSYTRGEVS PSASGEGLYV MIDMLQKNNW
PRLKYWEDLM PMLRNYVKKY PKYKGMPVIG MSAITEGARF YVIQDPATGL NGLIADTVQV
DPKTYKASYD PAGIGMYKAY KALNALWNEG LFDKEAFVQT YDQWAAKVAQ GRVVTSWGRS
WHFNTAFNTL RKNGEDDRIL VPFGIVFKGV KKSRYVMLQS IGTRDGISIT KKCKDPVRAF
QFLDQMLNPD IQKLMFWGIK GRDYLVDNKG KMYRTQAMID KARDPVYQKQ EGLGYWNIWP
RWQLKLPDGN YVKPELDPDI AYMQWAPAQK KVLEAYKAKT FVEPPFADEP ECPPWGYAWE
INVPPEKQKE IQVPLNIAND LARKYIPMLI MAPKGKYDEV WNKYKAEVRS KINTKPIEEF
YTQEMRQRMA DWYGIKVK