Gene Athe_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2032 
Symbol 
ID7408245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2144650 
End bp2146245 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content36% 
IMG OID643716399 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002573882 
Protein GI222530000 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGTA AAAAGAATTT AAAAGTCTTG AGGTTCTTAA TTATTTTTGT AGTGTTTGTC 
CTTGTGGTTG GTTCGTTAGC AATAAATCCT AAAGAGAGCT TGGCAGCTAC AAAACCCACT
ATCACATATT TTGTCCAGAT GGATGCAAAA GTTGCAGTTT CGTATGATAA CTATAGTAAG
ATTGCGGCAT ATCAACTTCT TATGAAAAAG CTAAATGTGA ATATTCAATT TATCCATCCT
CCAATGGGAG GTACTGCTGC CCAAGACCAG CTGAATTTAA TGATTGCATC CAAAAAACTT
CCTGATATCA TTTATTGGAA TTGGGTGGAT AGTTATCCAG GCGGTCCTGT AAAAGCTTTA
CAAGACAAGG TAATTATAAG ACTTAATGAA TATGTGGACA AATATGCTCC AAATTTTAAA
TCATATTTGT CAAAACATCC TGATGTGAAA AAGATGATAG TGACAGACGA TGGTGATTTG
TATTGTTTTC CCTATTTAAG AGAGGACCCT GAGATTCAAG GTACTTTTTA TGGACCAATT
GTAAGAAAAG ATTGGTTGAA CAAATTAAAA ATTAATCCAC CTGAAACTGT GGATGAATGG
TACAAGATGT TAAAAGCTTT TAAAGTTAAC GACCTCAATG GGAATGGTAA AAATGATGAA
AGGCCGTTTT CTATTAGTTT AGGCGGTGCA ACAAGTCCAA GACGAGCTTT TGATTATTGC
AGCTTTTTAG TAGGTGCGTG GGGTATCAAA ACTGATTTCT TTGTCGAGAA TAATAAGATT
CAATACGGTC CTTTAAAGCC TCAATATACA GATTTTATAA AGACACTTCA AAAGTGGTGG
AAGGAAGGGT TAATTGACCC AGATGTACTT ACTATGAACA GAGATATAAT CAGAGCAAAT
ATTCAGAACG ATTTAATAGG TTCATTTTTA GGTTTAATTG GTGGAGACTT AGCTTTCTTT
GTAAATCTAA AGAAAGATTT AATGGGTGTT AAGTATCCTG TTCTTAAGAA AGGGCAAAAA
CCAGAGTTTA GCCATCGTGA ACCCCAGTTT GCAAAAAGTG GAGCTGCTAT TACCACGTCC
TGTAAAAATA TTCCGCTTGC AATGAAGGTT CTTGATTGGG GATGGAGTAA GGAAGGATTT
ATGGCACTCA ATTTTGGCGT GTTAGGAAAA AGTTATGTTA TAAAAGATGG ACGTCCGGTG
TACACTGATG AGGTTATGAA CAACCCACAG CTTGATAGGC CTTCTGCACT TGCAAGATAT
GCATGTGCTT CGTTTGGGGG GCCATTTATT CAGGCAAAAG AAAATGCTCT TCAGATAGGC
TTAGGGCTCC CACAACAAAA AGAAGCAAGT GAGAACTGGA GATATGCATC TAACAAAAAA
CTTTTGCCTA TTCTTTCATT TACATCTGAT GAAGCAAAGA AACTGGCAGA TATTATGAAT
GTTATTAATA CTTATTATGA TGAGATGTTT GTAAGACTTA TGACAGGTAA ACTCAACGAT
GTTGAACAGC TGAGAAAAGG ATTGAAAAGA ATGAGGATAG ACGAAGCAAT AAAGATATAT
CAGCAAGCTT ACAACCGTTA CATCAACAGA AAGTAA
 
Protein sequence
MFGKKNLKVL RFLIIFVVFV LVVGSLAINP KESLAATKPT ITYFVQMDAK VAVSYDNYSK 
IAAYQLLMKK LNVNIQFIHP PMGGTAAQDQ LNLMIASKKL PDIIYWNWVD SYPGGPVKAL
QDKVIIRLNE YVDKYAPNFK SYLSKHPDVK KMIVTDDGDL YCFPYLREDP EIQGTFYGPI
VRKDWLNKLK INPPETVDEW YKMLKAFKVN DLNGNGKNDE RPFSISLGGA TSPRRAFDYC
SFLVGAWGIK TDFFVENNKI QYGPLKPQYT DFIKTLQKWW KEGLIDPDVL TMNRDIIRAN
IQNDLIGSFL GLIGGDLAFF VNLKKDLMGV KYPVLKKGQK PEFSHREPQF AKSGAAITTS
CKNIPLAMKV LDWGWSKEGF MALNFGVLGK SYVIKDGRPV YTDEVMNNPQ LDRPSALARY
ACASFGGPFI QAKENALQIG LGLPQQKEAS ENWRYASNKK LLPILSFTSD EAKKLADIMN
VINTYYDEMF VRLMTGKLND VEQLRKGLKR MRIDEAIKIY QQAYNRYINR K