Gene Athe_0597 details


Gene Information

Locus tag: Athe_0597
Symbol:
ID: 7406938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 673118
End bp: 674620
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 37%
IMG OID: 643714980
Product: extracellular solute-binding protein family 1
Protein accession: YP_002572496
Protein GI: 222528614
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
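
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 673118
END_BP = 674620
GENE_LENGTH_BP = 1503
PROTEIN_LENGTH_AA = 500


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is assumed to be a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1503
print(expected_protein_length(GENE_LENGTH_BP))  # 500
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.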


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATCTCA AAAAGTTTTT CGTAGTCATG CTTGTTGTGA CTTTTGTACT CACAAGTGTA 
ATTGGTGTAG TGACGGGATT TGGTGCATCT TCTTCAAAAC TTCCTTATGT TAAGCTTACA
TGGTATGTCA TTGGAACACC TCAAAAAGAC TGGGATTTAA TCAATCAAAA AGTAAATGAG
TACATCAAAC CAAAGCTTAA TGCTGAAATC AAAATGACAA TGTTTGACTG GGGCGAATAT
AATGATAAGC TCCAGACAAA GATTGCAGCA AGTGAGCCAT TTGATATCTG TTTTACAGCA
ATCTGGACAA ACAACTACAG AACTAACGTT GCAAAAGGTG CATTTTTGCC GCTCAATAAG
CCCGGAAACG ACCTTCTTTC TAAGTATGCA CCAAAGACAA AGAAGCTTCT TGGCGATGAT
TTCATAAAAG GTGCATCCAT TAACGGAATT CTGTATGCAA TTCCAGCAAA CAAGGAGAAG
GCTCATAACT GGGGATTCAT TGTTAGAATG GACTTGGTAA AGAAGTATAA ATTAGAAGAC
ATGTTTAAAA AGGTTAAGAA ATTAGAAGAT TTAGAGCCAT ATCTTAAGGT AATCAAACAA
AAAGAGCCAG GTGTATATCC ACTTGGAGCA TATGCTGGTG AGTCGCCAAG ATTCCTTTTA
GACTGGGACA AGGTTGTAGA CGATGATGTT CCTGTATCAC TTTATCCGAA TAATAAGAGC
ACAAAGATTG TTAATGAACT TGAACAGCCA AATACAAAAG CTCTCTTTAA GACAGTAAGA
AAATATTACT TGGCAGGTTA TATCAGAAAA GATGCAGCAA GTGTTACAGA CTGGATGTCT
GATTTAAAAG CTGGTAAAGT GTTTGTAATG CCCCAGTCGC TCAAGCCAGG AAAAGATGCT
GAGATGTCTA TTTCAACAGG TTATGAATGG AAACAGATAG ATATAACACC ACCTGTTATG
TCAACAAGAG AATGTATAGG TTCTATGCAG GCAATCAACG CAAAGTCAAA GAATCCAGAA
AGAGCTTTAA TGTTCTTAGA GCTTTTCAAC ACAGACAAGT ATCTTAACAA CCTTGTAAAC
TTTGGTATTG AAGGTCAGCA CTATGTATTT AAAGATAAAG CAAGAGGAAT CATAGCTCCA
GGACCAAAGG CAAAAGACTA TAGCCCAGGT CTTGGCTGGA TGTTTGGAAA TCAATTTATA
AACTATATTT ATGAAAATGA AGATCCTAAC AAATGGAAAA ACTTTGAAGA GTATAACAAG
AAGGCACTGC CTCTTCTTAG CCTTGGATTC AACTTTGATG ACTCAAAAGT AAAAACACAG
GTTGCAGCAT GCAAGAGCGT ATGGAAGCAG TATATTCCAA TGCTTGAGAC AGGGAGTGTA
GACCCTGATA AATACATTCC ACAGGCAATT GACAAGTTCA AGAAAGCAGG TGTTGACATT
ATTATAAAAG AGGCACAGAA GCAGTATGAT GAATTTCTGA AGAAGACAGG AAGAAAGAAA
TAA
 
Protein sequence
MNLKKFFVVM LVVTFVLTSV IGVVTGFGAS SSKLPYVKLT WYVIGTPQKD WDLINQKVNE 
YIKPKLNAEI KMTMFDWGEY NDKLQTKIAA SEPFDICFTA IWTNNYRTNV AKGAFLPLNK
PGNDLLSKYA PKTKKLLGDD FIKGASINGI LYAIPANKEK AHNWGFIVRM DLVKKYKLED
MFKKVKKLED LEPYLKVIKQ KEPGVYPLGA YAGESPRFLL DWDKVVDDDV PVSLYPNNKS
TKIVNELEQP NTKALFKTVR KYYLAGYIRK DAASVTDWMS DLKAGKVFVM PQSLKPGKDA
EMSISTGYEW KQIDITPPVM STRECIGSMQ AINAKSKNPE RALMFLELFN TDKYLNNLVN
FGIEGQHYVF KDKARGIIAP GPKAKDYSPG LGWMFGNQFI NYIYENEDPN KWKNFEEYNK
KALPLLSLGF NFDDSKVKTQ VAACKSVWKQ YIPMLETGSV DPDKYIPQAI DKFKKAGVDI
IIKEAQKQYD EFLKKTGRKK
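
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a hedged example, not the pipeline that produced this record: the `translate` helper and its `first_is_start` parameter are hypothetical names, and the codon-to-amino-acid mapping is the standard one, which table 11 shares for elongation codons.

```python
BASES = "TCAG"
# Standard amino acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, first_is_start=True):
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    If first_is_start is True, the first codon is assumed to be an
    initiation codon and is emitted as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        if pos == 0 and first_is_start:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("GTGAATCTCA AAAAGTTTTT CGTAGTCATG"))  # MNLKKFFVVM
```

The result matches the first 10 residues of the protein sequence.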