Gene Athe_0399 details


Gene Information

Locus tag: Athe_0399
Symbol:
ID: 7409334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 458758
End bp: 459984
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 39%
IMG OID: 643714788
Product: extracellular solute-binding protein family 1
Protein accession: YP_002572306
Protein GI: 222528424
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
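
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon (the gene sequence in this record ends in TAA); the function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
START_BP = 458758
END_BP = 459984


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.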


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000195685
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TTCTAACAAT TTTATTAACG TTAATTTTCC TACTTTCCTT GGTTGCAGGA 
ATGGGTGCTG CGCAAAAAGA TGTTTTTGCC ACTTCCAAGA TAACTTTAAA GCTGGGTGCG
TGGGCATCTT CTCCTGCTGA GAAGAAGATT GTTCAAAACC AAATCGCAGC CTTCAAAAAG
CTCTATCCCA ATGTCGATAT TAAATTAGTT GAAATTGTCG GTGATTATAA TCAAAAAATG
CAGCTTCTCA TGGCATCTAA AACAGAACCA GATATCTTTT ACATGGATTC AATGCCAGCT
TGGCAGTACA TTGCAAAGAA TGTCTTAGAG CCGTTTGACA GCTGGATGAA AAAGTACAAT
GTCAAAACAA TTGGTTATGA GTCATCACTT CTCCAGCCAT TCATATACAA AGGAAAAGTG
TATGGACTTC CAAAAGACTA CAATACATTA GTTTTGTTCT ACAACAAAGA GATGTTCAAA
CAAGCAGGTC TTACGCAGCC ACCAAAGACA TGGCAGGAGT TGAAAGAGTA TGCTAAGAAA
CTTACAACAG ACAAGGTTGT AGGTCTTACA ATGAACCTTG AGCTTGCAAG AATTCAACCT
TTTGCATACC AAAACGGTGG TAAAGTATTT GACGGTAGTA AGCCAGTCTT TACCGACCCG
AAAGCCTTGG AAGGCTTAAA ATTTGCACTT GACCTTTTCA AAGAAGGAAT ATGCAAAACA
CCAAAAGATT TAGGTGCTGG CTGGGTTGGG GATGCATTTG CTGACAAGAA AGCTGCTATG
ACAATTGAAG GCGGCTGGAT GATTCCATTC TTAAACGACA GAAAGATACC AAAAGATCAA
TATGGAATTG CAGAACTTCC TGCAGGACCT GCTGGTAAGT CAACAATGGC ATTCACCGTT
GCATATGTAA TGAGTAAAAA TTCTAAGCAC AAACCTGAAG CGTTCAAACT TATAAGATTC
TTAACTGGAG AAGGCGGACA AAAGTATGTG GTTGAAGCAG GCTTAGCACT TCCTTCATTA
AAGAGCGCAG GTGTAAACTT TGCTAAAACT TATCCAGAGA GAAAAGCGCT TGTTGATGGT
GCAAAATATG CACAGGTCTA CTTCTATGGT CTGGATGGCA CAAAAGTTGT GGATGTCTTC
AACAAAGCAT TTGAAGACTA TGTAATTGGC AAAAAGTATG ACCTTAAGAA GAACATTGAG
GAAAGAGTAA AGCAAATCAT GAAGTAA
 
Protein sequence
MKKFLTILLT LIFLLSLVAG MGAAQKDVFA TSKITLKLGA WASSPAEKKI VQNQIAAFKK 
LYPNVDIKLV EIVGDYNQKM QLLMASKTEP DIFYMDSMPA WQYIAKNVLE PFDSWMKKYN
VKTIGYESSL LQPFIYKGKV YGLPKDYNTL VLFYNKEMFK QAGLTQPPKT WQELKEYAKK
LTTDKVVGLT MNLELARIQP FAYQNGGKVF DGSKPVFTDP KALEGLKFAL DLFKEGICKT
PKDLGAGWVG DAFADKKAAM TIEGGWMIPF LNDRKIPKDQ YGIAELPAGP AGKSTMAFTV
AYVMSKNSKH KPEAFKLIRF LTGEGGQKYV VEAGLALPSL KSAGVNFAKT YPERKALVDG
AKYAQVYFYG LDGTKVVDVF NKAFEDYVIG KKYDLKKNIE ERVKQIMK
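
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon map from the standard 64-letter amino acid string, which translation table 11 shares with the standard code for codon-to-residue assignments (table 11 differs in which codons may act as starts; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard amino
# acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
first_block = "ATGAAAAAAT TTCTAACAAT TTTATTAACG"
print(translate(first_block))
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence above, and `gc_fraction` can be compared with the GC content field.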