Gene Athe_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2378 
Symbol 
ID7407797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2529621 
End bp2531081 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content37% 
IMG OID643716741 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002574220 
Protein GI222530338 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATG AAGCTTCTCT ATTTTACAAA AGTTTTGCTG AAATACCATG TTATCAGTTG 
ATGGAAAAAT CTACAGGAGT AAAATTTGTT TTTAAACATC CACCTCTTGC TACATCTGCT
GCACAAGACC AGTTCAATTT AATGATAGCT TCTCGACAGT TAACTGACAT AATAGAATGG
GGATGGGATG GCTATCCTGG AGGTCCTGAA AAAGCCATCA TTGACAAGGT AATTGTGCCG
TTGAATGATT ACATACCCAA GTATGCACCA AATTTAAAAA GACTTCTTGA TAAAAACCCT
CAAATCAAAA GGATGGTAAG CTCAATTAGT GGAAAAATCT ATGGTTTCCC TGCTTTAAAA
GAAACGCCAA TAGATGCATA TTACGGTCCT CAGGTTAGAA GGGATTGGCT TGAAAAACTT
AAAATTGCTC CGCCAGAGAC AGTAGATGAA TGGTATAAAA TGTTGAAGGC GTTTAAAACC
AGAGACCCGA ACGGAAATGG AAAAGCAGAC GAAAGACCTT TTTCAATGTT AAGAGGTGCT
GCAAATCCGA GAGCTGTTTT TGACTATTGC AGCTTTTTAG TTGGGGCGTG GGGAATAAAA
ACAGACTTCT TCCAAGTAAA TGGAAGAGTT AAATATGGAG CAATTGAACC TCAGTTTAAA
GAGTTTATGA ATACTTTGGC AAAATGGTGG AAAGATGGGT TGATTGATCC AGATATACTG
ACGATGAATC AACAAACAAT TCAAGCAAAT GTTTTGAGCG ACAAAATTGG AGCATATCTG
GGGATAATTT CGGGGCATAT GGGTGCCTTT TTAGCAGCAA AGAAAGGGAC AGACTTTGAT
TTAATAGGTG TGAAATATCC AGTACTGAAA AAAGGTGAAA TAGCACGAAT CGGTCAAAAT
GAGTATCCTT TTACGGGAAG AGCAGCAGCA ATTACAACCA GCTGTAAGAA CATAGAGGCA
GCATGCCGTG CGCTCGACTG GGCTTATAGC AAGGATGGGT ATATGGCTTT TAATTTTGGT
GTAAAAGGAA AATCTTATAT GATTAAAAAC GGCCGACCAA TTTATACCGA TGAAATTCTT
TACAACCCGC AAGGATTAGG GCCAAAACAG GCATTAGCTA AATATGCGCT GATTTATGGT
CCATTTGTCC AATCCAGGGA GTATACATTA CAAATCAACT TGCAGTTGCC TCAACAAAAA
GAAGCTTCAA AGAATTGGGG TATGGTTAAA AATGATATTG CATTAGGTCC AGTTTCGCTT
TTCTTAACCC CAGAAGAGAC TAAAGAAATT GCAAATATTA TGAATACCAT AAATACTTAT
TATGATGAGA TGTTTTTGAA GATGATGACA GGCAAGTATA ATAATTATGA TGCTTTTGTA
AAAACTCTAA AGAAAATGAA GATAGAAGAA GCTATAAAGA TTTATCAGAA TGCTTATAAC
AGATATATGC AAAGAAAATG A
 
Protein sequence
MSNEASLFYK SFAEIPCYQL MEKSTGVKFV FKHPPLATSA AQDQFNLMIA SRQLTDIIEW 
GWDGYPGGPE KAIIDKVIVP LNDYIPKYAP NLKRLLDKNP QIKRMVSSIS GKIYGFPALK
ETPIDAYYGP QVRRDWLEKL KIAPPETVDE WYKMLKAFKT RDPNGNGKAD ERPFSMLRGA
ANPRAVFDYC SFLVGAWGIK TDFFQVNGRV KYGAIEPQFK EFMNTLAKWW KDGLIDPDIL
TMNQQTIQAN VLSDKIGAYL GIISGHMGAF LAAKKGTDFD LIGVKYPVLK KGEIARIGQN
EYPFTGRAAA ITTSCKNIEA ACRALDWAYS KDGYMAFNFG VKGKSYMIKN GRPIYTDEIL
YNPQGLGPKQ ALAKYALIYG PFVQSREYTL QINLQLPQQK EASKNWGMVK NDIALGPVSL
FLTPEETKEI ANIMNTINTY YDEMFLKMMT GKYNNYDAFV KTLKKMKIEE AIKIYQNAYN
RYMQRK