Gene Athe_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2094 
Symbol 
ID7408803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2219790 
End bp2222324 
Gene Length2535 bp 
Protein Length844 aa 
Translation table11 
GC content33% 
IMG OID643716461 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002573944 
Protein GI222530062 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000180475 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGA GTATAAAATT GAATTCTCTT TTTGATGAAA TCAACAACCC TCCAGCTACT 
GCCAGTATTA TTCATTGGTG GATTTTTTCT GATGAGATGA ACGAGAACAG AATAAATGCT
GAGCTTGATT ATATTTCAAA TCTTGGCTTT AAGCAAGTAT TAATTGCAGT AGGACACAAT
GTTTCGCCTA AATATTTGAC ACATGGTTGG TTTGAAATGG TAAAATTTGC AGTTCTCCAA
GCTAAAAAAA GAGGAGTTAA AGTGTGGATT GCCGATGAAG GGACATATCC AAGTGGCTTT
GCTGGCGAAA CTTTTAATAA GAAGTATCCT CACAAAAGGA TGAAGGCTAT TATTGTTGAG
AAGGAGTTTA TTATTGAGGG TAATTTATGT GAAGTTGAAC CTCACTCTGG TACAATTGGG
ATTTTGGCCA AAGACATGAA CCAGAATAAA TACTTTGCTT TTGAAAAGCT TGAATTTAGT
AGCGGATTTT TATACTTGCC CTATCATTCG ACTTGGCAAA TAAAAGTAAT ATCTTCAGCT
TACAGGACAT CTCCAACAAG ATACGTTCAC CATCCAACAG GTGCAAAAGA TACTACATTT
TCACTTTGTG ATTATCTTGA CTATGAAGCT GTCAATCTAT TCATAAGTGA GGTATATGAA
AAATATAAAG CTTATATGGG AAATGAATTT GGAAAGACAA TAATTGGATT TTTCGCTGAC
GAACCTGATT ATTCTATTTC TGGACTACCA TATACGGATA ATATATTTGA TATATTTTAC
AATGAACAAG GATACGACGT TAAAAAGTAC ATACCGTATT TCTTTAAAGA GCAATTAGAT
GAAAAAATAA AAAGAGTAAA AGCAGATTAC TGGGATGTAT GGAGCAATAT TTTTACAAAT
ACTTTCTTTA AGCAGATCTA CAAATGGTGT GAGGCAAATG GCCTCAAATT TGTAGTACAT
CTAAATCATG AAGATATGAT AGAACACCTT ACCAAATCTG AAGGACAGTT CTTTTCGCAT
ATGAAGTATG TTCATATTCC AGCAATTGAT GTAATTTGGA GACAAATCTG GTATGACAAA
AAAGCAATAT TCCCTAAATA CGCTTCTTCT GTTTCTCATA TTAAAAATAT TGCTCAGACC
TTTTCAGAGA GTTTTGCAGT ATATGGACAA GGTATATCTG TTGAGCAAAT CAAATGGGTA
GTTGATTACC AGTTTGCAAT GGACATAAAT CTATTTTTGA CCTCAATCTT CAAGTATCTT
TATGACCATC CGCAAAATTA TTTCTTTCCA GAGGTAATTA AGTATATTAA TACCATTTCA
TATCTTCTCT ATGTAAGCAC CCCTTGTACA AAGGTTCTGG TTTACTTTCC TACACCGGAT
CTGTGGGCAG GTGAAAATAT GTCTGCTTCA AAAGCAATGG AAATTGGCAA TGCACTTTTA
GAGAACCAGA TTGATTTTGA TTTTTTTGAC CATTCTCTTT TAGAATATCT GGAAATTAAA
AACCATAGAA TATACGCTAA CAATAGAAAA GAATACGACA TTGTTATTCT TCCGCCTATA
AAGTATTTGC CACAAGATCT GTTCAGATTT TTAAAGCTTT TCTCAAGCAA AGGAGGGAAG
ATTATTTTCT TCGAGAACTC TCCTTTGTTT GTTTATAACA AAACCTTTAC ATCGTTTTTC
CACTTTGTAG ATAGAGAAAT AGGTGTGGTT GTTGAAAGTA TCGAGCAGCT TTCAAAAATG
GTTGAAAAAG ATGTCACTGT TGTAGACAGC AAAGATGTTA GAGTTCTTCA TAAAAGAATA
GAAGGCAATA ATCTGATTTT TCTCTTCAAT GTTTCAGGTA CTTCATTTTT GGGTAAGATA
ATATTAAAAT TTTCTAAGAA AAATGTATAT ATATGGGATC ATATACAGAA TAAATTTTTA
ATGGTTTCAA ATATCAAAAG TAATAAAAAA AACATACAAT TAGAACTCTA TATACATCCA
TATCAGACTT TGGTTTTAAT AGCAAGTGAT GAGTATGTAG ATGGAATTCA AAAAACAACA
CTGCTTGGAA GCTTACCGAG AACAGTCTTG GAATTAAACG ATAACTGGGA AATTCATTTT
GATAAAGATT TTGTTTTGTT TTCAGATTTA AAAGATTGGC AAAGCTTGGG CTTTGGTGAC
TATTCTGGCA GTGTAGTTTA TAGAAAAATA TTTTCGTTTT CTCATGATGA CTTTATTAAA
AATAAACATC TTTTCCTCAA CTGCCCCAAT GTAAAGTACT CTGCAAAGGT TTGGTTAAAT
AAAAGATATC TTGGTGTAAG AGCTTTTTCG CCTTTTATGT GGGATATAAC AGAGGCATTG
AAAATTGGTG AGAATGAACT TGTGATTGAA GTTCAAAACA CCCCTGCAGC AGCTCTACTT
GGAACACAAG AAAAATTGGA AAAATTAAGA AAAGAGGCAG AGAAGAACTT TTATCTTTCT
ATTTCTCTAA AATTTGACCT GGAAATGGTC CAATCAGGAT TGTTGCCTCC AGTTGCTATT
GTTTCTTTAG AATGA
 
Protein sequence
MNESIKLNSL FDEINNPPAT ASIIHWWIFS DEMNENRINA ELDYISNLGF KQVLIAVGHN 
VSPKYLTHGW FEMVKFAVLQ AKKRGVKVWI ADEGTYPSGF AGETFNKKYP HKRMKAIIVE
KEFIIEGNLC EVEPHSGTIG ILAKDMNQNK YFAFEKLEFS SGFLYLPYHS TWQIKVISSA
YRTSPTRYVH HPTGAKDTTF SLCDYLDYEA VNLFISEVYE KYKAYMGNEF GKTIIGFFAD
EPDYSISGLP YTDNIFDIFY NEQGYDVKKY IPYFFKEQLD EKIKRVKADY WDVWSNIFTN
TFFKQIYKWC EANGLKFVVH LNHEDMIEHL TKSEGQFFSH MKYVHIPAID VIWRQIWYDK
KAIFPKYASS VSHIKNIAQT FSESFAVYGQ GISVEQIKWV VDYQFAMDIN LFLTSIFKYL
YDHPQNYFFP EVIKYINTIS YLLYVSTPCT KVLVYFPTPD LWAGENMSAS KAMEIGNALL
ENQIDFDFFD HSLLEYLEIK NHRIYANNRK EYDIVILPPI KYLPQDLFRF LKLFSSKGGK
IIFFENSPLF VYNKTFTSFF HFVDREIGVV VESIEQLSKM VEKDVTVVDS KDVRVLHKRI
EGNNLIFLFN VSGTSFLGKI ILKFSKKNVY IWDHIQNKFL MVSNIKSNKK NIQLELYIHP
YQTLVLIASD EYVDGIQKTT LLGSLPRTVL ELNDNWEIHF DKDFVLFSDL KDWQSLGFGD
YSGSVVYRKI FSFSHDDFIK NKHLFLNCPN VKYSAKVWLN KRYLGVRAFS PFMWDITEAL
KIGENELVIE VQNTPAAALL GTQEKLEKLR KEAEKNFYLS ISLKFDLEMV QSGLLPPVAI
VSLE