Gene Athe_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2076 
Symbol 
ID7408785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2195042 
End bp2198161 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content33% 
IMG OID643716443 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002573926 
Protein GI222530044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCA ACTTTAAAAT AAAACCATTC TGGTTTTGGA ATGGGAAAAT GGAAAATGAT 
GAAATAGCAG ATCAAATAGC TCAGATGCAT GAAAAAGGAA TTGGTGGATT CTTCATTCAT
CCACGGCAAG GTCTCGAAAT ACCCTATCTT TCTCACGAGT GGTTTGAAAA GGTTTCTGTT
GCAATTGAAT GTGCAAAAAA ATATAACATG GAAGTGTGGC TTTATGATGA ATATCCTTAT
CCAAGTGGAA TTTCAGCTGG TGAAGTAGTT GTTCAGCATC CTGAATATCA AGCTTTTATA
TTAGATTATA AAGTGTTTGA GGCAAAAGAT AATGAAGAAA TTTGCATTGA AATTCCCATG
TGTGAAGTGT TATTAGCAAG AGCTTATAGA ATAAGGAACA ATATTATAGT ATGGAATGAA
TACATAGATT TGATTGATTA TATTGGTGTA ATCTACAAAG AACACATTTA CCAAGAAAGT
GGGCTTACGT TTTACAATAG GAAAAGATAC TTTGTTGGAG ATGGGGCAAA AGCACTTAAA
TGGAAAGTAC CAAAAGGAAA GTGGAAGATA TTTTTGTTTT ATCAGTATCC ATTAAAAAAT
TTTAAATATT TCGGGACATT TATTGATCCT CTAAATAAAG ATGCTGTAAG ACTGTTTATT
CAAACCACTC ATGAAAAATA TAAAAAATAC TTGGGTCATG AGTTTGGAAA AACAATTAAA
GGAATTTTTA CAGATGAAAC AGCTCCAGTT GCTGGCAAAC TTCCTTGGTC AAAATTGCTT
CCTAAGCTTT TTGAGCAAAC ATATGGGGAA AATCTTATTG AAAAATTACC TCAAATTATT
TGCACAGACA TATTTGATAC AGCCGGTTCA AAAATTAGGT ATCAGTTTTG GAAACTTGTT
GTGGACACAT TTATTGAGAG CTACGACAAA CAAATTCTTG AATGGTGTCA TCAGAACAAT
CTTTTGTATG TGGGCGAAAA GCCAATTTTG AGAAGCTCAC AATTAGCTTT TATGGACATT
CCCGGAATAG ATGCAGGGCA CCAAAAGGCT GGTGACATTC CACAGGTTGT ATCTGAAAAC
TACAGAGCAA ATCCAAAGAT AGCATCATCT GCCGCTCACT TTTATAAAAA GGAAAGGGTT
TTGTGTGAAT GCTTTCACAG TATTGGCTGG AGCATGACAA TGCAAGATAT GAAATGGATA
TTTGACTGGC TAATATTGCA GGGAATAGAT ATGTTTGTCC CCCATGCCTT TTATTATAGT
GCAGATGGAC TCAAAAAACA CGATGCCCCA CCTTCAGCCT TTTTCCAAAT GCCTTGGTGG
AAACATCAAA AAATATTGTC TGAGTATGTA GAAAATGTAA CTAAAATGCT TAAAAATTGT
AAAAGAAAAG TTGATGTACT TATAGTAGAT CCAATTACAA GCCAGTGGAC CTGTTTTAAC
GACAAAGAAG TAAAAGAGAA GATTTCGATG GATTTCTGTA GAATTCAGCA AATTCTTTTA
GAAGAAAATG TAGACTATTA TGTGATTGAC CAGTCATTAG TAGGAAGTTT GGAATGCAGG
AATCAGAAGA TTTATTATGA CAATGAAAAA TTTGAATTAT TGATCATTCC ACCTGTGACC
AATTTAGAAA AAGAGGCTTA CATGAAGATA AAAGATCTAA TATTGAAAGG ATGTAAAGTT
GTATTTATTG GCTGTTTGCC ATTCCAAACC ATCGAAGATT TTGATGTTGC CAAAGATATT
AGCAATTTTC TTGGAGTAAA TTCTATGGAC ATCGCAAAAG CATATAAAAC AGGTTCTAAA
TTGAACAATA CAGTTTTTCT TAACAGTTGT ATCTTCATTG GTAATATAGA AGATTTAGTA
GCAAAAATTG ACAAGATTTG TAAAAAGCCT GTAAGTATAT CATATGAATC TTCTAATGAC
CGTGGTATTC TCTGTGCTTA CTTTGAGGAC GCAGAACACG ACTTTTTATT TATGATTAAC
CCAACTAATG AAAAGAAGAT CTGCAAAGTA TACTTGCGGT ACAGGCCTGA TGAAATAAAC
AAAATTTATT CGGTTCCATT GACATCGCAA GAACTTGATA AGGAAATAAA TTTCGAAAAT
TCTTTAGATA AAAAGCAAAT AACTTTTTCC ATGAATTTTG AACCTTTTCA ATCGTATTTA
ATTAAATTAG AAAAAAGTTT TGTTAAGAAA AACGACCATA AAAATGATGT TGAAAAGAGA
GTTTTTGAGT ACAAAATCCC TCTTGCAACT GTTTGGGAAT TTTCAATTGA GAGTTTAAAT
CCTTTAAGGC TCGGCAGATG GAATTTAAAG TTGATTTTCA ACAACGAAAA TGAACAGTAT
TCAATATCTT CAAAAATTCC AGTTACACCA AAACCAATAA TTGATCAGAT AGAAGAGGCA
AAAATTCCAA TACCACTAAA AACAAAAAGT TTCTTTGGAT GTCCAAAAGA AATAACATTG
CCAAGCTTTG AGGCAATATA CACTACTTCA TTTTTTATAG ATGCAGCAAG CCAAAAATTC
TGGCTTGTAA TTGAAGATGA AGGTATAAAA GGTGAATGGG TTGTGCTTTT GAACAATCAT
ACTATTTTAC CAAGAGATTT TGTACTTAAA AGATTTTATT CTCATACTAA TTTGGCATAT
GATATTAGCA ACTTAATTAA ACTGGGGGAA AATCAGCTTT GTGTATGTGT AAAGATTAGC
AGGTCTTTTG ATGGACTTCT TACACCCATT TATATCTTCA GCACGGCAGG TGTATTCAAA
GTTGATGATA GCTGGCATAT AGACAAACTT CCAACTCAAG GCTGCTTTGG TAAAGACCTT
GAAAATGGCA TTCCTTTTTA TGCTGGTTTT ATAAAGTACG AAAAAGAAGT TCAAATGCCA
TCTTTTGAAA ATGGTTTTGT GGAATTCTTT ATTGAAGATA ACATAAATCA GTGTGTAAGT
CTTTATATAA ATGATGAATT TATAGGTACA AGGTGCTGGC AGCCTTATAG ATGGAAGGTA
GATTCTGATT TACTTTCTTC AAAGAAGGTA AAGCTCACAC TTGAAGTATC AACTTCGAGC
CTGCAGCTGT TCGAAGGTGA AGTTATTGAA CCAATAACAC ATAAAATTAA GACAATATAA
 
Protein sequence
MNINFKIKPF WFWNGKMEND EIADQIAQMH EKGIGGFFIH PRQGLEIPYL SHEWFEKVSV 
AIECAKKYNM EVWLYDEYPY PSGISAGEVV VQHPEYQAFI LDYKVFEAKD NEEICIEIPM
CEVLLARAYR IRNNIIVWNE YIDLIDYIGV IYKEHIYQES GLTFYNRKRY FVGDGAKALK
WKVPKGKWKI FLFYQYPLKN FKYFGTFIDP LNKDAVRLFI QTTHEKYKKY LGHEFGKTIK
GIFTDETAPV AGKLPWSKLL PKLFEQTYGE NLIEKLPQII CTDIFDTAGS KIRYQFWKLV
VDTFIESYDK QILEWCHQNN LLYVGEKPIL RSSQLAFMDI PGIDAGHQKA GDIPQVVSEN
YRANPKIASS AAHFYKKERV LCECFHSIGW SMTMQDMKWI FDWLILQGID MFVPHAFYYS
ADGLKKHDAP PSAFFQMPWW KHQKILSEYV ENVTKMLKNC KRKVDVLIVD PITSQWTCFN
DKEVKEKISM DFCRIQQILL EENVDYYVID QSLVGSLECR NQKIYYDNEK FELLIIPPVT
NLEKEAYMKI KDLILKGCKV VFIGCLPFQT IEDFDVAKDI SNFLGVNSMD IAKAYKTGSK
LNNTVFLNSC IFIGNIEDLV AKIDKICKKP VSISYESSND RGILCAYFED AEHDFLFMIN
PTNEKKICKV YLRYRPDEIN KIYSVPLTSQ ELDKEINFEN SLDKKQITFS MNFEPFQSYL
IKLEKSFVKK NDHKNDVEKR VFEYKIPLAT VWEFSIESLN PLRLGRWNLK LIFNNENEQY
SISSKIPVTP KPIIDQIEEA KIPIPLKTKS FFGCPKEITL PSFEAIYTTS FFIDAASQKF
WLVIEDEGIK GEWVVLLNNH TILPRDFVLK RFYSHTNLAY DISNLIKLGE NQLCVCVKIS
RSFDGLLTPI YIFSTAGVFK VDDSWHIDKL PTQGCFGKDL ENGIPFYAGF IKYEKEVQMP
SFENGFVEFF IEDNINQCVS LYINDEFIGT RCWQPYRWKV DSDLLSSKKV KLTLEVSTSS
LQLFEGEVIE PITHKIKTI