Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2094 |
Symbol | |
ID | 7408803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2219790 |
End bp | 2222324 |
Gene Length | 2535 bp |
Protein Length | 844 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643716461 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002573944 |
Protein GI | 222530062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000180475 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGA GTATAAAATT GAATTCTCTT TTTGATGAAA TCAACAACCC TCCAGCTACT GCCAGTATTA TTCATTGGTG GATTTTTTCT GATGAGATGA ACGAGAACAG AATAAATGCT GAGCTTGATT ATATTTCAAA TCTTGGCTTT AAGCAAGTAT TAATTGCAGT AGGACACAAT GTTTCGCCTA AATATTTGAC ACATGGTTGG TTTGAAATGG TAAAATTTGC AGTTCTCCAA GCTAAAAAAA GAGGAGTTAA AGTGTGGATT GCCGATGAAG GGACATATCC AAGTGGCTTT GCTGGCGAAA CTTTTAATAA GAAGTATCCT CACAAAAGGA TGAAGGCTAT TATTGTTGAG AAGGAGTTTA TTATTGAGGG TAATTTATGT GAAGTTGAAC CTCACTCTGG TACAATTGGG ATTTTGGCCA AAGACATGAA CCAGAATAAA TACTTTGCTT TTGAAAAGCT TGAATTTAGT AGCGGATTTT TATACTTGCC CTATCATTCG ACTTGGCAAA TAAAAGTAAT ATCTTCAGCT TACAGGACAT CTCCAACAAG ATACGTTCAC CATCCAACAG GTGCAAAAGA TACTACATTT TCACTTTGTG ATTATCTTGA CTATGAAGCT GTCAATCTAT TCATAAGTGA GGTATATGAA AAATATAAAG CTTATATGGG AAATGAATTT GGAAAGACAA TAATTGGATT TTTCGCTGAC GAACCTGATT ATTCTATTTC TGGACTACCA TATACGGATA ATATATTTGA TATATTTTAC AATGAACAAG GATACGACGT TAAAAAGTAC ATACCGTATT TCTTTAAAGA GCAATTAGAT GAAAAAATAA AAAGAGTAAA AGCAGATTAC TGGGATGTAT GGAGCAATAT TTTTACAAAT ACTTTCTTTA AGCAGATCTA CAAATGGTGT GAGGCAAATG GCCTCAAATT TGTAGTACAT CTAAATCATG AAGATATGAT AGAACACCTT ACCAAATCTG AAGGACAGTT CTTTTCGCAT ATGAAGTATG TTCATATTCC AGCAATTGAT GTAATTTGGA GACAAATCTG GTATGACAAA AAAGCAATAT TCCCTAAATA CGCTTCTTCT GTTTCTCATA TTAAAAATAT TGCTCAGACC TTTTCAGAGA GTTTTGCAGT ATATGGACAA GGTATATCTG TTGAGCAAAT CAAATGGGTA GTTGATTACC AGTTTGCAAT GGACATAAAT CTATTTTTGA CCTCAATCTT CAAGTATCTT TATGACCATC CGCAAAATTA TTTCTTTCCA GAGGTAATTA AGTATATTAA TACCATTTCA TATCTTCTCT ATGTAAGCAC CCCTTGTACA AAGGTTCTGG TTTACTTTCC TACACCGGAT CTGTGGGCAG GTGAAAATAT GTCTGCTTCA AAAGCAATGG AAATTGGCAA TGCACTTTTA GAGAACCAGA TTGATTTTGA TTTTTTTGAC CATTCTCTTT TAGAATATCT GGAAATTAAA AACCATAGAA TATACGCTAA CAATAGAAAA GAATACGACA TTGTTATTCT TCCGCCTATA AAGTATTTGC CACAAGATCT GTTCAGATTT TTAAAGCTTT TCTCAAGCAA AGGAGGGAAG ATTATTTTCT TCGAGAACTC TCCTTTGTTT GTTTATAACA AAACCTTTAC ATCGTTTTTC CACTTTGTAG ATAGAGAAAT AGGTGTGGTT GTTGAAAGTA TCGAGCAGCT TTCAAAAATG GTTGAAAAAG ATGTCACTGT TGTAGACAGC AAAGATGTTA GAGTTCTTCA TAAAAGAATA GAAGGCAATA ATCTGATTTT TCTCTTCAAT GTTTCAGGTA CTTCATTTTT GGGTAAGATA ATATTAAAAT TTTCTAAGAA AAATGTATAT ATATGGGATC ATATACAGAA TAAATTTTTA ATGGTTTCAA ATATCAAAAG TAATAAAAAA AACATACAAT TAGAACTCTA TATACATCCA TATCAGACTT TGGTTTTAAT AGCAAGTGAT GAGTATGTAG ATGGAATTCA AAAAACAACA CTGCTTGGAA GCTTACCGAG AACAGTCTTG GAATTAAACG ATAACTGGGA AATTCATTTT GATAAAGATT TTGTTTTGTT TTCAGATTTA AAAGATTGGC AAAGCTTGGG CTTTGGTGAC TATTCTGGCA GTGTAGTTTA TAGAAAAATA TTTTCGTTTT CTCATGATGA CTTTATTAAA AATAAACATC TTTTCCTCAA CTGCCCCAAT GTAAAGTACT CTGCAAAGGT TTGGTTAAAT AAAAGATATC TTGGTGTAAG AGCTTTTTCG CCTTTTATGT GGGATATAAC AGAGGCATTG AAAATTGGTG AGAATGAACT TGTGATTGAA GTTCAAAACA CCCCTGCAGC AGCTCTACTT GGAACACAAG AAAAATTGGA AAAATTAAGA AAAGAGGCAG AGAAGAACTT TTATCTTTCT ATTTCTCTAA AATTTGACCT GGAAATGGTC CAATCAGGAT TGTTGCCTCC AGTTGCTATT GTTTCTTTAG AATGA
|
Protein sequence | MNESIKLNSL FDEINNPPAT ASIIHWWIFS DEMNENRINA ELDYISNLGF KQVLIAVGHN VSPKYLTHGW FEMVKFAVLQ AKKRGVKVWI ADEGTYPSGF AGETFNKKYP HKRMKAIIVE KEFIIEGNLC EVEPHSGTIG ILAKDMNQNK YFAFEKLEFS SGFLYLPYHS TWQIKVISSA YRTSPTRYVH HPTGAKDTTF SLCDYLDYEA VNLFISEVYE KYKAYMGNEF GKTIIGFFAD EPDYSISGLP YTDNIFDIFY NEQGYDVKKY IPYFFKEQLD EKIKRVKADY WDVWSNIFTN TFFKQIYKWC EANGLKFVVH LNHEDMIEHL TKSEGQFFSH MKYVHIPAID VIWRQIWYDK KAIFPKYASS VSHIKNIAQT FSESFAVYGQ GISVEQIKWV VDYQFAMDIN LFLTSIFKYL YDHPQNYFFP EVIKYINTIS YLLYVSTPCT KVLVYFPTPD LWAGENMSAS KAMEIGNALL ENQIDFDFFD HSLLEYLEIK NHRIYANNRK EYDIVILPPI KYLPQDLFRF LKLFSSKGGK IIFFENSPLF VYNKTFTSFF HFVDREIGVV VESIEQLSKM VEKDVTVVDS KDVRVLHKRI EGNNLIFLFN VSGTSFLGKI ILKFSKKNVY IWDHIQNKFL MVSNIKSNKK NIQLELYIHP YQTLVLIASD EYVDGIQKTT LLGSLPRTVL ELNDNWEIHF DKDFVLFSDL KDWQSLGFGD YSGSVVYRKI FSFSHDDFIK NKHLFLNCPN VKYSAKVWLN KRYLGVRAFS PFMWDITEAL KIGENELVIE VQNTPAAALL GTQEKLEKLR KEAEKNFYLS ISLKFDLEMV QSGLLPPVAI VSLE
|
| |