Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2076 |
Symbol | |
ID | 7408785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2195042 |
End bp | 2198161 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643716443 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002573926 |
Protein GI | 222530044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCA ACTTTAAAAT AAAACCATTC TGGTTTTGGA ATGGGAAAAT GGAAAATGAT GAAATAGCAG ATCAAATAGC TCAGATGCAT GAAAAAGGAA TTGGTGGATT CTTCATTCAT CCACGGCAAG GTCTCGAAAT ACCCTATCTT TCTCACGAGT GGTTTGAAAA GGTTTCTGTT GCAATTGAAT GTGCAAAAAA ATATAACATG GAAGTGTGGC TTTATGATGA ATATCCTTAT CCAAGTGGAA TTTCAGCTGG TGAAGTAGTT GTTCAGCATC CTGAATATCA AGCTTTTATA TTAGATTATA AAGTGTTTGA GGCAAAAGAT AATGAAGAAA TTTGCATTGA AATTCCCATG TGTGAAGTGT TATTAGCAAG AGCTTATAGA ATAAGGAACA ATATTATAGT ATGGAATGAA TACATAGATT TGATTGATTA TATTGGTGTA ATCTACAAAG AACACATTTA CCAAGAAAGT GGGCTTACGT TTTACAATAG GAAAAGATAC TTTGTTGGAG ATGGGGCAAA AGCACTTAAA TGGAAAGTAC CAAAAGGAAA GTGGAAGATA TTTTTGTTTT ATCAGTATCC ATTAAAAAAT TTTAAATATT TCGGGACATT TATTGATCCT CTAAATAAAG ATGCTGTAAG ACTGTTTATT CAAACCACTC ATGAAAAATA TAAAAAATAC TTGGGTCATG AGTTTGGAAA AACAATTAAA GGAATTTTTA CAGATGAAAC AGCTCCAGTT GCTGGCAAAC TTCCTTGGTC AAAATTGCTT CCTAAGCTTT TTGAGCAAAC ATATGGGGAA AATCTTATTG AAAAATTACC TCAAATTATT TGCACAGACA TATTTGATAC AGCCGGTTCA AAAATTAGGT ATCAGTTTTG GAAACTTGTT GTGGACACAT TTATTGAGAG CTACGACAAA CAAATTCTTG AATGGTGTCA TCAGAACAAT CTTTTGTATG TGGGCGAAAA GCCAATTTTG AGAAGCTCAC AATTAGCTTT TATGGACATT CCCGGAATAG ATGCAGGGCA CCAAAAGGCT GGTGACATTC CACAGGTTGT ATCTGAAAAC TACAGAGCAA ATCCAAAGAT AGCATCATCT GCCGCTCACT TTTATAAAAA GGAAAGGGTT TTGTGTGAAT GCTTTCACAG TATTGGCTGG AGCATGACAA TGCAAGATAT GAAATGGATA TTTGACTGGC TAATATTGCA GGGAATAGAT ATGTTTGTCC CCCATGCCTT TTATTATAGT GCAGATGGAC TCAAAAAACA CGATGCCCCA CCTTCAGCCT TTTTCCAAAT GCCTTGGTGG AAACATCAAA AAATATTGTC TGAGTATGTA GAAAATGTAA CTAAAATGCT TAAAAATTGT AAAAGAAAAG TTGATGTACT TATAGTAGAT CCAATTACAA GCCAGTGGAC CTGTTTTAAC GACAAAGAAG TAAAAGAGAA GATTTCGATG GATTTCTGTA GAATTCAGCA AATTCTTTTA GAAGAAAATG TAGACTATTA TGTGATTGAC CAGTCATTAG TAGGAAGTTT GGAATGCAGG AATCAGAAGA TTTATTATGA CAATGAAAAA TTTGAATTAT TGATCATTCC ACCTGTGACC AATTTAGAAA AAGAGGCTTA CATGAAGATA AAAGATCTAA TATTGAAAGG ATGTAAAGTT GTATTTATTG GCTGTTTGCC ATTCCAAACC ATCGAAGATT TTGATGTTGC CAAAGATATT AGCAATTTTC TTGGAGTAAA TTCTATGGAC ATCGCAAAAG CATATAAAAC AGGTTCTAAA TTGAACAATA CAGTTTTTCT TAACAGTTGT ATCTTCATTG GTAATATAGA AGATTTAGTA GCAAAAATTG ACAAGATTTG TAAAAAGCCT GTAAGTATAT CATATGAATC TTCTAATGAC CGTGGTATTC TCTGTGCTTA CTTTGAGGAC GCAGAACACG ACTTTTTATT TATGATTAAC CCAACTAATG AAAAGAAGAT CTGCAAAGTA TACTTGCGGT ACAGGCCTGA TGAAATAAAC AAAATTTATT CGGTTCCATT GACATCGCAA GAACTTGATA AGGAAATAAA TTTCGAAAAT TCTTTAGATA AAAAGCAAAT AACTTTTTCC ATGAATTTTG AACCTTTTCA ATCGTATTTA ATTAAATTAG AAAAAAGTTT TGTTAAGAAA AACGACCATA AAAATGATGT TGAAAAGAGA GTTTTTGAGT ACAAAATCCC TCTTGCAACT GTTTGGGAAT TTTCAATTGA GAGTTTAAAT CCTTTAAGGC TCGGCAGATG GAATTTAAAG TTGATTTTCA ACAACGAAAA TGAACAGTAT TCAATATCTT CAAAAATTCC AGTTACACCA AAACCAATAA TTGATCAGAT AGAAGAGGCA AAAATTCCAA TACCACTAAA AACAAAAAGT TTCTTTGGAT GTCCAAAAGA AATAACATTG CCAAGCTTTG AGGCAATATA CACTACTTCA TTTTTTATAG ATGCAGCAAG CCAAAAATTC TGGCTTGTAA TTGAAGATGA AGGTATAAAA GGTGAATGGG TTGTGCTTTT GAACAATCAT ACTATTTTAC CAAGAGATTT TGTACTTAAA AGATTTTATT CTCATACTAA TTTGGCATAT GATATTAGCA ACTTAATTAA ACTGGGGGAA AATCAGCTTT GTGTATGTGT AAAGATTAGC AGGTCTTTTG ATGGACTTCT TACACCCATT TATATCTTCA GCACGGCAGG TGTATTCAAA GTTGATGATA GCTGGCATAT AGACAAACTT CCAACTCAAG GCTGCTTTGG TAAAGACCTT GAAAATGGCA TTCCTTTTTA TGCTGGTTTT ATAAAGTACG AAAAAGAAGT TCAAATGCCA TCTTTTGAAA ATGGTTTTGT GGAATTCTTT ATTGAAGATA ACATAAATCA GTGTGTAAGT CTTTATATAA ATGATGAATT TATAGGTACA AGGTGCTGGC AGCCTTATAG ATGGAAGGTA GATTCTGATT TACTTTCTTC AAAGAAGGTA AAGCTCACAC TTGAAGTATC AACTTCGAGC CTGCAGCTGT TCGAAGGTGA AGTTATTGAA CCAATAACAC ATAAAATTAA GACAATATAA
|
Protein sequence | MNINFKIKPF WFWNGKMEND EIADQIAQMH EKGIGGFFIH PRQGLEIPYL SHEWFEKVSV AIECAKKYNM EVWLYDEYPY PSGISAGEVV VQHPEYQAFI LDYKVFEAKD NEEICIEIPM CEVLLARAYR IRNNIIVWNE YIDLIDYIGV IYKEHIYQES GLTFYNRKRY FVGDGAKALK WKVPKGKWKI FLFYQYPLKN FKYFGTFIDP LNKDAVRLFI QTTHEKYKKY LGHEFGKTIK GIFTDETAPV AGKLPWSKLL PKLFEQTYGE NLIEKLPQII CTDIFDTAGS KIRYQFWKLV VDTFIESYDK QILEWCHQNN LLYVGEKPIL RSSQLAFMDI PGIDAGHQKA GDIPQVVSEN YRANPKIASS AAHFYKKERV LCECFHSIGW SMTMQDMKWI FDWLILQGID MFVPHAFYYS ADGLKKHDAP PSAFFQMPWW KHQKILSEYV ENVTKMLKNC KRKVDVLIVD PITSQWTCFN DKEVKEKISM DFCRIQQILL EENVDYYVID QSLVGSLECR NQKIYYDNEK FELLIIPPVT NLEKEAYMKI KDLILKGCKV VFIGCLPFQT IEDFDVAKDI SNFLGVNSMD IAKAYKTGSK LNNTVFLNSC IFIGNIEDLV AKIDKICKKP VSISYESSND RGILCAYFED AEHDFLFMIN PTNEKKICKV YLRYRPDEIN KIYSVPLTSQ ELDKEINFEN SLDKKQITFS MNFEPFQSYL IKLEKSFVKK NDHKNDVEKR VFEYKIPLAT VWEFSIESLN PLRLGRWNLK LIFNNENEQY SISSKIPVTP KPIIDQIEEA KIPIPLKTKS FFGCPKEITL PSFEAIYTTS FFIDAASQKF WLVIEDEGIK GEWVVLLNNH TILPRDFVLK RFYSHTNLAY DISNLIKLGE NQLCVCVKIS RSFDGLLTPI YIFSTAGVFK VDDSWHIDKL PTQGCFGKDL ENGIPFYAGF IKYEKEVQMP SFENGFVEFF IEDNINQCVS LYINDEFIGT RCWQPYRWKV DSDLLSSKKV KLTLEVSTSS LQLFEGEVIE PITHKIKTI
|
| |