Gene Information |
Locus tag | Athe_2086 |
Symbol | |
ID | 7408795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2209280 |
End bp | 2210818 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716453 |
Product | Beta-glucuronidase |
Protein accession | YP_002573936 |
Protein GI | 222530054 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
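The length fields above follow from the coordinates: positions are 1-based and inclusive, and the final codon of a CDS is a stop codon that is not counted in the protein. A minimal sketch of that arithmetic is below. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2209280, 2210818)
    print(length, "bp")                  # Gene Length field
    print(protein_length(length), "aa")  # Protein Length field
```

With the Start bp and End bp values listed above, this gives 1539 bp and 512 aa, matching the Gene Length and Protein Length fields.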
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000473091 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTACATCA TTATATTCCA CTTTCAGTAC ATTTTTATTT TATCATTAAC ACCTTTTGAA GTTGAGATTA ATAAATTTGC CCAAATTGGC TGTGAGAATA GACTTACAAT TGTTGTAAAC AACATCTTGG ACTGGAGTTG TCTTCCACCA GGGTTTATAA GGGAATACAA TGACCCAATG CATCCAGAAG GGTATAAAAC TCAGGAATAT CTTTTTGACT TTTTCAACTA TTCAGGTATT CACAGACCAG TTTTGCTCTA CACCACTTCC AAAACATATA TTGAGGATAT TAAGATTGAA ACCCAGATTG AGGGTCAAAA GGGTATAGTT TGCTTTAAGG TGGCTGTAAG TGGCGAAAAA AAGGATGAAT GTCAGATAGC AGTAGCTTTG TATGACAAAG ATGGAAAGCA AATAGCAAAG GTCGAAGGGC CAGAGGGTAT GATAGAGGTT GGAGATGCGA TATTTTGGGA GCCTTCAAAT CCATATCTTT ACAAACTAAA TGTAACTTTA ATACACGATG AAAAGGTGGT AGATGAATAT TATCTTCCTG TGGGAATAAG GACAGTTGAG GTAAAAGGCA AAAGACTTTT CCTAAATGGT AAGCCAGTGT ATCTTAAAGG TTTGGCAAAG CATGAAGACA GTGATATAAG GGGCAAGGGA TACGACCCTG TGATAGCTGT GAAAGATTTC AACCTCCTAA AATGGATAGG AGCAAACTCA TTCAGAACAT CACATTATCC TTACGCAGAA GAGATTTTAA ACTTGGCAGA CGAGTATGGT TTTTTGGTAA TTGACGAGGC ACCAGCTGTT GGCATGAATT TCTTTAACAA AAACGAAAAA GTGTTTACCG CGGAGAGAGT AAACCAAAAG ACATTAGAAC ATCACTTAGA AGTTATAAGA CAACTTATTG CAAGGGATAA AAACCATCCA AGTGTGATTA TGTGGAGTGT GGCAAATGAG GCTGCAACAT ATGAAGATGG GGCATATGAA TATTTCAAAA GAGTAATAGA TGAGGTGAGA AAGCTTGACC CGACAAGACC GGTGACGCTG GTTGAATCCT CTTTTCCAGA TGAGACCAAA GTGGGAAGTC TTGTTGATGT TATATGTGTA AACAGGTACT ATTCATGGTA TTCTGATCCT GGCAGACTGG ATTTGATAGA GTTCCAGCTT GAAAAGGAGC TGAAAAGGTG GTTTGAGCTT TATCAAAAAC CAGTGATAAT AACAGAGTAT GGGGCAGATA CAATTGCAGG ATTTCATTCA AGTCCTCCAA TGATGTTTTC TGAGGAATAT CAGTGTGAGA TGCTTGAAAG ATATCATAGG GTGTTTGACA GGCTGGATTT TGTGATAGGC GAACACATAT GGAACTTTGC AGACTTTGCA ACAAAACAAG AGGTTCGAAG GATTATGGGC AACAGGAAAG GAATCTTTAC AAGGCAAAGA CAGCCAAAAG CCGCAGCTTT CTTGCTCAAA AAAAGATGGC AAAATTCAGA GCACAAAAGG CTGGAGGAAA ATGTTTCAGA AGATAAAACA CGTAATTAA
|
Protein sequence | MYIIIFHFQY IFILSLTPFE VEINKFAQIG CENRLTIVVN NILDWSCLPP GFIREYNDPM HPEGYKTQEY LFDFFNYSGI HRPVLLYTTS KTYIEDIKIE TQIEGQKGIV CFKVAVSGEK KDECQIAVAL YDKDGKQIAK VEGPEGMIEV GDAIFWEPSN PYLYKLNVTL IHDEKVVDEY YLPVGIRTVE VKGKRLFLNG KPVYLKGLAK HEDSDIRGKG YDPVIAVKDF NLLKWIGANS FRTSHYPYAE EILNLADEYG FLVIDEAPAV GMNFFNKNEK VFTAERVNQK TLEHHLEVIR QLIARDKNHP SVIMWSVANE AATYEDGAYE YFKRVIDEVR KLDPTRPVTL VESSFPDETK VGSLVDVICV NRYYSWYSDP GRLDLIEFQL EKELKRWFEL YQKPVIITEY GADTIAGFHS SPPMMFSEEY QCEMLERYHR VFDRLDFVIG EHIWNFADFA TKQEVRRIMG NRKGIFTRQR QPKAAAFLLK KRWQNSEHKR LEENVSEDKT RN
|
| |
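The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below shows one way to check a gene sequence against its protein sequence. The `translate` function and the `ALT_STARTS` set are illustrative assumptions: the set lists only three commonly used bacterial start codons rather than every initiator defined for table 11, and the codon table is built from the standard TCAG-ordered amino acid string.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the start codons allowed by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    A recognized start codon in the first position is read as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above, with the record's spacing.
    print(translate("TTGTACATCA TTATATTCCA CTTTCAGTAC"))
```

Run on the first 30 bases of the gene sequence, this prints MYIIIFHFQY, the first ten residues of the protein sequence. Passing the full gene sequence and comparing the result with the Protein sequence field would extend the same check to the whole record.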