Gene Athe_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2086 
Symbol 
ID7408795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2209280 
End bp2210818 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content38% 
IMG OID643716453 
ProductBeta-glucuronidase 
Protein accessionYP_002573936 
Protein GI222530054 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000473091 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTACATCA TTATATTCCA CTTTCAGTAC ATTTTTATTT TATCATTAAC ACCTTTTGAA 
GTTGAGATTA ATAAATTTGC CCAAATTGGC TGTGAGAATA GACTTACAAT TGTTGTAAAC
AACATCTTGG ACTGGAGTTG TCTTCCACCA GGGTTTATAA GGGAATACAA TGACCCAATG
CATCCAGAAG GGTATAAAAC TCAGGAATAT CTTTTTGACT TTTTCAACTA TTCAGGTATT
CACAGACCAG TTTTGCTCTA CACCACTTCC AAAACATATA TTGAGGATAT TAAGATTGAA
ACCCAGATTG AGGGTCAAAA GGGTATAGTT TGCTTTAAGG TGGCTGTAAG TGGCGAAAAA
AAGGATGAAT GTCAGATAGC AGTAGCTTTG TATGACAAAG ATGGAAAGCA AATAGCAAAG
GTCGAAGGGC CAGAGGGTAT GATAGAGGTT GGAGATGCGA TATTTTGGGA GCCTTCAAAT
CCATATCTTT ACAAACTAAA TGTAACTTTA ATACACGATG AAAAGGTGGT AGATGAATAT
TATCTTCCTG TGGGAATAAG GACAGTTGAG GTAAAAGGCA AAAGACTTTT CCTAAATGGT
AAGCCAGTGT ATCTTAAAGG TTTGGCAAAG CATGAAGACA GTGATATAAG GGGCAAGGGA
TACGACCCTG TGATAGCTGT GAAAGATTTC AACCTCCTAA AATGGATAGG AGCAAACTCA
TTCAGAACAT CACATTATCC TTACGCAGAA GAGATTTTAA ACTTGGCAGA CGAGTATGGT
TTTTTGGTAA TTGACGAGGC ACCAGCTGTT GGCATGAATT TCTTTAACAA AAACGAAAAA
GTGTTTACCG CGGAGAGAGT AAACCAAAAG ACATTAGAAC ATCACTTAGA AGTTATAAGA
CAACTTATTG CAAGGGATAA AAACCATCCA AGTGTGATTA TGTGGAGTGT GGCAAATGAG
GCTGCAACAT ATGAAGATGG GGCATATGAA TATTTCAAAA GAGTAATAGA TGAGGTGAGA
AAGCTTGACC CGACAAGACC GGTGACGCTG GTTGAATCCT CTTTTCCAGA TGAGACCAAA
GTGGGAAGTC TTGTTGATGT TATATGTGTA AACAGGTACT ATTCATGGTA TTCTGATCCT
GGCAGACTGG ATTTGATAGA GTTCCAGCTT GAAAAGGAGC TGAAAAGGTG GTTTGAGCTT
TATCAAAAAC CAGTGATAAT AACAGAGTAT GGGGCAGATA CAATTGCAGG ATTTCATTCA
AGTCCTCCAA TGATGTTTTC TGAGGAATAT CAGTGTGAGA TGCTTGAAAG ATATCATAGG
GTGTTTGACA GGCTGGATTT TGTGATAGGC GAACACATAT GGAACTTTGC AGACTTTGCA
ACAAAACAAG AGGTTCGAAG GATTATGGGC AACAGGAAAG GAATCTTTAC AAGGCAAAGA
CAGCCAAAAG CCGCAGCTTT CTTGCTCAAA AAAAGATGGC AAAATTCAGA GCACAAAAGG
CTGGAGGAAA ATGTTTCAGA AGATAAAACA CGTAATTAA
 
Protein sequence
MYIIIFHFQY IFILSLTPFE VEINKFAQIG CENRLTIVVN NILDWSCLPP GFIREYNDPM 
HPEGYKTQEY LFDFFNYSGI HRPVLLYTTS KTYIEDIKIE TQIEGQKGIV CFKVAVSGEK
KDECQIAVAL YDKDGKQIAK VEGPEGMIEV GDAIFWEPSN PYLYKLNVTL IHDEKVVDEY
YLPVGIRTVE VKGKRLFLNG KPVYLKGLAK HEDSDIRGKG YDPVIAVKDF NLLKWIGANS
FRTSHYPYAE EILNLADEYG FLVIDEAPAV GMNFFNKNEK VFTAERVNQK TLEHHLEVIR
QLIARDKNHP SVIMWSVANE AATYEDGAYE YFKRVIDEVR KLDPTRPVTL VESSFPDETK
VGSLVDVICV NRYYSWYSDP GRLDLIEFQL EKELKRWFEL YQKPVIITEY GADTIAGFHS
SPPMMFSEEY QCEMLERYHR VFDRLDFVIG EHIWNFADFA TKQEVRRIMG NRKGIFTRQR
QPKAAAFLLK KRWQNSEHKR LEENVSEDKT RN