Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5533 |
Symbol | |
ID | 8016424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 119672 |
End bp | 120559 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644827700 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_002978900 |
Protein GI | 241518272 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR02439] catechol 1,2-dioxygenase, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.247277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00000648247 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGATG CACATGAAAA GGGTTTCTTC ACGGAAGAGA ACTCCGTTGA GGTGGTCACG AGCCGCAACG CCACCACCAA GGACCAGCGC CTGAAGCGCG TGATGGAGGT CGTGACACGC AAGCTGCATG AGGCGGTGAA GGAGCTTGAG CCGACGCAGG ACGAATGGAT GGAGGCAATT CTCTTTCTGA CCCGCACAGG ACATACATGC AACGAATGGC GACAGGAATT TATCCTGCTG TCGGACGTGC TCGGCGTGTC GATGCTGGTC GACGCCATTA ATAACCGCAA GCCCTCAGGC GCCTCCGAAA GCACTGTTCT TGGCCCGTTT CACGTTGCCG ACGCGCCGGA ACTGCCGATG GGCACCAATA TCTGCCTCGA TCACAAGGGC GAGGACATGG TGATCGGCGG CAGCATCCGT AGCACGGATG GCAGACCGAT TGCCGGCGCT GTCATCGACG TCTGGCAGGC CAACGACGAA GGCTTCTACG ACGTGCAGCA GAAGGGGATC CAACCAGACT TCAACCTTCG CGGCATCTTT CGCAGCGGCG CGGATGGCCG CTATTGGTTT CGCGCAGTCA AGCCCAAGTA TTACCCGATC CCGGACGATG GACCGGTCGG CAAGCTGCTC GGCGCGCTCG GTCGTCACCC CTACAGGCCC GCTCACCTGC ACTACATCAT CAAGGCCGAC GGCTTCGAGA CGCTCACGAC GCACATCTTT GATCCGGACG ATCCGTACAT CCACTCCGAC GCAGTCTTTG GCGTGAAGGA GAGCTTGCTT GCCAAGTTCC AGCAAGTCGA GGATTCGGTA CGCGCTGACG AGCTTGGTTT CTCTGGCAAG TTTTGGCAGA TAGAGCACGA TTTCGTGCTG GCTCGGCCCG AGGAGTAG
|
Protein sequence | MIDAHEKGFF TEENSVEVVT SRNATTKDQR LKRVMEVVTR KLHEAVKELE PTQDEWMEAI LFLTRTGHTC NEWRQEFILL SDVLGVSMLV DAINNRKPSG ASESTVLGPF HVADAPELPM GTNICLDHKG EDMVIGGSIR STDGRPIAGA VIDVWQANDE GFYDVQQKGI QPDFNLRGIF RSGADGRYWF RAVKPKYYPI PDDGPVGKLL GALGRHPYRP AHLHYIIKAD GFETLTTHIF DPDDPYIHSD AVFGVKESLL AKFQQVEDSV RADELGFSGK FWQIEHDFVL ARPEE
|
| |