Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1511 |
Symbol | |
ID | 8012595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1492829 |
End bp | 1494190 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644824099 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_002975341 |
Protein GI | 241204245 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000621512 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.470006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA CATCGATCCA GACGTCGGAT GGTGAAGCCG CGACAGCAAA CACGCTGAAA TATATGCCGG GGTTCGGCAA TGACTTCGAA ACCGAGTCGC TTCCCGGCGC CTTGCCGCAA GGCCAGAACA GTCCGCAGAA ATGCAACTAT GGTCTCTATG CGGAGCAGCT TTCCGGCTCG CCGTTCACCG CGCCGCGCGG GACCAACGAA AGGTCCTGGC TTTACCGCAT CCGCCCGAGC GTGCGTCACA CCCGTCGCTT CTCCAACGCG TCCTATCCGC TCTGGAAAAC CGCACCTTGC CTGGACGAAC ATTCGCTTCC TCTCGGCCAG CTTCGCTGGG ATCCCATCCC CGCACCCTCG GAGAAGCTGA CGTTTCTCGA GGGGGTGCGG ACCATCACCA CGGCAGGCGA TGCCACCACC CAGGTGGGCA TGTCAGCCCA TGCCTATGTC TTCAATGAGG ACATGGTCGA CGATTACTTC TTCAACGCCG ATGGTGAATT GCTGATCGTG CCGCAGCTCG GCGCCATCAG AGTGTTCACC GAAATGGGCA TCATGGATGT CGAGCCCCTG GAAATATGCC TGATCCCGCG CGGCATGATG TTCAAGATCA TGAGGGGTGG CGACCAGACG GTCTGGCGTG GCTACATCTG CGAGAACTAC GGCGCGAAAT TCACCCTGCC GGACCGCGGA CCGATCGGCG CCAACTGCCT GGCAAACCCG CGTGACTTCA AGACGCCTGT CGCCGCATTC GAGGATAAGG AAACGCCGTG CCGCGTGCAT GTGAAGTGGT GCGGAAAATT CTATGTCACC GACATCGGCC ATTCGCCGCT GGATGTGGTG GCCTGGCACG GCAACTACGC CCCGTTCAAA TACGACTTGC GGACGTTCTC GCCGGTCGGC GCTATCCGCT TCGATCATCC CGATCCGTCG ATCTTTTCGG TGCTGACCGC GCCGACCGAA GATGCGGGTA CGGCGAACGT CGATTTCGTG ATCTTTCCGC CGCGCTGGCT GGTCGCCGAA CATACGTTTC GACCGCCTTG GTACCACCGC AACATCATGA GCGAATTCAT GGGCCTGATC CATGGCCAAT ATGACGCCAA GGAGGAGGGC TTCGTGCCGG GCGGCATGAG CCTGCACAAC ATGATGCTTC CCCACGGGCC GGACGCGCTC GCCTTCGAAA AGGCATCCAA TACCGAGCTC AAACCCGTGA AGCTCGATCA CACCATGGCC TTCATGTTCG AGACCCGGTA CCCGCAGCAA CTGACGAAAT ACGCAGCCGA GCTCGAAACG CTGCAGGATA ATTACCTGGA ATGCTGGGAC GGCCTGGAAC GCAAGTTCGA CGGAACCCCC GGCATCAAGT GA
|
Protein sequence | MDQTSIQTSD GEAATANTLK YMPGFGNDFE TESLPGALPQ GQNSPQKCNY GLYAEQLSGS PFTAPRGTNE RSWLYRIRPS VRHTRRFSNA SYPLWKTAPC LDEHSLPLGQ LRWDPIPAPS EKLTFLEGVR TITTAGDATT QVGMSAHAYV FNEDMVDDYF FNADGELLIV PQLGAIRVFT EMGIMDVEPL EICLIPRGMM FKIMRGGDQT VWRGYICENY GAKFTLPDRG PIGANCLANP RDFKTPVAAF EDKETPCRVH VKWCGKFYVT DIGHSPLDVV AWHGNYAPFK YDLRTFSPVG AIRFDHPDPS IFSVLTAPTE DAGTANVDFV IFPPRWLVAE HTFRPPWYHR NIMSEFMGLI HGQYDAKEEG FVPGGMSLHN MMLPHGPDAL AFEKASNTEL KPVKLDHTMA FMFETRYPQQ LTKYAAELET LQDNYLECWD GLERKFDGTP GIK
|
| |