Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4462 |
Symbol | |
ID | 6977556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 94138 |
End bp | 94983 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393640 |
Product | hypothetical protein |
Protein accession | YP_002278458 |
Protein GI | 209546540 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0193122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCTGC AGGCGATGAT CGACGAATGG TATCCGGTTG GGCTTTTCAG TCAGCTCGAC AGCGCCGGTC GCAAGACGAG GCTGATGGGC GAGCCAATCG ACGTGGCGAG CGACGCCGAT GGCAATGCGA GGGTCACGGG CGGCGACGGC CGTGTTCTGC CGGTGCGCGT GCGTTACGGC CATGTCTGGT CCTCGCTCGG CGAACCGCAG AAGGAAATTT TCCCGATTCC CGAGGCCGAC CAGCCCGGCC GCCGTTTCGT CGACGTCGGC GTGGTGCGCG TGCGCTGCTC GCCCCTGCGC GCTGTCGAAA ACTTCCTCGA CATTGCCCAT TTCCCCTTCG TCCATACCGA CATTCTCGGC GCAGAGCCGC ACACCGAGGT TCAGAACTAC AAGGTCGAGA TCCGCGAGGA GGAAGATGAA GTCTGGGCAA CGCAGGTGAA GTTTTACCAG CCGCAGGCCG CCAAATCGGC AAGCGGCGGC ATCACCACGG AGTATATGTA CCGCGTGCCG GCACCGACCT GCTCCGTGCT CTACAAGACC TGTCCGCCGC GCCCGAGCGA ATGGGATGTC ATCACGCTCT TCGTGCAGCC GCTGGCCGAG GACCTGTGCG ACGTCTGGCC ATGGATGGCG CTTTTCGATG ACGAGACCGC GATGACCGAT CTCATCCACT TCCAGCAGAC GATCTTCCTG CAGGACCGTT CGATTCTGGA AAACCAGATC CCACCGCTTC TGCCGCTCGA CCCCGGCATG GAAATCCCGA CGCGGGCCGA TCTCACATCG ATCGCTTACC GACGCTGGCT GAAGCGTCAC AATTATACCT ATGGCGCACA GCTGGTGGCG CAATGA
|
Protein sequence | MSLQAMIDEW YPVGLFSQLD SAGRKTRLMG EPIDVASDAD GNARVTGGDG RVLPVRVRYG HVWSSLGEPQ KEIFPIPEAD QPGRRFVDVG VVRVRCSPLR AVENFLDIAH FPFVHTDILG AEPHTEVQNY KVEIREEEDE VWATQVKFYQ PQAAKSASGG ITTEYMYRVP APTCSVLYKT CPPRPSEWDV ITLFVQPLAE DLCDVWPWMA LFDDETAMTD LIHFQQTIFL QDRSILENQI PPLLPLDPGM EIPTRADLTS IAYRRWLKRH NYTYGAQLVA Q
|
| |