Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5234 |
Symbol | |
ID | 8007408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 645650 |
End bp | 647035 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644822142 |
Product | transcriptional regulator, GntR family with aminotransferase domain |
Protein accession | YP_002973402 |
Protein GI | 241113567 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.214132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGT GGCGCCCCGA TCCCTCGCAA CTTCGCCGGC CGGCCTATCT TTCGCTTGCC GAACAGATCG CAAACGCGAT CACCGACGGC AAGCTCACCG ACGGCACACA ATTGCCGCCG CATCGCAAGC TGGCGGACGA CCTGCATCTT TCCGTGCAGA CGGTCAGCCG CGCCTATGAC GAGCTGATCC GTCGGGGGCT GATATCAGGC GAGATCGGAC GCGGCAGCTT CGTCCAGACC AGGCCGCGTG AACCGGAGCC GCCCTATCTG CCGGAACGGC TGGGCGAGGT GATCGATCTG TCGATCCTGA AACCGGTCTG CGAGCAGATC CATTTGGAAA GGCTGCGGCA GGCCTTCGGC TGGCTTTCGG AAAACCTGCC TTCCAGCTCG GCGCTGTCGT TCAGGCCGAA CATGGTGTTT CCGCGCCACC GCGCCGTCGC CACGGAATGG CTGGCGCGCT GCGGGCTGGA GATATCGCCG CTGAACATCA GCGTGACGAA CGGTGCGACA TCGGGCATGA CCGTGGCGTT GATGAGCGTT GCCCCGCCCG GCTCGACGGT TGCGACCGAA GCCATCAGCC ATCATACGCT CGTTCCGCTC TCGACCTATC TCGGTCTGCA CCTGGAAGGG TTGGCGATCG ACGAGGAGGG CATGATCCCC GATGCGCTGG ACGAAGCCTG CCGGAAGGGA CCGATCCGGG CGATTTTCCT GCAGCCCTCG GTGATCAATC CGATGGCGGC GCTGATGAGC GCGGAACGCA GACAGGCGCT CGCCACTGTC GCCGCCAAAC ATGATATCGC GATCATCGAA AACGATATTC TCGGCCCGAT GGTCGAGAAT CGTGCGCCGC CGATGGCCGC GTTTGCGCCG GAGCGGACAC TCTACGTTAC GAGCTTCACG AAAATTACCG TTCCGGGCCT GCGGATCGGC TATCTCACCG CGCCTGACCG CTATGTCGCC GCCGTCGCCA ACCGGCATCT CGTCTCCAAC TGGATGGCGA CGCCTGCCAT GGCCGAGATC GCCACCCGCT GGGTCAGCGA CGGCACGGCG ATGGAACTGG TCAACTGGCA GCGCCGCGCC CTTTTGAGCC GGCATGCGAT CGCGGCGGAG ATGCTGGCCG GCCAGCCATA CCGGGCGCAT CCGCAAAGCC TGCACGTCTG GCTGCCGCTC TCCGGCAACC ATACCGAAGA CGGGTTCGTA TCGCAGGCAA GGCTGCGCGG CGTGGCGATC GCGCCCGGCA AATCGTTCCA CACCACCGAT CAGGGCTGGA CGCCGGCCGT GCGCATCTCG CTCGGTTCGA CGACCGAAAG CGAGCTGCGG ACCGGGCTCG GCATCGTCGC CTCATTGGCG CAGGGAAATC CGCAAGAGCT GCTGCTCGCG ATCTGA
|
Protein sequence | MTKWRPDPSQ LRRPAYLSLA EQIANAITDG KLTDGTQLPP HRKLADDLHL SVQTVSRAYD ELIRRGLISG EIGRGSFVQT RPREPEPPYL PERLGEVIDL SILKPVCEQI HLERLRQAFG WLSENLPSSS ALSFRPNMVF PRHRAVATEW LARCGLEISP LNISVTNGAT SGMTVALMSV APPGSTVATE AISHHTLVPL STYLGLHLEG LAIDEEGMIP DALDEACRKG PIRAIFLQPS VINPMAALMS AERRQALATV AAKHDIAIIE NDILGPMVEN RAPPMAAFAP ERTLYVTSFT KITVPGLRIG YLTAPDRYVA AVANRHLVSN WMATPAMAEI ATRWVSDGTA MELVNWQRRA LLSRHAIAAE MLAGQPYRAH PQSLHVWLPL SGNHTEDGFV SQARLRGVAI APGKSFHTTD QGWTPAVRIS LGSTTESELR TGLGIVASLA QGNPQELLLA I
|
| |