Gene Information |
Locus tag | Rleg_0210 |
Symbol | |
ID | 8011437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 221486 |
End bp | 222634 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822803 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002974060 |
Protein GI | 241202964 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
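The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (this matches the 1149 bp stated in the record) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record (Start bp, End bp).
length = gene_length_bp(221486, 222634)
print(length)                              # 1149
print(expected_protein_length_aa(length))  # 382
```

Both results agree with the Gene Length and Protein Length fields of the record.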
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.278356 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.347271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTACAG GGTCTGAATC CGACGATGAT CCTGCCGCGG AACAGGAACG GCCGGCGCAT ATGGAGCGGC GGCGCTGGCC GCGTGAACCG GCCATGCGCA AGGAACGCCC GCCCCATGCC TCCCCTGCAC TCGTGCCGCT GCGATTCTCG ACGCAGGACC TACCACCGGC AGAACAATTC CAGGCCTGGC GGGCACATAT GGCGCCGCTC GTGGATGTTC ATCTGCCGGA GGGAAAATCA CCGGAAGACG GGTTCCTCGC GGAGCAGATC GGCTGGCATC TCGGCGATAT CCTGATCGTC CAGCAGCGCG CGCATGCCCA CAGATATGTC CGCGATCAGG CCATGCTTCG ATCGAGCCCT ATCGACCATT GGAACGTCGG CCTGCAGCGC AGCGGCCAGG CCTGGACCGA GGTCAACCGT CGTGTCACCG AGACCGGTGC CGGCGAGATA TTTTTCATGT CGCTCGGCAG CCCCTATCGC GGACGGACGA CCGATACCGA AGCTTTGCTC GTGTTCCTGC CGTATGAGAT GCTGGCTCGC GATGCGAGCC TTCTCCAGAG CGCCGGCAAC ACGGTTCTTT CGGGCAGCCA CGCCGAGTTG CTCACGGGCT ATCTCACGGG CCTCGAAACG AACCTCGGGA ATCTGACGAT AGAGGAAGTG CCCCGGATCA TTCAAACCAT CGGCGATATG GTCGTTGCGG GCGCCGCATC GTCCACAAGG ACTGATACCG GTCAAAACCA GGCCAACATG GGACTGATGG AGCGGGCGCA CCGCTACATT CACGTCAATC TCCATTCGGA AAACTTGACA CCGGACGTGA TGTGCCGCGC GTTGGGGATT TCGCGAACGC GGCTTTATCA ACTGTTTGAG GCGAGCGGAG GCGTGCTCAA TTATATTCGC AAGCGGCGAT TGCTGCAGGC CTATGCGGAT CTCAGCAATT CGGCCGACCA CAGGCCGATT TCAGAAATCG CAGAAGCCGC CGGTTTCGAG GTCGCCGCCA ATTTTACGCG CGCCTTCATC CACGAATTCG GGCTGAGCCC GCGTGAAATC CGAAGGACGA TGGCAACCCA ACAGCGGCCG GCTCCGGCCA TCCGCTCCAC GCGGCGTTAC GAAAAAACGA TCGGTGATTG GCTGGCCCTG ACGGGTTGA
Protein sequence | MATGSESDDD PAAEQERPAH MERRRWPREP AMRKERPPHA SPALVPLRFS TQDLPPAEQF QAWRAHMAPL VDVHLPEGKS PEDGFLAEQI GWHLGDILIV QQRAHAHRYV RDQAMLRSSP IDHWNVGLQR SGQAWTEVNR RVTETGAGEI FFMSLGSPYR GRTTDTEALL VFLPYEMLAR DASLLQSAGN TVLSGSHAEL LTGYLTGLET NLGNLTIEEV PRIIQTIGDM VVAGAASSTR TDTGQNQANM GLMERAHRYI HVNLHSENLT PDVMCRALGI SRTRLYQLFE ASGGVLNYIR KRRLLQAYAD LSNSADHRPI SEIAEAAGFE VAANFTRAFI HEFGLSPREI RRTMATQQRP APAIRSTRRY EKTIGDWLAL TG
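The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the sequences are pasted in the space-separated 10-character blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in which codons may act as starts). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG order.
# Table 11 (bacterial) uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCTACAG GGTCTGAATC CGACGATGAT"))  # MATGSESDDD
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields of the record.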