Gene Information |
Locus tag | Rleg2_4832 |
Symbol | |
ID | 6977926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 475671 |
End bp | 476690 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393993 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002278811 |
Protein GI | 209546893 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
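The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 475671
end_bp = 476690
gene_length_bp = 1020
protein_length_aa = 339


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the span ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

A check like this can flag records where the coordinates, gene length, and protein length disagree, for example when a gene is partial or annotated as a pseudogene.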
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0998154 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGTC GTTCCCTTAT GCCCGTCAAT GGCAGCTCCG TCCACGATCC GGCGAGGCAG AGCCCGCCGT TTTGGGAGAC GGCGTTCAGC GGCACGGATC CGGACGCACT CTCGGAAATA TTGTCGACGC CGAACTCTCC GATCAAGGCT GAGGCGAAAG CCGACGCACC CATCGCCTAC CGCTGCAATT TCGTCGCGAC GGAAGAGCTG GCCATTGCCG ACTGCGCCTA CGAAGGCACG ATCTCGATCC GGCGGGAGGC GCCCAGTGGC AAGGTGATCA TATTCCTGCC GATGGAGGGG GACGCCCTCT TCGATGCGGG CAAGGAGCAT ATCCATTCGG TTCCCGGCCG CGGCGCCATT CTCGGGGCGG GCCGCGTCTC GGGTGCCCGC CTGCTCGGCC CGCGCCGTCA TCTCGGTCTG TTCATCGATC AAGCCAGGAT CAACAGGCAC CTCACGCAGA TGTTCGAGCG AACGATTATC GGCGACACGG ATTTTCGTCC CTCTATCGAT CTGACGACCG GGCCGGGGCT CGTGTTGCAG CAACTCGCCG CGAACCTCCA TTACGGGCTC AGCGGCGACG GTCCGCTGCT GCAGTCGCCG CTGGCTCTGA GCGCGCTCTG CGATGCGACG ATCTATCTGC TTCTGGAGAC CTGTCCCCAT CGCTATTCGG AGGCGCTTGC GCTTCCCGCT CCTGCCCCGG CTCCTCGCCA TGTAAAATGG GCGATCGAAT TCATGCAGGA ATATATTGCC GAGCCGATCT CGCTCAACGA CATCGCGACC GCAGCCAAGG TCAGCGTCCG CACCTTGCAA CAGGGTTTCC GGCAGTTCAG AGATACCACG CCGATGGCCT ACCTGCATGA ACTCCGGATG CTTGCCGCCC ACCGGGATCT GCTCGAATCC GGCACGCGAC AAGCCGTCGC CGACGTCGCG GTCAGATGGG GATTTACCCA TCTCGGGCGA TTTTCAGCAG AGTATCGGAA GCGTTTCGGT CAACTGCCGT CACAGGCCCT GAAGCGCTGA
|
Protein sequence | MSGRSLMPVN GSSVHDPARQ SPPFWETAFS GTDPDALSEI LSTPNSPIKA EAKADAPIAY RCNFVATEEL AIADCAYEGT ISIRREAPSG KVIIFLPMEG DALFDAGKEH IHSVPGRGAI LGAGRVSGAR LLGPRRHLGL FIDQARINRH LTQMFERTII GDTDFRPSID LTTGPGLVLQ QLAANLHYGL SGDGPLLQSP LALSALCDAT IYLLLETCPH RYSEALALPA PAPAPRHVKW AIEFMQEYIA EPISLNDIAT AAKVSVRTLQ QGFRQFRDTT PMAYLHELRM LAAHRDLLES GTRQAVADVA VRWGFTHLGR FSAEYRKRFG QLPSQALKR
|
| |
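The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is an illustrative sketch, not the database's own pipeline. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it allows, so the sketch uses the standard assignments and does not handle alternative start codons (this gene begins with ATG, so that does not matter here). The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers, and only the first 30 bases of the gene are embedded to keep the example short.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a + strand coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence and the first protein block.
gene_prefix = "ATGTCCGGTC GTTCCCTTAT GCCCGTCAAT"
protein_prefix = "MSGRSLMPVN"

assert translate(gene_prefix) == protein_prefix
```

Passing the full gene sequence to `translate` and `gc_fraction` in place of the excerpt would allow the listed protein sequence and GC content to be compared against the record in the same way.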