Gene Information |
Locus tag | Rleg_0844 |
Symbol | |
ID | 8012000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 845760 |
End bp | 846698 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644823431 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002974682 |
Protein GI | 241203586 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
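
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 845760
end_bp = 846698

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 939
print(protein_length)  # 312
```

Both results agree with the Gene Length (939 bp) and Protein Length (312 aa) fields.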
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCCAA ATCCTATGAA GGGTGATCGC CGCCTATCCA GGCGAGGGGA GATGGTCGCC TTGGCCGGAC GTCTCGCCCC CCGTCACGGA TATAATCCGA CCGCGCTCGC TCCCGTCCGC ATATTGCGCA CGGAAGCCGT GCTCCACGAC ATCCCGGTGC TCTATAGACC GGGCGCGGTT TTCGTCCTGC AGGGCAGCAA GCAGGGCATC CTCGAAGGCG AGGTCTTCCT CTATGACGAG GAGCACTATC TGGCGGTGTC GCTGCCCGTT CCATTCCGGA TGACGTCGAC GGCAAGTCCC GAGCGGCCAT TGCTTGCGGT CTATGTCGAG TTCGATATGC AGATGGCGGC CGAGATCGCA TTGCAGGTGG AAAAGCATGC CGAACTGCCA GGCGACGAGC CGAGAAGTCT CGTGTCGAGC AGGATGTCCG GTGATATCGA GGATGTCCTG CTGCGCCTGC TGACGGCACT TGGCAGCAGC GTCGAGACGG ATGTGCTTGG CGCCGGCATT CTGCGCGAAC TGCACTACCG CGTCCTGGTC GGTCCGCAAG GCGGTGCGAT GATCGCCGCC CTCCAGCAGA AGGGCAGATC CGGGAAAATC ATTCAGAGTC TGGCCTGGCT GCGGGAAAAC TATGGCCTGG AGATCGCGGT CACCGATCTG GCAAGGGAGG TGGGCATGAG CGTTCCCTCC TACCATGTCC ATTTCAAGGG TCTGACCGGC AACAGTCCGA TGCAATACGT CAAGGCCATG CGGCTTCACG AAGCGAGATT GATGATCGCG CGCCAGACGA GAACGATCGC TGATGTCGCG GCTTCGGTCG GCTACGCCAG CCCGGCGCAG TTCAGCCGCG ACTTCAAACG GCATTTCGGG CGCACGGCAT CGGAGGAGAT CAAGTGGGTC CAGCGCCATC TTGGCGAACT GGGTGACGAT CACGGCTAA
|
Protein sequence | MLPNPMKGDR RLSRRGEMVA LAGRLAPRHG YNPTALAPVR ILRTEAVLHD IPVLYRPGAV FVLQGSKQGI LEGEVFLYDE EHYLAVSLPV PFRMTSTASP ERPLLAVYVE FDMQMAAEIA LQVEKHAELP GDEPRSLVSS RMSGDIEDVL LRLLTALGSS VETDVLGAGI LRELHYRVLV GPQGGAMIAA LQQKGRSGKI IQSLAWLREN YGLEIAVTDL AREVGMSVPS YHVHFKGLTG NSPMQYVKAM RLHEARLMIA RQTRTIADVA ASVGYASPAQ FSRDFKRHFG RTASEEIKWV QRHLGELGDD HG
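
The protein sequence follows from the gene sequence under translation table 11. The sketch below is a minimal illustration, assuming that the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two tables differ in permitted start codons, not in these assignments). The helper name `translate` is hypothetical. For brevity it is applied only to the first ten and the last three codons of the gene sequence above.

```python
# Build the codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten.
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks (ten codons) of the gene sequence above.
print(translate("ATGTTGCCAA ATCCTATGAA GGGTGATCGC"))  # MLPNPMKGDR

# Last three codons of the gene sequence above; TAA is the stop codon.
print(translate("CACGGCTAA"))  # HG
```

The two outputs match the first block and the final two residues of the protein sequence listed above.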