Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1052 |
Symbol | |
ID | 8012181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1027627 |
End bp | 1028571 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644823635 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002974886 |
Protein GI | 241203790 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.2685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.950103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCGT TATCCGATGT TCTCGCATTG CTCAAACCGC GCAGCTATGT TTCCGCGGGG CTTGACGCTG GCGGTGCTTG GGCGATCGAT TTTCCTCCCC CTGACGGCAT CAAGTTCAAC GCAGTGATTT CAGGCGCGTG CTGGCTGAGC GTCGATGGCG TCCCCGAAGC TGTCCGCCTG GAGGAAAGCG ACTGCTTTCT GCTGACGAGC CGCCGAGCCT TTCGTCTCGC CAGCGATCCG GCTCTCGAAG CGATCCCGTC CGATGCGATC TATTCGATCG CCCGCGACGG CATTGCGACT TGCAATGGCG GCGGCGATTT CTTCCTGATC GGCAGCCGTT TTTCCTTTTC GGGAGGAAAT TCGGACATCC TTCTCGGAAT CCTGCCGCCG ATCGTCCACG TGAAGAGGGA TTCCGATCAC GCCGCCGTGC TGCGCTGGTG TCTCGATCGG ATGACGCGCG AATTGCGCGA CCAGCAGCCG GGTGGCTTTC TGATGGCGGA GCATTTTGCT CATGTTATGC TCATGCAGGT GTTGCGCCTC CATATCGCAT CGCCGAATGC GCGCGGCGTC GGCTGGCTTT TCGCGCTTAC CGACCGGCGG ATCGGTGCGG CTATCGGTGC CCTGCATGCC GATCCAGCCC GCAAGTGGAC GCTGCAGTCA CTGGCTGAAC GTGCCTCGAT GTCGCGGTCC AGTTTTGCTC TCCACTTCAA GGAAAAGGTC GGGCTTGCGC CGATGGATTA TCTGACGCGC TGGCGCATGC TTCTCGCCGG TGACCGATTG ACAAACTCGG CCGAAGCAAT TGCCGGTGTC GCCCTGTCGC TCGGCTATGA ATCCGAAAGC GCATTCAGCA CCGCTTTCAA GAGAGTGATG GGATGCTCGC CGCGGCAATA TGGTCGCGCT CATCCGCCCG CCGGTGCCAT GCGAGATAGT CTCGGCGCTC AGTGA
|
Protein sequence | MDPLSDVLAL LKPRSYVSAG LDAGGAWAID FPPPDGIKFN AVISGACWLS VDGVPEAVRL EESDCFLLTS RRAFRLASDP ALEAIPSDAI YSIARDGIAT CNGGGDFFLI GSRFSFSGGN SDILLGILPP IVHVKRDSDH AAVLRWCLDR MTRELRDQQP GGFLMAEHFA HVMLMQVLRL HIASPNARGV GWLFALTDRR IGAAIGALHA DPARKWTLQS LAERASMSRS SFALHFKEKV GLAPMDYLTR WRMLLAGDRL TNSAEAIAGV ALSLGYESES AFSTAFKRVM GCSPRQYGRA HPPAGAMRDS LGAQ
|
| |