Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3473 |
Symbol | |
ID | 8014344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3506715 |
End bp | 3507605 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644826037 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002977258 |
Protein GI | 241206162 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.368579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAC ATATCCCTAC CTATGAACTC TACGGCGAAA AGACCGGGCG AGAGCCGGAT TTTTGGGTGC ATTGCGAAAC TATTCGCTCC CGCAGCAGTT TGCATCAATG GGAGATTAGC CCGCATCGTC ACGAGAGTTT CTTTCAGATA TTGTACATCG AAAGCGGTTC GGGCGATGCG ATCTTCGGCG AAAAGAGCCA TGCCATCCTT CCGCCGGCGA TCATCACCGT GCCGCCCGGG CTCAATCACG GCTTTCGTTT TTCACGCGAT ATCGACGGCC TGGTGATCAC CCTGTTGAGA TCCCATCTCA GCCATCCGCC CGGCGATCGA AGCCAGCTCG GCGAATGGCT GGCGGGGCCG CATCTGACGC CGCTCGATCC CGATCATGCC GAGGCCGTCT ATGTGATGCA GACGTTGAAG CGGCTGGGCG ACGAATTCGA AAATCGCCGC AGCGGCCGCA ACGAGGCCTT GGCCGCCTAT GTCGCCCTGG CGCTGCGGCT GACGGCGAGG ATTTCCCATG AGGGGAATGC GCACGAACTT CCGCCCAACG AGAACGAGCG GCGGATGGAC ATGCTGAGCG AGCTCATTCA GCAACATTTC CGATCACACA AACCCGCGTC CTTCTACGCC AGGGAGCTTG GGCTTTCGCC GACGCATCTC ACCCGCATCG TCCGGACGAT GACCGGCAAC ACGCCGCATG AATTGATCGC CGGCAAACTC GTCGAAGAGG CGAAACGCCA ACTGGTTTTT ACACTGGGCA GCGTTCAGGA GATCGGATTT CGACTCGGCT TTGCCGACCC AGGCTATTTC TCGCGCTTCT TCGTTAAATA CACCGGAGAA ACGCCGCGGG TCTGGCGCAT GAAGGAAAAA GTCCGGCTCG AACGTGCATA G
|
Protein sequence | MSKHIPTYEL YGEKTGREPD FWVHCETIRS RSSLHQWEIS PHRHESFFQI LYIESGSGDA IFGEKSHAIL PPAIITVPPG LNHGFRFSRD IDGLVITLLR SHLSHPPGDR SQLGEWLAGP HLTPLDPDHA EAVYVMQTLK RLGDEFENRR SGRNEALAAY VALALRLTAR ISHEGNAHEL PPNENERRMD MLSELIQQHF RSHKPASFYA RELGLSPTHL TRIVRTMTGN TPHELIAGKL VEEAKRQLVF TLGSVQEIGF RLGFADPGYF SRFFVKYTGE TPRVWRMKEK VRLERA
|
| |