Gene Information |
Locus tag | Rleg_5023 |
Symbol | |
ID | 8007614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 408908 |
End bp | 410779 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821938 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_002973198 |
Protein GI | 241113363 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
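The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(408908, 410779)
print(gene_len, protein_len)  # 1872 623
```

Under these assumptions the result agrees with the Gene Length (1872 bp) and Protein Length (623 aa) fields listed above.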
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.494336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.650955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGACT GGCCCCTGGA TGCCAGGTTT TCCGGCCGCC AAACGAGCGT ATCGGAGGCA GCGGGGGCGG GGCCGGTGGA TCGCATGAAC AAGGAGCAAT ATCCGTTTCG GATGTTCCTG CTCGGGCCCT TTGCCCTTGT GGACGCCGGG GGGCGGTCGG TTGCTCCGAA ATCCAAAAAG GCACAGGCTC TTTTGGCAAT GCTTGCATTG TCCACCCGGG GCTCGCGCTC GAGAATCTGG CTTAGGGACA AGTTATGGAG CGATCGCTCC GACGACCAGG CGGCAGCCAG TCTACGCCAG GCGCTTTTGG ACATTCATAA GAGTCTGGGG CCGGCACGTG ATCTCTTGAT TGCGGATAAG AATACCGTTT GGCTGGATAT GGACCGACTG GCGCTCGATA CCGACCTGGT GGTTCGGACG GAGCGGTCTG CGGATCAAGT CACCGACGAA TTGCTCGAAG GTATCGACAT CCGCGATCCT GAATTCGAGG ACTGGTTGGC GCTGGAAAGG CAAAACTGGT ATCGCCGTCT CGATGAAGGA CAAGTCCACG ACGTCTTCGA GCCGCGACAG CAGCCGAGCC GCGATATCGC CAAACATTCC GCCCTGCTGC CGTTAACTGG CGCCCCGGAT ACGTCGAGAA CAGGCAAACC ACCCGTGGAT ATCGCCAGCA GCGGTCCCTC TGGGCGGCGG GTTGGCGGTG ACTGGCGATG GATGATGGCT CTTCTGTCCC CCATCGTGGT GGGTGCCGGC GAGGGCGGGC AAATTGCTGC GACACGGTTC CAAAACCTCA TTGCAAAAGC CATCATCGAT GGGCTGGGCT TTGGCGTCAC CGACCTCTCG TTCACCTCGC CGCATATTGA AGAGAGTGAA CAGCAAATCA GCCTTCCCCT ATGCCTTCAG CTTCGCCTGA CGTTTGATGG TGACATGGTG CTGATCGAAC TGGTGATGAA GCACCTGATC AACAACCGCA TTCATTGGCT GGGAAGTCAG GCAATCAACC GCACGCAGTT CGAGCGCGGC GAGTTCGGCA TCGCCGCTGC GCTGATCAGC CAGGCAGTCG ATCAACTGGC CTATTTCGAG GAGATCCAGG CAACCGACAG CAGATTGTCG CAAGACGGTC TCCTGATCGA CGCCGTCAAT GCGATCTTTC GGCTGTCGCG CGACGACCTC GACAACGCGG AACGGCGCCT GGAAGAACAG ATCCAGTATC AGCCGCGATC ATCGACTTTT GCCTGGCTGT CATTCATTCG GACTTTCCAG GTCGGCCAGC GTTTCAACGC GCTGGATGCC CATCTGATCG AGGAAGCCCA GGCCTATGCA CGCAAGGCGC TGGAACTCGA TCCGCAGAAT TCCGTGTCGC TCGCGCTCGT CGGCCACGTC CATTCGTTCC TGTTCGGCGA ATACGACTAT GCGGCCAACC TGTTCGAAAA ATCGATCCGC CTGAATCCGG CCCTGCCGCT CGGTTGGGAC CTCTACGCGA TGCTGCACTG CTATGCAGGC CAGCCCGACA AGGCGGTGGC GATGGCGCGT TGGGTACAAG AGCTCGGCGT CTACAGCCCG CATAAATATT ACTTCGATAC GACCAAATGC ATTGCGGCAG CGCTTGCAGG CGATCATGCC GCAGCCATAT CTGCAGGCGA AGAAGCCCTG CGGGCACGGC CGAACTTCAA CAGCCTGCTG CGCTATCTTG CCTCCAGTCA TGCCCATTCC AACGATCTCG GCGGCGCGCG GCATTACCTG CAGCGTCTTG AGGCAGTCGA GGGCGGTTTC TCCATCGACG CCTTCCGCGG CAGCGGCTAT 
CCGCTGCTTG ACACAGGCGG CGGGCAGATC CTGATCGACG GCCTGCTCAA GGCCGGCGCC AAGCTGCGCT GA
|
Protein sequence | MADWPLDARF SGRQTSVSEA AGAGPVDRMN KEQYPFRMFL LGPFALVDAG GRSVAPKSKK AQALLAMLAL STRGSRSRIW LRDKLWSDRS DDQAAASLRQ ALLDIHKSLG PARDLLIADK NTVWLDMDRL ALDTDLVVRT ERSADQVTDE LLEGIDIRDP EFEDWLALER QNWYRRLDEG QVHDVFEPRQ QPSRDIAKHS ALLPLTGAPD TSRTGKPPVD IASSGPSGRR VGGDWRWMMA LLSPIVVGAG EGGQIAATRF QNLIAKAIID GLGFGVTDLS FTSPHIEESE QQISLPLCLQ LRLTFDGDMV LIELVMKHLI NNRIHWLGSQ AINRTQFERG EFGIAAALIS QAVDQLAYFE EIQATDSRLS QDGLLIDAVN AIFRLSRDDL DNAERRLEEQ IQYQPRSSTF AWLSFIRTFQ VGQRFNALDA HLIEEAQAYA RKALELDPQN SVSLALVGHV HSFLFGEYDY AANLFEKSIR LNPALPLGWD LYAMLHCYAG QPDKAVAMAR WVQELGVYSP HKYYFDTTKC IAAALAGDHA AAISAGEEAL RARPNFNSLL RYLASSHAHS NDLGGARHYL QRLEAVEGGF SIDAFRGSGY PLLDTGGGQI LIDGLLKAGA KLR
|
| |
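The relationship between the gene sequence, the protein sequence, the translation table and the GC content can be illustrated with a short script. This is a minimal sketch, not the pipeline used to build this record. It assumes the gene sequence is already given on the coding strand (it begins with GTG and ends with TGA, even though the gene lies on the minus strand), that translation table 11 shares the amino acid assignments of the standard code, and that the listed initiator codons are read as methionine at the first position. All names (`CODON_TABLE`, `START_CODONS`, `clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 is assumed
# to use the same assignments and to differ only in its initiator codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiator codons for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon codes for Met at the first position
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence in this record.
head = "GTGGCAGACT GGCCCCTGGA TGCCAGGTTT"
tail = "AAGCTGCGCT GA"
print(translate(head))  # MADWPLDARF
print(translate(tail))  # KLR
```

The two fragments reproduce the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.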