Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6036 |
Symbol | |
ID | 6977422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 464859 |
End bp | 466811 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393488 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_002278306 |
Protein GI | 209546416 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0951775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.443759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGGA CATCCGGACA GATGTCGAAG AACAGGCTCT GCCTGCTCGG AAGACCGCGG CTGCTGGCAG CGGGGAGGGA ACTTCCCTTG CCGGAGAAGT CTTATTTTCT CCTCGCCATG CTGACCGCCG AAGCCAATCT CGAACTCGAC CGCGAAACCG TCAGGCGGCA GCTCTGGCAA TCGGAACTGC CGGAAAAGCG TGCGGGCAGC CTGCGCCAGC TGCTGTCGCG TATCGAGCAG AGCATTCCCG CCGATCTTCC ACCGCTGCTT GCCGCGACCC GAACCCATAT CGCGCTTGCG GATGGCTGGG AGGTCGATGT TCATATCCTG AAACAGAAAG GGCCGCTCGC CCCCGAAGAC AGCGATATGC TGAACGGCGA ACTGCTGGAA GGCGCCAAAT CGCCGACGCA GGGCGCCGAG GACTGGCTGA CCTTTGAGCG CCAGCGTGTC GACGAATTGC GCTCGGCCCA TCTCACCCGG CTGATCGAAA CATCGGAAGA CGGATCGGAT GACGAGCAGG TGGCGTTTGC CCGGCGCCTG CTGGAACTCG ACCCTGCCAG CGAGACGGCC TATCGGGCCC TGATGCGCAC CTATGTCAGG CTCAACGATG CGGCCGCGGC CCGTCAGGCC TATCTGAAGT GCAAGAGCCA GCTGAAGGAC GACTTCGATA CCGAGCCGGA GGAAAGCACC ACCGCGCTTG CCCGCGAACT CGGCCTGATC CCGGCAGCGC AGGCAGCCGC GCCGGAGCGC CCGCCTGCAT CGGGCATGTT CGCCAATCTG CTCGGCCAGC CGCGCATCAT CATCCTGCCG CCCGAAAGCA TCTTCACCGA TCCGCTGATG GAGCGTGTCG GCCGGGCGCT GCTCGAAGAC GTCACCATCG GCCTCAGCCA GCAGCGCGGC TTCAAGGTGA TCGCCGCGCA TACGAGCCTC GAAATCCTCA GCCGCTCGAT CGATCCGGCG CGGGCCGTGC CCGGCCCGCT CGACCTCAGC TTCGATTATG CGGTCTACGT CACCATCCAG GGCCGCGACG AGGATGTCTT TGCCACCTGC CGGCTGACGC GGACGACGAC GTCGGAGGTG ATCTGGGCGC TGGAACTGCC GCTGGTGATG CAAAAGATCA GCGAGTCCTT CGCGCATCTG ACGCGGCGGA TCGTCTCCTC GCTCGCCGAC ACGATCGAGC GCCACGAACT GGCAATGCCG ATCGGCGATG CGCCGGCCTC CGCCTATCGC CTTTATCTCG AAGGCAAGCG GCTGATCGCC CAGACCGACC TGCAGCATCT GCGCCAGGCG CGCAAATGGT TCAAATCTTC GCTCAATCGT TACGAGCATT TCTCGGCCGC CCATGCCGGC GTGTCGCGGG CGCTCGGCAT GGAATGGCTG ATCCGCGGCA TGCGCGACAA GGAACTGCTC GACGAGGCGA ATGGTGCCGC CCGGCAGGCG CAGCAGTCCG ACCCGAACAG CGGCCGGGCC TATCGCGAGC TCGGTTTTGT GGCGCTTTAT CGCCGCCGCT TCGACGAAAG CCTGGAATAT TTCCAGCAGG CCCAGGATCT CAATCCCAAC GATGCCGACA TCCTCGCCGA TTTCGCCGAC GCGCTTTCCC ATGACGGCGA TTTCGATCGG GCGCTGGAGC TCAGCCGCGC GGCCTTCAAA CTCAATCCAC TGCCGCCGGA TTATTATTAC TGGAACCTCG GCGGCATCCA CTTCATGCGC GAAGAGTACG AAAAGGCGAT CGACGCGCTG GAACCGGTGA AGACCAAACA GGCGACGGCG CGTCTGCTTG CCGCCTCGCA TGCGATGGCG GGCGAGACCG GCAAGGCTCA GAACTATGCC CGGACGGTGC TGGAAAACTT CCCCGATTTC CGCAGCGAGG ACATTCGTCA TTTCGTCCCC GATCGCGATC CCGCCTTCAC AGAACCGCTG ATAAAAGGCC TGCAACTCGC CGGTCTTCCC TGA
|
Protein sequence | MQRTSGQMSK NRLCLLGRPR LLAAGRELPL PEKSYFLLAM LTAEANLELD RETVRRQLWQ SELPEKRAGS LRQLLSRIEQ SIPADLPPLL AATRTHIALA DGWEVDVHIL KQKGPLAPED SDMLNGELLE GAKSPTQGAE DWLTFERQRV DELRSAHLTR LIETSEDGSD DEQVAFARRL LELDPASETA YRALMRTYVR LNDAAAARQA YLKCKSQLKD DFDTEPEEST TALARELGLI PAAQAAAPER PPASGMFANL LGQPRIIILP PESIFTDPLM ERVGRALLED VTIGLSQQRG FKVIAAHTSL EILSRSIDPA RAVPGPLDLS FDYAVYVTIQ GRDEDVFATC RLTRTTTSEV IWALELPLVM QKISESFAHL TRRIVSSLAD TIERHELAMP IGDAPASAYR LYLEGKRLIA QTDLQHLRQA RKWFKSSLNR YEHFSAAHAG VSRALGMEWL IRGMRDKELL DEANGAARQA QQSDPNSGRA YRELGFVALY RRRFDESLEY FQQAQDLNPN DADILADFAD ALSHDGDFDR ALELSRAAFK LNPLPPDYYY WNLGGIHFMR EEYEKAIDAL EPVKTKQATA RLLAASHAMA GETGKAQNYA RTVLENFPDF RSEDIRHFVP DRDPAFTEPL IKGLQLAGLP
|
| |