Gene Information |
Locus tag | Rleg_7193 |
Symbol | |
ID | 8022899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 614467 |
End bp | 616392 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644834026 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_002985160 |
Protein GI | 241667076 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
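The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch reproduces. It assumes 1-based, inclusive coordinates, an unspliced CDS, and a gene span that includes the stop codon; the function names are hypothetical and do not belong to any database API.

```python
# Sketch only: helper names are hypothetical, not part of any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(614467, 616392)
print(length)                     # 1926
print(protein_length_aa(length))  # 641
```

Both results agree with the Gene Length (1926 bp) and Protein Length (641 aa) fields of this record.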
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0854307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.06484 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACGA ACAGGCTATG CCTGCTCGGA AGACCGCGAT TGCTGGCGGC GGGGAGGGAG CTCCCCTTGC CGGAGAAGTC GTATTTTCTC CTTGCCATGC TCGCCGCCGA ACCCAATGTC GAACTCGACC GCGAAACCGT CAGGCGGCAG CTCTGGCAAT CGGAGCTGCC GGAAAAGCGG GCCGGCAGCC TGCGCCAGCT GCTGGCGCGC ATCGAACTGA GCATTCCCGC CGGCCTCCCA CCGCTGCTTG CCGCGACCCG AACCCATATC GGCCTTGCCG ATGGCTGGGA GGTCGACGTT CATATCCTGA AACAGAAAGG GCCACTCGCA CCCGAAGACG GCGGCCTGCT GAACGGCGAA TTTCTCGACG GCGTCAAATC GCCGACGCAG GGCGCCGAGG ACTGGCTGAC CTTTGAGCGC CAGCGTGTCG ACGAATTGCG CTCCGCCCAT CTCACCCGGC TGATCGAGAC ATCGGAAGAC AGATCGGATG ACGAGCAGGT CACACTCGCC AGGCGCCTGC TGGAACTCGA CCCTGCCAGC GAGACGGCCT ATCGCGCCTT GATGCGCACC TATGTCCGGA TGAACGATCC GGCAGCGGCG CGTCAGGCCT ATCTGAAGTG CAAAAGCCAG CTGAAGGACG ACTTCGACAC CGAGCCGGAA GAAAGCACCA CCGCTCTTGC CCGCGAGCTC AACCTGGTGC CGGCGGCACA GGCGACGTCA GCCCAGCCGG CGCCTGATCT GTCCATCAAT TCGGTCGGCC AACCGCGCAT CATCATCCTG CCGCCGGAAA GCATCTTCAC CGATCCGCTG ATGGAGCGCG TCGGCAGGGC GCTTCTTGAA GACGTCACCA TCGGCCTCAG CCAGCAGCGG GGCTTCAAGG TGATCGCGGC GCATACCAGC CTGGAGATCC TCAGCCGTTC GATCGATCCG TCGCGTGCTG TGCCCGGTCC GCTCGACCTG CGCTTCGACT ATGCGGTCTA CGTCACCATC CAGGGCCGCG ACGAGGATGT TTATGCCACT TGCCGTCTGA CGCGGACGAC GACGTCGGAG GTGATCTGGG CCGTCGAACT GCCGCTGGTC ATGCAGAAGA TCAGCGAATC CTTCGCGCAT CTGACGCGGC GGATCGTCTC CTCGCTCGCC GACACGATCG AGCGTCACGA ACTGGCGATG CCGATCGGCG ACGCGCCGCC GTCAGCCTAC CGTCTCTACC TGGAAGGCAA GCGGCTGATC GCCCACACCG ACCTGCAGCA TCTGCGCCAG GCACGCAAAT GGTTCAAATC CTCGCTCAAT CGTTATGAAC ATTTCTCTGC CGCCCATGCC GGCATGTCGC GCGCGCTCGG CATGGAATGG CTGATCCGCG GCATGCGCGA CAAGGAACTG CTCGACGAGG CGAACGGCGC TGCCCGGCAG GCGCAGCAGT CCGACCCGAA CAGCGGCCGG GCTTACCGCG AGCTCGGCTT CGTGGCGCTT TATCGCCGTC GTTTCGACGA AAGCCTGGAA TATTTTCAGC AGGCCCAGGA TCTGAATCCC AACGATGCCG ACATCCTCGC CGACTATGCC GACGCGCTTT CCCATGATGG CGATTTCGAT CGAGCGCTGG AGCTCAGCCG CGCTGCCTTC AAACTCAATC CGCTGCCGCC GGACTATTAT TACTGGAACC TCGGCGGCAT CCACTTCATG CGCGAAGAGT ATGAAATGGC GATCGAGGCG CTGGAACCGG TGAAGACGAA GCAGGCGACA GCCCGCCTGC TCGCTGCCTC GCATGCCATG GCGGGCGAGA CGGGCAAGGC CAGGAACTAC 
GCCAGGGTGG TGCTGGAAAA CTTCCCCGAT TTCCGCAGTG AGGACATCCG TCATTTCGTC CCCGATCGCG ATCCCGCCTA TACGGAACCG CTCATACAGG GCCTGCAACT TGCCGGACTT CCCTGA
|
Protein sequence | MLTNRLCLLG RPRLLAAGRE LPLPEKSYFL LAMLAAEPNV ELDRETVRRQ LWQSELPEKR AGSLRQLLAR IELSIPAGLP PLLAATRTHI GLADGWEVDV HILKQKGPLA PEDGGLLNGE FLDGVKSPTQ GAEDWLTFER QRVDELRSAH LTRLIETSED RSDDEQVTLA RRLLELDPAS ETAYRALMRT YVRMNDPAAA RQAYLKCKSQ LKDDFDTEPE ESTTALAREL NLVPAAQATS AQPAPDLSIN SVGQPRIIIL PPESIFTDPL MERVGRALLE DVTIGLSQQR GFKVIAAHTS LEILSRSIDP SRAVPGPLDL RFDYAVYVTI QGRDEDVYAT CRLTRTTTSE VIWAVELPLV MQKISESFAH LTRRIVSSLA DTIERHELAM PIGDAPPSAY RLYLEGKRLI AHTDLQHLRQ ARKWFKSSLN RYEHFSAAHA GMSRALGMEW LIRGMRDKEL LDEANGAARQ AQQSDPNSGR AYRELGFVAL YRRRFDESLE YFQQAQDLNP NDADILADYA DALSHDGDFD RALELSRAAF KLNPLPPDYY YWNLGGIHFM REEYEMAIEA LEPVKTKQAT ARLLAASHAM AGETGKARNY ARVVLENFPD FRSEDIRHFV PDRDPAYTEP LIQGLQLAGL P
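The sequence fields are written in space-separated blocks of 10 characters. As a rough sketch of how the gene sequence relates to the protein sequence and the GC content field, the code below strips the block spacing, computes a GC fraction, and translates codons up to the first stop. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (the tables differ in permitted start codons); all function names are hypothetical.

```python
# Sketch only: function names are hypothetical, not part of any database API.
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard code assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
excerpt = "ATGCTGACGA ACAGGCTATG CCTGCTCGGA"
print(translate(excerpt))  # MLTNRLCLLG
```

The translated excerpt matches the first block of the protein sequence above. Applying `gc_fraction` to the full gene sequence gives a value that can be compared against the GC content field.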