Gene Information |
Locus tag | Rleg_3207 |
Symbol | |
ID | 8014101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3209393 |
End bp | 3211240 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825768 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_002976995 |
Protein GI | 241205899 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
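The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3209393
end_bp = 3211240
stop_codon_nt = 3  # assumption: one stop codon that is not counted in the protein

# Inclusive coordinates: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# Every remaining triplet encodes one amino acid.
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length)     # 1848
print(protein_length)  # 615
```

Both results agree with the Gene Length (1848 bp) and Protein Length (615 aa) fields.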
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTCATC TGCAGACATT TGGCGATCTG CGGTTGATCG AGACGAATGG CGACCAGGTT CGCTATCCGA TCAAGGGCCT GTTGATGATG GCCCATCTCT ATGCCGGCGC GAGCCATGAG CTCAGCCGGT ACGAATTGGC CCAGTTTCTG TGGAATGATG TCGAACCAGA GCTGGCGCGG CTCAATCTGC GCAAACTGCT TTCCCGCATC CGCGAGACAG ACGGCGGGCG CGCCGAATTG CCCTTCGATT TCACCGCCAC CACGGTTCGT CTCAACACGC AGGCAGTTTC CAGCGATCTG GATATTTTTC GCGCCGGCGG TTCGCCGCTG GAGCGGCTGA ACGCAATCGC CGAATTAACC CAGCGCGGGT TCATCGGAAA TATCAAACCA GCGACAAAGC TGATAGACGC CTGGATCAGG GCCCAACGGG ACGCTCAGGC GCTGCAGTTG CGGCAGGCAC TGCTGGACGC GTTGCCGGAT GCGCAAAAGC CCGGCGCTAC GCGCTCCATT TCCGGCGCCG CTCTGCAGAT CCTCGAACGG GATCCGAATG ACGAGCAGGT CAGAGCCTTA CTGCACCGGT TATCCGGCGG TTCGTCCCTC AGCGAAAGAT TGCCGAACGA TGACGGCCAT GCCAGGGTAC AGGTGAAGCG TTCCGAGGCC GGCGGTGAGG CAACTGCCGA CATCGAACGC ATGTCGCCAG CGCCGCTGAT CCTGCCTCGA CTGGTGCTGC TTCCCCCGAC ATCGAAACAT GCCGATGCCG GGCTTGCGCT TGCCAATGCC CTGATCGAAG ACGTCACGAT CGAACTCTGC GCGCTCAGAA ACATTTCCAT CGTAGCGCCC CATACGGCCG GCCAGATCCG CCGCGATTCC GAGAAGGCTG CTGTGGTCGC CCGTCACTCG ATCGCCTATC TTCTGGATAC ACGGCTTTCC GAAGAGGGAT TGTTCGCCCA GCTGGTTTAT TTTCCCACGG ACGAGATCAT CTGGGCCAAT CGCTTTACGA TGACGCCCGA TATATTGCCG CGTCAGAGAC GGCTGATTGC GCAGCAGTTG ACTATGTCGG TGGCCCGCGA GCTGGCTGAA AACGAGGAAG AGCGCTTACG CTTCGAAGCC AATCCTGAGG CCTATCACGC CTATCTGGTC GGCTCGAGCC TGATGAGCAA ATTGACATTG CCGCATATCC GCCGGGCACG AAAAGCTTTC AAGCAGTCGC TGTCGCACAA GGCGGATTTC TCTCCGTCAT TTACCGGGCT GGCGAGAACC TTCACCAGCG AGTGGCTCGT GACGGCGCAG GGTAACAATG AGCTGCTGCA TCTGGCGGAG CAGAACGCGC TTCGGGCCAT CGAGCGGGAT CCGGCCTCGG CGGCGGGCCA TCGCGAGCTT GGCGTCACCA AACTCTATCT CGGCGATGTC GATGCCAGTG TCGCCGCACT TGATCTGGCG GAACAACTCA GCCCGCATTT TGCCGATGTC ATCTACAGCC ATGCCGACAC GCTCGTGCAT GCATCCCGCC CTGGCGACGC GCTCGCCAAG ATCAGAAGAG CGATTTCGCT CAATCCGATC GCGCCTGACG CCTACCTCTG GTGCGCTGCG GGAGCGAGCT TCTTCCTGGA ACAGTACGAG GAGGCCATCG CCTATGTCGA GGCGATGAAA GACAAGGCGC CGGCCCATCG CATCGCCGCG GCCAGTTGCG CAATGATCGG CGATCGAAAA CGGGCGCTGT TCCATCGGCA ACGAGCCGAA AGCATCAACC CGGTCTTCGA CGTCGAGAAG TGGCTCACCA TCGTTCCCTT CAAGGAAGAT 
TGGCAAAAAG AGCTTTATCG GGAGGGCCTG CTGAAGGCCG GTTTTTAA
|
Protein sequence | MLHLQTFGDL RLIETNGDQV RYPIKGLLMM AHLYAGASHE LSRYELAQFL WNDVEPELAR LNLRKLLSRI RETDGGRAEL PFDFTATTVR LNTQAVSSDL DIFRAGGSPL ERLNAIAELT QRGFIGNIKP ATKLIDAWIR AQRDAQALQL RQALLDALPD AQKPGATRSI SGAALQILER DPNDEQVRAL LHRLSGGSSL SERLPNDDGH ARVQVKRSEA GGEATADIER MSPAPLILPR LVLLPPTSKH ADAGLALANA LIEDVTIELC ALRNISIVAP HTAGQIRRDS EKAAVVARHS IAYLLDTRLS EEGLFAQLVY FPTDEIIWAN RFTMTPDILP RQRRLIAQQL TMSVARELAE NEEERLRFEA NPEAYHAYLV GSSLMSKLTL PHIRRARKAF KQSLSHKADF SPSFTGLART FTSEWLVTAQ GNNELLHLAE QNALRAIERD PASAAGHREL GVTKLYLGDV DASVAALDLA EQLSPHFADV IYSHADTLVH ASRPGDALAK IRRAISLNPI APDAYLWCAA GASFFLEQYE EAIAYVEAMK DKAPAHRIAA ASCAMIGDRK RALFHRQRAE SINPVFDVEK WLTIVPFKED WQKELYREGL LKAGF
|
| |
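The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (the record gives the strand as "-", but the sequence begins with GTG and ends with TAA), that translation table 11 uses the standard amino acid assignments, and that an alternative start codon such as GTG is read as methionine at the first position. The function name `translate` and the constants are hypothetical helpers, not part of the record.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # A recognised start codon at position 0 is translated as methionine.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence in this record.
print(translate("GTGCTTCATC TGCAGACATT TGGCGATCTG"))  # MLHLQTFGDL
```

The output matches the first ten residues of the protein sequence listed above; passing the complete gene sequence to the same function should reproduce the full protein under the stated assumptions.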