Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6503 |
Symbol | |
ID | 6983573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 168513 |
End bp | 170462 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643399499 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_002284255 |
Protein GI | 209552340 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCA GGCTCGAATC ATTTGGTGAG TTGCGGCTGA TCGACCCCGC CGGGCGGCCG GCGGCGTTCC CGGAAAAAGG CCTGCTGGCG ATTTGTTATC TGCGGGACCG TGATTTGCGC GACGGAGCCG GGAGCGAATA TCCGCGCTCG GCGCTGGCGC GGTTCTTGTG GGACAGCCAC GACAATTCTG ACATCATGGC CAACCTGCGC AAAACCATAT CGCGCATTCA GAAGCGTCAG ACCGAACTCG GTGCCGAGCT TCTGGCCTTC ACGGCGACCG GCGTGCGGGT CAGGTCGGAT GCATTTTCAG GTGATCTCTT CGAACTTGCC GACCCCGAAG GCGACGACGC GCTTGGCAGG CTGCGGCGGC TGGTTGCCAT CTTTCGCCAG GATTTCCTGG CGGGGCTCGC CGACCAAAGC GCCACTACGC GGCAGTGGAT CGCCGGCCAG CGTGACCGGC ACATGGCAGT TCTGGTGGAT GCGCTGAAGA CGGCGCTGCC CGCTGCCCGG TCGCGAGAGG ATGTCAGCCT GATCAAGGAG GCGGCGCTCA GGATTTTCCG CGAGGCACCG GATGACGAGA ACATTCGTCG CATCCTTCTG GAGGCCTATG AGTCCGAAGG TCAGCTGGAG AAAGCGCGCC GCCTTTTCGA GACGAGCAGG CATCAGCTGG AAAGCGCAGT CGATATCGGC CTTGACGTGC AGGCGCTGGG AGAGGTCCGC AAGATCTTCG CCGGTAGTCG ACCACCGCAG AGCGCTGCGG TACCGCTCAT CGATGGCGGC GTTCGAGTTC CCGGTCCCCT CCCCCGTCTG GTGCTGCTGC CGCCGAGCGG GACGGATGCC ATCGGTGCCT TGCCGATGCT CTCCGAGGCG CTGATCGAGG ACGTGACCAT CGGCCTTTGC GCGCTGAATA CGGTTTCGGT GGTCGCACCC CATACGGCCG CTCGCATCGC ACACAATGCC GACAAGGCCC AGCTGATCCT TCGCCATTCG ATTTCCTATG TTCTCGACAC GAGGCTGACC AATCGTGCGG GCAGCCCTGC GCTCTTTGTC CAGCTGATTT ATGCCGGCAG CGATGAAGTC ATCTGGGCCG AGCGTTTCAG CCTCGAGAAA TACGAATTGA TAAGCCACCG GCGCGACATC GCGCGGCAGA TTGCCAAAGA GCTTGCCGGG CAGGTCCGCC GGCATGAAAC CATGCGCGAT GCCTTCGAGG GCAATTCGGC CGCCTATCAT AGCTACCTTC TCGGTCTGCG GGATATCAAG CGGCTCGCCC TGCCCGACGT GCGCCGCTCC CGGAAGGCTT TCCGCGAAGC GCTTCAGCAC AGCGCACATT TTGCCCCTGC GCTCAGCGGA TTGTCGCGCA CGTTTCTTGT CGAATGGTTG CTGACGGCAC GCGGCGACAG CGAGTTGCTC GGGCTCGCGG AAGACTATGC GAACCGCGCG ATCGTTGCCG ATCCGTCGTT TGCGGCCGGC TTTCGTGAAC TCGGCGTTGC CAAACTTTAT CTCGGGGAGC TTGATGAAAG CGTCGTGGCG CTGAAGCTCG CGGAAGAGCT GAGCCCGCAT TATGCAGACG GCATTGCAAG TTATGCCGAT ACGCTCGTCC ATGCGTCGCG CCCTGCCGAT GCATTGGCCA AGATCGAGCG GGCCATCTCG CTCAATCCGT TGAGCCCGAC CGACTATCTA TGGACGGCGG CCGGCGCGAA TTTCGCCCTT GGCCATTATG CTGAGGCGCT CGAGCAGATT TCTATCATGG ACGACCGTAC ACCGGCCGAC CGGCTTTCGG CTGCCTGCTG GGCGATGCTC GGCGATATGA AAAACGCGCG TATCTATATG CGCAAGGTGC GCGAAATCTA TCCGGATTTC GATGTCGACA AATGGCTTTC GGTCGTTCCA TTCAAGGAGC AATGGCAGAA GGAACAGTAT CGCGAGGCAT TACGTAGGGC CGGCTTCTGA
|
Protein sequence | MKLRLESFGE LRLIDPAGRP AAFPEKGLLA ICYLRDRDLR DGAGSEYPRS ALARFLWDSH DNSDIMANLR KTISRIQKRQ TELGAELLAF TATGVRVRSD AFSGDLFELA DPEGDDALGR LRRLVAIFRQ DFLAGLADQS ATTRQWIAGQ RDRHMAVLVD ALKTALPAAR SREDVSLIKE AALRIFREAP DDENIRRILL EAYESEGQLE KARRLFETSR HQLESAVDIG LDVQALGEVR KIFAGSRPPQ SAAVPLIDGG VRVPGPLPRL VLLPPSGTDA IGALPMLSEA LIEDVTIGLC ALNTVSVVAP HTAARIAHNA DKAQLILRHS ISYVLDTRLT NRAGSPALFV QLIYAGSDEV IWAERFSLEK YELISHRRDI ARQIAKELAG QVRRHETMRD AFEGNSAAYH SYLLGLRDIK RLALPDVRRS RKAFREALQH SAHFAPALSG LSRTFLVEWL LTARGDSELL GLAEDYANRA IVADPSFAAG FRELGVAKLY LGELDESVVA LKLAEELSPH YADGIASYAD TLVHASRPAD ALAKIERAIS LNPLSPTDYL WTAAGANFAL GHYAEALEQI SIMDDRTPAD RLSAACWAML GDMKNARIYM RKVREIYPDF DVDKWLSVVP FKEQWQKEQY REALRRAGF
|
| |