Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4550 |
Symbol | |
ID | 6977644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 187615 |
End bp | 188535 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393727 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_002278545 |
Protein GI | 209546627 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.597489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.103743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCA GACGCCTCAA ATCTTTCATC GTGATCGTCG ACAGCGGCAG CATCACGCGA GCGGCGGATC TTTTGCATAT CGCCCAGCCG GCTCTCAGCC AGCAGCTTGC GGCGCTGGAA GAGCATTTCG GCCACAAGCT GCTGATCCGC AGCCAGCAGG GTGTCAGCAT GACCGATGCC GGACATGCGG TATATCGCCA TGCGCAAATC ATCCTTCGGC AGATGGAGCA GGCGCAGGCT GATGCATCGG CCGCCGGCAA TTCGCTTGCC GGCCGCGTGT CTGTCGGCCT CGTGCCGTTC AGCAGCGCGG CGACGCTCTC GGTCGATCTG CTGGCGGAAA CCCGCAAACG CCATCCCGGC ATTCTCCTGC ATCTGACCGA AAGCGTCGGC CAGACCTATA GCCAGATGAT CATGAACGGC CGGCTGGAGA TGGCGCTTCT GCATGGAACC GGGCCGATCA AGGGCGTGCG GTTCGAACCG ATCCTGAGTG AAGAGTTTTT CCTGGTCGCC CACCGCGACT TTGCCATCGA AGCCGATGCG AAACCCGTTC CGGTCAACGC GCTCGACGGA ATACCGCTGC TCCTGCCGCC GGCCTATAAT TTCGTCCGCC GCGCCGTCGA TACCGCCTTT ACCCGCACGC GCACCAATTT GAAGGTCGTA GCGGAAGTCG AAATCGTTCG CACGCTCGCC CGCGCGGTCG GCAGCGGTCT CGGCGCGACG ATCATGCCGA AAGCCATCGC CGATCGCATC GTCTCGGAAT CGAGCGAGCC GCTGATCTGC CGGCTTGTCT CACCGCGGAT CGAGGAAACC CTGTCGCTTT GCGTTTCCGA TCAGAATCCC CTGTCGGAGC CGGCGCTTGC CGTCCGCGAC ATTCTTCTCG AGCTGACGGC GCGGCTGAAA GCCGAGGTGG CGGCCGGCTA G
|
Protein sequence | MDIRRLKSFI VIVDSGSITR AADLLHIAQP ALSQQLAALE EHFGHKLLIR SQQGVSMTDA GHAVYRHAQI ILRQMEQAQA DASAAGNSLA GRVSVGLVPF SSAATLSVDL LAETRKRHPG ILLHLTESVG QTYSQMIMNG RLEMALLHGT GPIKGVRFEP ILSEEFFLVA HRDFAIEADA KPVPVNALDG IPLLLPPAYN FVRRAVDTAF TRTRTNLKVV AEVEIVRTLA RAVGSGLGAT IMPKAIADRI VSESSEPLIC RLVSPRIEET LSLCVSDQNP LSEPALAVRD ILLELTARLK AEVAAG
|
| |