Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5542 |
Symbol | |
ID | 8016433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 129094 |
End bp | 130044 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644827709 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_002978909 |
Protein GI | 241518281 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0196792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00000723817 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATAG ACGAACGTCA CCTTGTCCAG CTTGCGGCAG TCGTTAAGAC CGGCGGGGTC ACCGAGGGAG CTGCCCTGCT CGGCCTATCG CAGCCGGCTG TTTCGCGCAC GCTCGCAATG CTCGAGGCTC GCATCGGCGA GCCACTGTTC GTCAAGGGGC GGCGGCCGTT GCAGCCAACG TCATTGGGTC GGGCGCTGGC CGATCACGGC CAGACGATGC TGTCGGCATC GCGCAAGGCC TCCGATGTCG TCGAAAGCTT TCGGGCCGGC AGAAGCGGCG TAGTGCGTGT CGGTGGCACG CCTTTTTTCA TGGATGCGTT GATCGCCGGC ATGATCGCCG AGTTCCAGAA CCTGCATCCA GACGTACGCA TCGACCAGAG CTATGGTTAC TTCCCAGACC TCCGTGCCGC GCTCAATGCC GACCAGATCG ACCTTGCCAT CTGCCCGATC GATATTCTCG ACGAGGGCTC CGGCCTCGAA TTTCAGCAAA TCCTGCCCGG CCGCAACGTC GTCGCCTGCC GGGTCACCCA TCCTTTGCTG TTGAAGCGGC GCCCTCAGCC GGCGCACCTG CTCGATTTCC CTTGGGTGGC GCCTCCGCCA GGCAGCCCGC TACTCACCGA TCTGCGCAGC ATGCTGCTGT CCTTCGGCGC AACCGAAGTC AAGATCCGCT ATTCGGGCGG CTCCCTCATG AGCGTCGTCC AGTACATGAA GGCGGCGGAT GCGCTGACCA TCATGCCGCA CAGCGTCGTC TTCGCATTGC GCAACGAAAA GTCCATCACC GCCCTGCCCG TTCCCATCCC CCATTCGGAA CGCGCGCTCG GCTTGCTGAA GCGTTCTGAC GCACCCCGCA CACCCGCCGC CGACAACTTC GCCCGCCACA TCCGCACCGG CTTCGATAAC CTCAAGCACC TGATTAAGCG GCATGAGCAG TCAGTGGTCT GGGGCTCATG A
|
Protein sequence | MKIDERHLVQ LAAVVKTGGV TEGAALLGLS QPAVSRTLAM LEARIGEPLF VKGRRPLQPT SLGRALADHG QTMLSASRKA SDVVESFRAG RSGVVRVGGT PFFMDALIAG MIAEFQNLHP DVRIDQSYGY FPDLRAALNA DQIDLAICPI DILDEGSGLE FQQILPGRNV VACRVTHPLL LKRRPQPAHL LDFPWVAPPP GSPLLTDLRS MLLSFGATEV KIRYSGGSLM SVVQYMKAAD ALTIMPHSVV FALRNEKSIT ALPVPIPHSE RALGLLKRSD APRTPAADNF ARHIRTGFDN LKHLIKRHEQ SVVWGS
|
| |