Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5418 |
Symbol | |
ID | 6978512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1060449 |
End bp | 1061519 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394520 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_002279338 |
Protein GI | 209547420 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00263593 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAGATT CCGCTCATCC GAAACGGCCC GTCACGGTCG CCGACGTTGC GAAGGCCGCA AAGGTCTCGA AGGCGACGGC CGCCCGCGTG CTCGGCGGCT ATGGCGTCGT CAGCGTCCAG ATCAAGGATC AGGTGATGGC GGCGGCAGCC GCGCTCGAAT ACCGCCCGAA CGAACTCGCC CGGAGCATGA GCACGGGACG ATCGGGAATC ATCGGCGTCG TTGTCGGCGA CATCGAGAAC GCTTTCTTCA GCCTGGCGGT GCGCGGCATC AGCGATGCGG CCCGCCTTGC CGGCTTCAAC ATCATCATCG CCAATTCGGG TGAACAACTC GATGCTGAAA GGTCGGCCGT CGACCTGCTG ATCGGTAAGC GCGTCGATGG CCTGATCGTC ACGCCCGCCC GTTGCGACCG CCTCGATCAC CTGCAGCATG TCCGCCGCAC GGGCGTGCCG CTGGTGCTGT TCGACCGGGC CATCCCGGAA CTCGATGTCG ACGCCGTGAC CGGCGACGAC CGGGAGGCAG CCCTCACCGC GACCCGATAT CTGATCGGCC AAGGGCATCG CCGCCTTGCC TATGTCTCCG CCATGGATGC CGAGAAAGGC GGGCTCACCG ATATCGCGCT GATCTCGAAT TCCGCCGTGC GCGAACGCGT GGAAGGCTTC GTCAGCGCCC TGACCGAGGC GGGCTTGCCG AACCCTCTTC ATTACATCAG GCTCGGCGCC ACGGACCAGC ATCAGACGGA CGCCGTAATG AAACGTCTGC TTTCGGAAAC AGCGCCGCCG ACGGCGCTGC TGGCATCCGA CAGCCTCGTC GGCTTGCGCA TTTTCAAGTC GCTGCAATCG CTCGGCCTAT CGATGCCGCA GGATGTCTCG ATGATTTCCT TTCTGGACGC CGACTGGACC AGCGTCACCG TTCCACCGAT CACCATCGTC GACCAGCGCG TTTACGAGCT AGGCAAACTC GCCGGTGAAC GGCTCGTCGC CCGCATCGAG CGCACCCCGC TTGCCGTCGA ACATCTGCGC GTTACCACGA GCCTTGTCGT GCGCGGCTCC GTGGCGACGA TCGGCCCGTG A
|
Protein sequence | MEDSAHPKRP VTVADVAKAA KVSKATAARV LGGYGVVSVQ IKDQVMAAAA ALEYRPNELA RSMSTGRSGI IGVVVGDIEN AFFSLAVRGI SDAARLAGFN IIIANSGEQL DAERSAVDLL IGKRVDGLIV TPARCDRLDH LQHVRRTGVP LVLFDRAIPE LDVDAVTGDD REAALTATRY LIGQGHRRLA YVSAMDAEKG GLTDIALISN SAVRERVEGF VSALTEAGLP NPLHYIRLGA TDQHQTDAVM KRLLSETAPP TALLASDSLV GLRIFKSLQS LGLSMPQDVS MISFLDADWT SVTVPPITIV DQRVYELGKL AGERLVARIE RTPLAVEHLR VTTSLVVRGS VATIGP
|
| |