Gene Information |
Locus tag | Rleg_4574 |
Symbol | |
ID | 8015325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4699604 |
End bp | 4700680 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644827151 |
Product | hypothetical protein |
Protein accession | YP_002978351 |
Protein GI | 241207255 |
COG category | [S] Function unknown |
COG ID | [COG4320] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
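The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 4699604, 4700680
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1077
print(protein_length_aa(length))  # 358
```

Both results match the "Gene Length" and "Protein Length" fields of the record.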
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.134631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA TATCGCAATC GGTCCGCAAT TTCGAGACCT GGCTGGCCGT CGAACTCGGC GACGATCTCG TCAAGGACGA TCTCCGGGAA AAGCACGAGA AGATGCGAAG TGGCGATTTC GTCTTCCTGC GCGCCACCTA CTGGCGCTGG TGCGAGATCA TCCTCGATAT CTGCCCGGAA CTTAAAGGCG CGCCTGAGAT ATTGGCGATC GGCGATACGC ATCTGGAGAA TTTCGGTACC TGGCGCGATA TCGAGGGCCG GCTCGTCTGG GGTGTCAATG ATTTCGACGA CGCGGCGGTG ATGCCCTATG CATTTGATCT CGTTCGCCTT GCGGCAAGCG CTGTCCTGGC CCGCGGCGAC GACGGTCCCT CGGTTCGCAT GATCGGCGAA TTGATCTTGA GCGGCTATCG CCGGGGCCTT GAAAATCCGT TGCCCGTCAT CCTCGAGCGC GACCACAAAT GGCTGCGCAA GGCGCTGCTG CTGCCGAATT CCGAACGCCG AGAATTCTGG GAGAAATACG AGATGTTGCT TCCCGGCAGC AAGCCGCCAC CATCAGCCTA CACCAAGGCG CTCGCAGACG CGCTGCCGTC CGGGGCAGGA CCCTTCGTGC CGAAGCCGCG CAGCGGCGGC ACCGGCAGTC TCGGTCGGCT GCGTTTCGTC GCCTATGCCG AATGGCAGGG TGGGCCGGTG CTGCGCGAGG CGAAGGCGCT GCTGCCGTCC GCCTGGTCGC TTCGCCACAA CCCGCAGGAC ATGGCGATCC ATGCGGAAGA GATCGCCAAC GGGCGGGCGC GCTCAGCCGA TCCGCACTAC CGGGTCTCCG GCCGCATCCT CGTGCGCCGG CTCTCACCGA ACAGCTGCAA GGTCGAAGTC GACCGGCATC CCGAGATCCT GCTTTCGCCG ACGATGCTCG AACTGATGGG CTTCGAGATC GCCAATTGCC ATTCCGACGA TGCGGCGGCC GTTGCGGCGA TCCTGAAGGA TTTGGCGGCG CGGGGAAACG AATGGCTGCA TGAGGCGGCA AGGGCCGCGG CATCGAGCGT CAGCGCCGAG CAGAAAGCCT ATTCGCGTGC CAGCTAA
Protein sequence | MTTISQSVRN FETWLAVELG DDLVKDDLRE KHEKMRSGDF VFLRATYWRW CEIILDICPE LKGAPEILAI GDTHLENFGT WRDIEGRLVW GVNDFDDAAV MPYAFDLVRL AASAVLARGD DGPSVRMIGE LILSGYRRGL ENPLPVILER DHKWLRKALL LPNSERREFW EKYEMLLPGS KPPPSAYTKA LADALPSGAG PFVPKPRSGG TGSLGRLRFV AYAEWQGGPV LREAKALLPS AWSLRHNPQD MAIHAEEIAN GRARSADPHY RVSGRILVRR LSPNSCKVEV DRHPEILLSP TMLELMGFEI ANCHSDDAAA VAAILKDLAA RGNEWLHEAA RAAASSVSAE QKAYSRAS
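The sequences above are printed in blocks of ten characters. The following is a minimal sketch of how one might strip that grouping, compute a GC fraction, and translate the coding sequence. It assumes only unambiguous bases (A, C, G, T) and uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper names are hypothetical, and only the first 30 bases of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Translation table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = ungroup(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = ungroup(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
fragment = "ATGACGACGA TATCGCAATC GGTCCGCAAT"
print(translate(fragment))  # MTTISQSVRN
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the "GC content" field.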