Gene Rleg2_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1107 
Symbol 
ID6979826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1131202 
End bp1132389 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content61% 
IMG OID643395819 
Producthypothetical protein 
Protein accessionYP_002280627 
Protein GI209548710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.585839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTT ACCGGACAAC GCGGCTGATG CTGTCGGGCG CCGCCTTTTT CTCGCTCGCC 
GGCTCGGCTT TCGCTCTCGA CGGCACCGAT CTCTTGAAGA AGATCAATGC CGCCTATGCC
GCCCAGGGTG GGACGATTGC GGCTGAAAGC GTCGATATCG ACGGCACGAC CGTCACGTTG
AAGAATGTCA CCGTCAAGCC GACCGGCGGC GAGAGCCTGC CCATCGGCGA AATCACCCTT
TCCGGTGTCG AGGAAGACGA GGATGGCGGC TACTACATCG AGGAAGCCGC CTTCCCCGAC
ATCAACAAGA CGCAAGACGG CGTGACCGTG ACGGCGCAGG AGCTGACGCT CGGCGGCATC
TCCGTGCCGG CAACGCCGGG CGGCGACACG CTCGACACCA TGATGCTCTA TGAAACCGCC
CATATCGGCC CGCTGAAGGT GGTCAAAGAC GGCGCGGAAG TGTTCTCGCT GCTCGAAAGC
AACATGAACC TGACGCTGCG CGAAGACGAA TCCGGCTTCG ATTTCGACGG CGCCTTCAAA
AGCATGAAGG CCGACCTCAC CAAGACCGAA GATGCGCAGA GCAAGGATGC GATCGAGAAG
CTCGCCCTGC AGCACGTCCA AGGCGACATC ACCATGAAGG GCGCCTGGGA GCTCGCCCCC
GGCACGATCG ACATTTCGGA ATTCGCCTTC GACTTCACCA ATGTCGGGAA GCTGAACCTC
GGCTTCAAGA TCTCCGGCTA CACGATGGCC TTCATGAAGT CGATGCAGGA TGCGATGAAG
GAATCCGAAG CCAATCCGAA CAAGGAACAG TCGCAGCAAG CGCTCGGCCT CGCCATGCTC
GGCCTGATGC AGCAGCTTTC CTTCGAGGCC GCGCAGGTGC GTTTCGACGA TGCCTCGATC
ACCAAGCGCG CGCTCGATTA TGCCGGCTCG CAGCAGAACA TGTCGGGCAA GCAGATGGCC
GATTCGCTGA AGGCGATGAC GCCGATCATG CTGGCGCAGC TCAATATCCC GGAACTGCAG
AATGCCGTTT CGGCTGCCGT CAACACCTTC CTCGACGATC CGAAGAGCCT GACCGTCAAG
GCCGCTCCCG AAAAGCCGGT GCCGTTCCCG ACGATCGTCG GCGCTGCCAT GGGCGCTCCG
AACACGCTGC CGCAGGTGCT CGGCGTCAAG GTTTCGGCCA ACGACTGA
 
Protein sequence
MNFYRTTRLM LSGAAFFSLA GSAFALDGTD LLKKINAAYA AQGGTIAAES VDIDGTTVTL 
KNVTVKPTGG ESLPIGEITL SGVEEDEDGG YYIEEAAFPD INKTQDGVTV TAQELTLGGI
SVPATPGGDT LDTMMLYETA HIGPLKVVKD GAEVFSLLES NMNLTLREDE SGFDFDGAFK
SMKADLTKTE DAQSKDAIEK LALQHVQGDI TMKGAWELAP GTIDISEFAF DFTNVGKLNL
GFKISGYTMA FMKSMQDAMK ESEANPNKEQ SQQALGLAML GLMQQLSFEA AQVRFDDASI
TKRALDYAGS QQNMSGKQMA DSLKAMTPIM LAQLNIPELQ NAVSAAVNTF LDDPKSLTVK
AAPEKPVPFP TIVGAAMGAP NTLPQVLGVK VSAND