Gene Rleg2_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1872 
Symbol 
ID6980611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1918337 
End bp1919293 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content60% 
IMG OID643396595 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_002281383 
Protein GI209549466 
COG category[R] General function prediction only 
COG ID[COG5006] Predicted permease, DMT superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.102364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0795178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCAT CCCCTCATGA ATCGGCCGCC ATGTCAGAGC ATCCAGCCCA TCAAAACTCC 
CTGCAGGGCA TGGCCATCAT GTCCGGCGCC ATGCTGATCC TGCCGACCAT GGACGCCATT
GCCAAATATA TGGCGACCTT CGAGGCGATG TCCCCGGGTC AGGTGACTTT CTACCGATTC
TTCTTCCAGC TCGCCTGCAC CCTGCCGATT CTCTTCGCCG TTTTCGGGCT GAAGGCGCTT
TCGGCCAAGC GGCCATGGAT GAACCTGCTG CGCGGCGTGC TGCATGGCGC TGCGAGCCTG
CTGTTCTTCG TCGCCGTCAA ATACATGCCG CTCGCCGACG TCTTCGCCAT CTATTTCGTC
GAGCCGTTCA TGCTGACGGC CATGTCGGCG CTGTTCCTCG GCGAGAAGGT CGGCTGGCGG
CGCTGGATGG CGATCGTCGT CGGTTTCGGC GGCGCGATGA TCGTCATTCA GCCGAGCTAC
GAAATATTCG GTCTGAAGGC GCTGCTGCCG GTCGCCTGCG CCTTTTTATT CTCGCTCTAT
CTCTTCCTCA ACCGCGCCAT CGGCGAGGCC GATTCGCCGC TGACCATGCA GACGATGTCC
GGCATCGGTG GAACGCTGTT CATGGCCGCC GCCCTTTTCG TCGGCAGCGG CTCCGGCAAT
GCCGATTTCG CCATGTCTCT GCCCTCCTCC GGTCTCGGTC TTGTCCTGCT TCTCGCCCTC
GGTTCGATCT CCGGCTATGC GCATCTGCTG ATCGTCCGGG CTTTCCGGCT TGCACCGCTG
TCGCTGCTTG CGCCGTTCCA ATATTTCGAG ATCATCTCGG CGACCGTTCT CGGTTATGCG
CTGTTCAACG ATTTCCCGAA TTTCTCCAAA TGGATCGGCA TCTTCATCAT CGTCGCATCC
GGCCTCTTCA TCATCTGGCG CGAGCGGCTG CAGGCGCGAT CGCTAAAATC CTCCTGA
 
Protein sequence
MASSPHESAA MSEHPAHQNS LQGMAIMSGA MLILPTMDAI AKYMATFEAM SPGQVTFYRF 
FFQLACTLPI LFAVFGLKAL SAKRPWMNLL RGVLHGAASL LFFVAVKYMP LADVFAIYFV
EPFMLTAMSA LFLGEKVGWR RWMAIVVGFG GAMIVIQPSY EIFGLKALLP VACAFLFSLY
LFLNRAIGEA DSPLTMQTMS GIGGTLFMAA ALFVGSGSGN ADFAMSLPSS GLGLVLLLAL
GSISGYAHLL IVRAFRLAPL SLLAPFQYFE IISATVLGYA LFNDFPNFSK WIGIFIIVAS
GLFIIWRERL QARSLKSS