Gene Rleg_5300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5300 
Symbol 
ID8006937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp705453 
End bp706637 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content62% 
IMG OID644822206 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002973466 
Protein GI241113631 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG CCCCGCAGCA GCGGATCTAT GTCTGCTTCT TTCTCTTCGC TGTGTCGCTG 
GGGGCGCTGC TGTCGAGGAT GCCGGATTTG CAGGTTGCAC TTGGCGTCAA TAAGTCCGAG
CTTGGGTTGA CCCTGATAGG GGCTGCGATC GGCGCCTTGA TTTCGTTGAC TTTGTCTTCG
CCCTTGATCG CCCGGCTCGG CGCGCGTACG ACGGCATTCA TTACTGTTCT CGGCACGTCT
GCGCTGCTAT CTCTGGTGCC GTGGATTGGT GCGGCGCCGG TCGTGTTCTG TGTGCTTTTC
GTCGAGGGGC TGCTCGCCGG GGCGCTGGAG ATCAATCTCA ATGTTGAGAT CGACCGTATC
GAAGCGCAGC TAGGACGCGG TGTGATGAAC AGGGCGCATG GTTTCTGGAG CCTCGGCTTC
TTCGTCACGG CGCTTGTCTC CTCGGTCGTT CGCCAAGCCG GGATTTCGAT GGAACTCCAT
CTCGCCGTGA CCTTTGTCGC GGTTGTTGTC ATCGGCATCT GGGCGATTTC CGGCATGCGG
AATGCGCCGG CGCGGATCGC GTTGCATGAA GGCAAGGCAC CGCTGGTGGC GCTTCCCACC
TGGGGCCTCA TGCCGCTGTG CGTGATCGGC ATCGCGGCCT TTCTCGTCGA AGGCGCCGGG
ATCGACTGGT CGGCGATCTA TATGCGCGAT GTGTTTTCGG TCGAGCCCTT CATTGGCGGA
CTGGGATTGA CGCTCTTTAC CTTCTGCATG GCGCTGGCGC GCCTGTTCGT CGATCCGCTG
GTCGATCGGT TTGGCGCGCG GGCCGTCGCC ACGATGTTGC TTGTTCTTTC GGCGATCGGC
ATCTGCGCCG TGTCGGGGGC GCCGCATCCC TATGTCGCGC TGGCGGGCTT TGCCTTGATG
GGCGCCGGCT GCAGCGCGGT CTATCCTCTC GCTGTCTCGG CGGCGGCCCA ACGCACCGAC
CGCGCGGCGT ATCTCAACGT CGCCGCCCTC GGCCAAATGA GCTTTGTCGT CTTTTTCCTG
GCGCCGCCGC TGCTCGGTTT CATTGCCGAA CATGCTGGCA TCCGGACATC CTATCTCGTT
TGCCTTCCCC TTATCATTTA CGCGCTCTTT TCGGCCAAGG CGCTTGCTAC GCGCCGGGCT
GCCGGCGGCG GTAGCGCTGC GACTGCTCGG AGCGTCAACG GGTAA
 
Protein sequence
MKIAPQQRIY VCFFLFAVSL GALLSRMPDL QVALGVNKSE LGLTLIGAAI GALISLTLSS 
PLIARLGART TAFITVLGTS ALLSLVPWIG AAPVVFCVLF VEGLLAGALE INLNVEIDRI
EAQLGRGVMN RAHGFWSLGF FVTALVSSVV RQAGISMELH LAVTFVAVVV IGIWAISGMR
NAPARIALHE GKAPLVALPT WGLMPLCVIG IAAFLVEGAG IDWSAIYMRD VFSVEPFIGG
LGLTLFTFCM ALARLFVDPL VDRFGARAVA TMLLVLSAIG ICAVSGAPHP YVALAGFALM
GAGCSAVYPL AVSAAAQRTD RAAYLNVAAL GQMSFVVFFL APPLLGFIAE HAGIRTSYLV
CLPLIIYALF SAKALATRRA AGGGSAATAR SVNG