Gene Rleg_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2197 
Symbol 
ID8013208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2196535 
End bp2197755 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID644824783 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002976013 
Protein GI241204917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0740426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCCG CTCAAAATTC CCCGACGGGT GAGGGAGCGC CGCCGCGCTT CCGGCTTCTC 
AGTGCGCTTT CCTATTGTGC GCCGCTGCTC GTCAACGGCA TCGTCCTGCC GTTCTTTCCC
GTCTGGCTCG CAACCCATAG CTTTAGCGAT CATGAGATCG GCATCATCCT TGCCATACCC
ATGGTGGTTC GCGTGCTGGT GGCGCCTGTC GTCGCCATGA TCGCCGATCG ATTGAAGGAG
CGCGCCGACG TGCTGCTCTG GTCGGGCGGC CTTTCGCTGC TGACGGCGGT CGCGCTGTTC
TGGACGACGA CTTTCTGGCC GGTGACGATC GTCTATGCGC TGCAGGGCGC CACCTTCGCA
CCCTATGTGC CCGTCGTCGA ATCGATCGTC ATCTCGGGCG TGCGCCGCTG GGGGCTCGAT
TACGGGTCGA TGCGCGTGTG GGGCTCCATC GCCTTTATCG TCTCGACGCT GGTCGGTGGC
CAGATGATCA GCCGGTGGGG CGGCGGAATG GTGCTCGATG TCATGGTGTT CGGCTTTGTC
ATGACCGTTG TCATGGCGAT CTTCTGTCCG CGCATCGGGC CAACGCGACG GCGGGGCCAG
CCGATCAACA TCCCAGCCGC TACCGGCAGT GGCCTGCGCG AGCCGCACCT GCTGCTGCTT
TTGATCGGCG TTGCCATCCA GCAGTCGAGC CATGCGGTGC TGAACGCTTT TTCCTCGATC
TACTGGCATC AGCTCGGCTT CTCCGGCACT GAGGTCGGCC TGCTCTGGAG CGCCGGCGTC
GCCTCGGAAG TGACGGTGTT CTTCCTGTCG AAGCGTCTCA ACCGTCGCTT CGATGCCTGG
ACGCTGATCC GCTTCGGCTG CGCCATCAGC GTCTGCCGCT GGATCCTGTT TCCGATGAAT
ACCGGTTTTG CCGGTTTCTT CCTGCTGCAA TGTTTCCACG GCTTCACCTA TGCCTTCGTG
CATACCGGCG TGCAGCGACG GATCATGGCG ACGGTGCAGG AGACGCAGGA ATCTTCGGCA
CAGGGCGCCT ATTTCTTCTA TGTCGGCATG GCGATGGCGC TGATGACCCT GGCGTCGGGT
TATCTCTACG CCTGGCTCGG CGTCGTCAGC TATTACGTCA TGGCGCTGGT CGCGTTTTCC
GGCCTCGGCC TCGTCATCTT CGCCTATTAC CTTCAGCCCC AAAGGGTGCT TTCCGGCGGA
AAGACCAGCG AAGCGGCGTA G
 
Protein sequence
MIPAQNSPTG EGAPPRFRLL SALSYCAPLL VNGIVLPFFP VWLATHSFSD HEIGIILAIP 
MVVRVLVAPV VAMIADRLKE RADVLLWSGG LSLLTAVALF WTTTFWPVTI VYALQGATFA
PYVPVVESIV ISGVRRWGLD YGSMRVWGSI AFIVSTLVGG QMISRWGGGM VLDVMVFGFV
MTVVMAIFCP RIGPTRRRGQ PINIPAATGS GLREPHLLLL LIGVAIQQSS HAVLNAFSSI
YWHQLGFSGT EVGLLWSAGV ASEVTVFFLS KRLNRRFDAW TLIRFGCAIS VCRWILFPMN
TGFAGFFLLQ CFHGFTYAFV HTGVQRRIMA TVQETQESSA QGAYFFYVGM AMALMTLASG
YLYAWLGVVS YYVMALVAFS GLGLVIFAYY LQPQRVLSGG KTSEAA