Gene Smed_3188 details

Gene Information

Locus tag: Smed_3188
Symbol:
ID: 5324067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3357221
End bp: 3358612
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 65%
IMG OID: 640792136
Product: major facilitator transporter
Protein accession: YP_001328847
Protein GI: 150398380
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
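
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing, uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3357221, 3358612)
print(length)                  # 1392
print(protein_length(length))  # 463
```

Both results agree with the Gene Length and Protein Length fields listed in the record.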


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.988714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACATTG CGGCCGGGCG AATGGATGAG CGGCCAGGGA CCTCGCGTCT CGGCATTGCG 
GGCTGGATGC TCTTCGACTG GGCCGCGCAA CCCTTCTTCA CCGTCATCAC GACCTTCATC
TTCGCACCCT ATTTCGTTTC CAGGCTGACG GCCGACCCGG CACAGGGTCA GGCGGTCTGG
GGGTATACGC TTACTGCCGC GGGGATCGTG ATTGCCCTGC TTTCTCCCGT CCTCGGCGCT
ATCGCCGATG CGACCGGCCC GCGAAAACCG TGGATTGCCT TCTTCGCCGC AGTGAAGATC
GCCTCGCTCG CCCTGCTCTG GTATGCGGCG CCCGGCTCAA GCCTGATCTA CGCGGCGCTC
CTTCTGGCGC TAGCCACGGT GGCCGCCGAA TTCTCCATCG TCTTCAACGA TTCCACGATG
ACGCGGCTCG TGAGCGAAAG GGAAGTGGGC CGTATCTCGA ACATCGCCTG GGGGCTCGGC
TATCTCGGCG GCATGATCGT GCTCATCGCG GTAGTCGTGC TCGTTGCCAG CGATCCGCAG
ACGGGAAAGA CAGTGCTCGG GCTCGATCCC CTGTTCGGGC TCGATCCGGC AAAGGGCGAG
GATGCCCGCA TCACCGGCCC GATCTCCGCC ATCTGGTATC TCATTTTCAT CCTGCCCATG
TTTTTCTTCA CGCCGGATGC GATCCGGGCG ACGATGACGA TGGCAGAGGC AGCCGCGCGG
GGGCTCGAGG AGCTGAGGGG AACGCTCAGC GAACTCAAGG AGAGGGCCGG CATATTGCGG
TTCCTGATCG CCCGGATGAT CTTCCAGGAC GGCGTAAACG GCCTGCTCGC GCTCGGCGGC
ACCTTCGCTG CCGGCATGTT CGGCTGGCAG ACGATGGAGC TCGGTGTTTA CGGCATCATC
CTCAACATCG TCGCAATCGC CGGCTGCCTC GTTGCGAGCT GGCTCGATGC CCATCTCGGC
TCGAAGAAGA TCGTGGTCGC AAGCCTCGTC TGTCTCACGA TCGCCACCCT CGGCATCGTC
TCGACCGGAC CGGGCTTCAC CCTCTTCGGA CTCGCACCGC TGCCGAGCAC GGATTCGGGC
GGCCTGTTCG GCACCGCTGC CGAAAAGGCC TACATTGCCT ATGGCCTGCT TGTCGGCATC
GCCTTCGGCC CCGTCCAGGC CTCGTCTCGA TCCTATCTCG CCCGCAGTGT ATCGCCCGAC
GAAGCCGGGC GCTATTTCGG CCTCTATGCG CTTTCCGGCC GCGCGACGTC ATTCCTGGCG
CCCGCCTCGG TCGCCACCGT CACCTTGCTG ACCGGCTCGG CCCGCATCGG CATGGTAGCG
CTCGTCGCTT TCCTCGCCCT TGGACTCGTC CTTCTCCTGC GGACGCCCTA TCCGGCTCAT
CGCCCGACGT GA
 
Protein sequence
MDIAAGRMDE RPGTSRLGIA GWMLFDWAAQ PFFTVITTFI FAPYFVSRLT ADPAQGQAVW 
GYTLTAAGIV IALLSPVLGA IADATGPRKP WIAFFAAVKI ASLALLWYAA PGSSLIYAAL
LLALATVAAE FSIVFNDSTM TRLVSEREVG RISNIAWGLG YLGGMIVLIA VVVLVASDPQ
TGKTVLGLDP LFGLDPAKGE DARITGPISA IWYLIFILPM FFFTPDAIRA TMTMAEAAAR
GLEELRGTLS ELKERAGILR FLIARMIFQD GVNGLLALGG TFAAGMFGWQ TMELGVYGII
LNIVAIAGCL VASWLDAHLG SKKIVVASLV CLTIATLGIV STGPGFTLFG LAPLPSTDSG
GLFGTAAEKA YIAYGLLVGI AFGPVQASSR SYLARSVSPD EAGRYFGLYA LSGRATSFLA
PASVATVTLL TGSARIGMVA LVAFLALGLV LLLRTPYPAH RPT
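
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the two example strings are copied from the first and last lines of the gene sequence listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing, used as examples.
first_line = "GTGGACATTG CGGCCGGGCG AATGGATGAG CGGCCAGGGA CCTCGCGTCT CGGCATTGCG"
last_line = "CGCCCGACGT GA"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(last_line)[-3:])   # TGA
```

Applied to the full gene sequence, the cleaned length and `gc_fraction` can be compared against the Gene Length and GC content fields of the record.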