Gene Smed_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2079 
Symbol 
ID5322938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2135064 
End bp2136374 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content63% 
IMG OID640791016 
Productmajor facilitator superfamily metabolite/H(+) symporter 
Protein accessionYP_001327747 
Protein GI150397280 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.611047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0017507 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGACG CGACAACCTC GCTGTCGCCG CAGGATGGTG CGTTGCATCG ACAGGCCGTG 
AACTCCCCGG CCCGGGTGCT GTTCGCCAGC CTCGTCGGCA CGACGATCGA ATTCTTCGAC
TTCTATGTCT ATGCGACAGC AGCGGTGATT ATTTTCCCGC ACCTTTTCTT CCCTGCAGCT
GATCCGACCT CGGCAATGCT GCAGTCCTTG GCGACTTTCT CGATCGCCTT TTTCGCCCGT
CCCCTTGGCG CCGTGATCTT CGGCCACTTC GGCGACAGGA TCGGCCGCAA GGCGACGCTC
GTCGCCGCGC TGATGACTAT GGGGATTTCG ACGGTCGTGA TCGGCCTGCT GCCCACCTAC
GCGACGATTG GCGTCGTGGC GCCGCTTCTC CTTGCGCTCT GCCGCTTCGG CCAGGGCCTG
GGCCTCGGCG GTGAATGGGG CGGCGCGGTG TTGCTAGCGA CCGAGAATGC GCCGGAAGGC
AAGCGGAGCT GGTATGCAAT GTTCCCCCAG CTCGGCGCGC CGATCGGCTT CATCCTGTCG
GCCGGGACCT TCCTCGTGCT CGGCGAGGTC ATGAGCGACG AGGCCTTCCT CGCCTGGGGC
TGGCGAATTC CCTTCGTCGC CAGCGTGCTG CTCGTGATCG TCGGTCTCTA TGTCCGCCTG
AAGATTACCG AAACGCCGGA ATTCCAGAAG GCAATCGATA AACGGGAGCG CGTCGAGGTA
CCGGTGGCGG CGATATTCCG CTCGCATAAG CGAAGCCTCG CGCTCGGCAC CTTCGTGGCA
CTCGCGACCT TCGTCCTGTT CTATCTGATG ACCGTCTTCT CGCTCTCCTG GGGCACGACG
AAGCTCGCCT ATTCGCGCGA GCAGTTCCTG CTTGTACAGA TGACCGGCGT CGTTTTTTTC
GGCCTGATGA TTCCCGTCTC CGGCATTCTT TCGGACCGCT TCGGACGCCG CCTGGTGCTG
GTGCTCACAA CAATCGGCAT CGGCATATTC GGCCTCGTCA TGGCGCCGCT TCTGACATCC
GGTCTCGGCG GCGCCTTCGT CTTCTCGATC CTCGGACTCG GCCTGATGGG CCTTACCTAC
GGGCCGATCG GCGCGGCGCT GGCGGCTCCC TTTCCGACTG CAGTGCGTTA TACCGGCGCC
TCGATGACCT TCAACCTCGC AGGCATCTTC GGCGCGTCGC TGGCACCCTA CATCGCCACC
TGGCTCGCGA CCAACTACAG CCTCGGCCAT GTCGGCTATT ATCTGATGGG CGCCGCATTG
ATCACGCTCG TCTGCCTGCT GCTTTCGAAC GAGGAAGAGG TCTCGGGCTG A
 
Protein sequence
MTDATTSLSP QDGALHRQAV NSPARVLFAS LVGTTIEFFD FYVYATAAVI IFPHLFFPAA 
DPTSAMLQSL ATFSIAFFAR PLGAVIFGHF GDRIGRKATL VAALMTMGIS TVVIGLLPTY
ATIGVVAPLL LALCRFGQGL GLGGEWGGAV LLATENAPEG KRSWYAMFPQ LGAPIGFILS
AGTFLVLGEV MSDEAFLAWG WRIPFVASVL LVIVGLYVRL KITETPEFQK AIDKRERVEV
PVAAIFRSHK RSLALGTFVA LATFVLFYLM TVFSLSWGTT KLAYSREQFL LVQMTGVVFF
GLMIPVSGIL SDRFGRRLVL VLTTIGIGIF GLVMAPLLTS GLGGAFVFSI LGLGLMGLTY
GPIGAALAAP FPTAVRYTGA SMTFNLAGIF GASLAPYIAT WLATNYSLGH VGYYLMGAAL
ITLVCLLLSN EEEVSG