Gene Smed_4859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4859 
Symbol 
ID5318844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1360143 
End bp1361162 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content62% 
IMG OID640776644 
Producttaurine ABC transporter, periplasmic binding protein 
Protein accessionYP_001313576 
Protein GI150376980 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0843259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCACT ACAGAAAATT CAAGCTTATA TCCGGCGCCC TTGCGATAGC GACCGGGCTG 
TTAGCCGGCT TTGCGGCCCG GGCGGAAACC AGTGTCGTCG TCGGCTACCA GCAGATCGTC
GGCCCGTTCA TTTCGGCAAT CGCGGATGGC CGCTTCGATG CCGCAGCCAA GGAGGCCGGC
TACTCGATCG ATTGGCGCCA GTTCAGCTCG GGAGGCGACA TTTCGACGGC GCTTGCATCG
GGTAATGTGC CGATCGGCGT TATCGGTTCG ACCGGTACGA CAGCCGCCGC GACCCGCGGC
GTCGAGCTCG AACTTTTCTG GATCCTCGAC AATATCGGCA AATCGGAAGC GCTTGTCGCA
CGCGAGGGAT CCGGCATCGC AAAGCCGGAA GATCTGATAG GAAAGAATGT CGGCGTTCCC
TTCGTGTCGA CCTCTCACTT CCATCTGCTG GTCGGCCTGG GAGAGGTCTG GAAAATCGAT
CCGCGGGAAG TGAACATCCT CAACATGAAG CCGCCGCAGA TCGTCGCCGC CTGGCAGCGC
GGCGATATCG ACGCCGCCTA TGTCTGGCCG CCGGCCCTTT CGGAGCTCCT GAAAACGGGT
AAGGTGATCT CGGATTCCGA GGCGGTCGGC GCGGCGAGCG TGCCCACATT CGACGGCCTC
GTGGTCGATA AGAAATGGGC CGAGGAAAAT CCGGATTTCA TGGCGGCCTT CACCAGGGTG
CTCGCCGAGT CCTATGCCGA TTTCAAGGCC AATGGAAGCG GCTGGACGGC GGACTCGCCG
GAGGTGCAGG GCATGGTCAA GTTGATCGGC GGCGACGCCG AGGGTATCGT CCAGGCCCTC
AACCTTCTAT CCTTCCCGAC CGCCGAGGAA CAGGTCTCCG ACAGGTGGCT TGGCGGCGGT
GCCGTCCGGG CACTGGAGGC GAGCGCCAGG TTTCTGGTCG AGCAGAAGCA GATCGACAAT
GCGCTCGACG ATTACGCGCC CTTCGTCAAC AGCGCCTACG CGAAAGAAGT CTCCAAGTAG
 
Protein sequence
MIHYRKFKLI SGALAIATGL LAGFAARAET SVVVGYQQIV GPFISAIADG RFDAAAKEAG 
YSIDWRQFSS GGDISTALAS GNVPIGVIGS TGTTAAATRG VELELFWILD NIGKSEALVA
REGSGIAKPE DLIGKNVGVP FVSTSHFHLL VGLGEVWKID PREVNILNMK PPQIVAAWQR
GDIDAAYVWP PALSELLKTG KVISDSEAVG AASVPTFDGL VVDKKWAEEN PDFMAAFTRV
LAESYADFKA NGSGWTADSP EVQGMVKLIG GDAEGIVQAL NLLSFPTAEE QVSDRWLGGG
AVRALEASAR FLVEQKQIDN ALDDYAPFVN SAYAKEVSK