Gene Smed_1959 details


Gene Information

Locus tag: Smed_1959
Symbol:
ID: 5322818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2010852
End bp: 2011796
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 62%
IMG OID: 640790897
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001327628
Protein GI: 150397161
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID:
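
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2010852
end_bp = 2011796

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```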


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.874851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.400976
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAGAG GCGGAGAGTT CTTTTGCGCG GCGGCGCTGG CGGTAGCGAT GACTGCGCCG 
GCAGCCGCGG CGGATCTGGT AATCGCGATG CCGCCGTGGC CGTCAGGTCA GGCGGCGGCG
AACATCCTCA AATTCGGCAT CGCCAAGAAA TTCAGCCTCG ATGCGGAGGT GCGGGAACTC
GGTACGCTCA ACGCTTTCGT CGGCCTAGAG AAGGGCGAAA TCGACATCCA GCCGGAGGTT
TGGCGGCCAA ATTTCGACGA GCTCGTCCGC AAGTTCGTGA CCGAAAAGGG CGCCGTGACG
CTGAGTACGC GTGCGGTACC TGCATGGCAG GGGATTTGCG CCACGCCGGA GGCGGCCGCG
ACGATCAAGA CCGTTGCGGA TCTCGGCGAC CCGGCCAAGA CGAAATTCCT GGACACCGAC
GGTGACGGAC GCGGAGAAAT GTGGATCGGC GCCGCCGAAT GGCTTTCGAC CGGAATCGAA
CGTGCGCGGG CGGCCGGCTA TGGCTATGCG GCAAACCTGA CGCTTGTCGA GGCCAAGGAA
GATGTTGCAA TGGCGGCGGT GGATGCGGCA ATCGCGACGG CGCGGCCGAT GGTCTTCTAC
TGCTACGCTC CGCATCATGT TTTCAAGCTG CACCAGATCT CCCGGCTTGA GGAGCCGCCC
CATGATCCTT CCAAATGGAA AATCGCGCCG CCGAACGATC CGCTATGGGT CAGCAAGTCG
AGCGCGTCCA CGGCCTGGGA CGCGGGCCAG TTCCAGATCG GCTATGCGAC GGCTTTTGCA
AAGAAACATC CCGAAGTCGC GCAGTTCCTT CAGAATGTGG ACTTCTCCCC GGATGAAGTG
ACGGCGATGA GTTATGCGCT CCAAGTCGAG CGGCAGACGC CGGTGGACTA CGCCAGGAAG
TGGGTTGAAA GCCACGCGGA ACGGATCGAC GGATGGGCGA AATGA
 
Protein sequence
MRRGGEFFCA AALAVAMTAP AAAADLVIAM PPWPSGQAAA NILKFGIAKK FSLDAEVREL 
GTLNAFVGLE KGEIDIQPEV WRPNFDELVR KFVTEKGAVT LSTRAVPAWQ GICATPEAAA
TIKTVADLGD PAKTKFLDTD GDGRGEMWIG AAEWLSTGIE RARAAGYGYA ANLTLVEAKE
DVAMAAVDAA IATARPMVFY CYAPHHVFKL HQISRLEEPP HDPSKWKIAP PNDPLWVSKS
SASTAWDAGQ FQIGYATAFA KKHPEVAQFL QNVDFSPDEV TAMSYALQVE RQTPVDYARK
WVESHAERID GWAK
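
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the compact NCBI-style string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which does not matter here because the gene starts with ATG. Only the first and last blocks of the gene sequence are included to keep the example short. The function and variable names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last blocks of the gene sequence, copied with their spacing.
# The last block begins at nucleotide 901, so it is in frame.
first_block = "ATGAGGAGAG GCGGAGAGTT CTTTTGCGCG GCGGCGCTGG CGGTAGCGAT GACTGCGCCG"
last_block = "TGGGTTGAAA GCCACGCGGA ACGGATCGAC GGATGGGCGA AATGA"

n_terminus = translate(first_block)
c_terminus = translate(last_block)

print(n_terminus)  # compare with the start of the protein sequence
print(c_terminus)  # ends with '*' for the stop codon
```

Passing the full gene sequence to `translate` in the same way should give the complete protein followed by a `*` for the stop codon.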