Gene Smed_0650 details

Gene Information

Locus tag: Smed_0650
Symbol:
ID: 5321486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 702769
End bp: 703767
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 61%
IMG OID: 640789586
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001326341
Protein GI: 150395874
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID:
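
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 702769
end_bp = 703767
listed_gene_length = 999      # bp
listed_protein_length = 332   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == listed_gene_length)        # True
print(computed_protein_length == listed_protein_length)  # True
```

Under these assumptions, 703767 - 702769 + 1 = 999 bp, and 999 / 3 - 1 = 332 aa, which agrees with the listed lengths.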


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.324621
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TTCTTGCAAC GACATGCCTG GCCGCTGGTC TTCTTGGACT GGGCAGTACG 
GCATCGGCGG CGGAATGCGG CGATGTGACC ATTGCCAACA TGAACTGGCA GAGCGCTGAA
GTCCTGGCGA GTGTGGACAA GTTCATTCTG ACCGAGGGTT ACGGCTGCAA TGCCGATCTC
GTCGTGGGCG ATACGGTGCC GACCATCACC TCGATGATCG AAAAGGGCGA GCCGGACATT
GCGCCGGAAG GCTGGGTCGA TCTGCTGCCC GACGTCGTGA ACCGTGGTCT CGAGGAAGGC
AAGCTTGTAG GCGCCGCAGT GGCGCTTTCA GACGGCGCCG TCCAGGGTTG GTGGGTGCCG
AAATATATCG TCGACGCCAA TCCGGACATC AAGACGATCG ACGACGTCCT GAAGCACAAG
GACCTCTTCC CGGATCCCGA AGATCCAAGC AAGGGCGCGA TTTTCAACGG CCCGCAGGGC
TGGGGCGGCA CGGTCGTTAC GACGCAGCTC TATAAGGCTT ACGGCGCCGA GCAGGCGGGC
TTCACGTTAG TCGATACCGG CTCGGCAGCC GGCCTCGACG GATCGATTGC CAAGGCGTAT
GAGCGCAAGC AGGGCTGGGT CGGCTACTAC TGGGCTCCGA CGGCGCTGCT AGGCAAGTAC
GAGATGGTCA AGCTCGGCCA TGGCGTTCCG AACGACATGG CGGAATGGAA GCGTTGCAAT
ACGGTTGCGG ACTGCCCGGA CCCGAAGAAG AACGATTGGC CGAAGGACAA GGTCCAGACG
CTGGTGACCA AGGAATTTGC CGATCGTGCT GGTCCGGCCA TGGAGTACCT CAATACGCGC
GCCTGGACGA ACGACACGGT GAACAAGCTG ATGGCCTGGA TGACCGACAA TCAGGCGAGC
GGCGAGGAAG GTGCGAAGCA CTTCCTCGAG GAGAACCCGG ACCTCTGGAC CAAGTGGGTC
TCTCCCGAAG TCGCCGAGAA GATCAAGTCG GCCCTTTAG
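
The GC content field can be recomputed from a gene sequence in this layout. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base groups separated by whitespace; `gc_content` is a hypothetical helper name, not part of the database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between groups."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Only the first three 10-base groups of the gene sequence above,
# used here as a small worked example (12 of 30 bases are G or C).
first_groups = "ATGAAAAAAC TTCTTGCAAC GACATGCCTG"
print(gc_content(first_groups))  # 40.0
```

Passing the full 999 bp gene sequence to the same function gives a value that can be compared with the GC content listed in the Gene Information section.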
 
Protein sequence
MKKLLATTCL AAGLLGLGST ASAAECGDVT IANMNWQSAE VLASVDKFIL TEGYGCNADL 
VVGDTVPTIT SMIEKGEPDI APEGWVDLLP DVVNRGLEEG KLVGAAVALS DGAVQGWWVP
KYIVDANPDI KTIDDVLKHK DLFPDPEDPS KGAIFNGPQG WGGTVVTTQL YKAYGAEQAG
FTLVDTGSAA GLDGSIAKAY ERKQGWVGYY WAPTALLGKY EMVKLGHGVP NDMAEWKRCN
TVADCPDPKK NDWPKDKVQT LVTKEFADRA GPAMEYLNTR AWTNDTVNKL MAWMTDNQAS
GEEGAKHFLE ENPDLWTKWV SPEVAEKIKS AL
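
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI base ordering (TCAG) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. It assumes the sequence starts in frame at the first base and contains only A, C, G and T; `translate` is an illustrative helper, not a database function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence up to (not including) the first stop codon.

    Whitespace between base groups is ignored. Alternative start codons
    (for example GTG or TTG) are not converted to M; that does not matter
    for this gene, which starts with ATG.
    """
    cleaned = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 39 bases of the gene sequence above.
print(translate("ATGAAAAAAC TTCTTGCAAC GACATGCCTG"))             # MKKLLATTCL
print(translate("TCTCCCGAAG TCGCCGAGAA GATCAAGTCG GCCCTTTAG"))   # SPEVAEKIKSAL
```

Both fragments match the corresponding start and end of the listed protein sequence. The last fragment is in frame because the preceding 960 bases are a multiple of three.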