Gene Smed_2305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2305 
Symbol 
ID5323166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2384946 
End bp2385902 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content60% 
IMG OID640791243 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001327972 
Protein GI150397505 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.738548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.268073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGA CACTTTCTCG CGAATTCATG CTGGCAGGTG CCGTTTGCAT GGCAACCCTG 
ACCGCCGGAC CCGCATTCGC GGCGGAGCCG GAAAGCTGCG GTACGGTCCG CTTCTCCGAC
GTCGGCTGGA CCGATATCAC AGCAACCACC GCGACCGCGA CGACCATCCT CGAAGCGCTC
GGTTACGAGA CGGACGTGAA GGTTCTGTCG GTGCCCGTTA CCTACACCTC GCTGAAGAAC
AAGGACATCG ACGTCTTTCT CGGCAACTGG ATGCCGACCA TGGAAGCGGA CATCGCCCCC
TATCGCGAAG ACAAGTCCGT CGAGACGGTA CGCGAGAACC TCGCAGGTGC GAAATACACG
CTTGCGACAA ATGCCAAGGG CGCGGAGCTC GGCATCAAGG ACTTCAAGGA TATCGCCGCG
CACAAGGAGG AGCTCGACGG CAAGATCTAC GGGATCGAGC CGGGCAATGA CGGCAACCGC
CTGATCATCG ACATGGTCGA AAAAGGCACT TTCGATCTCA AGGGCTTCGA AGTCGTCGAA
TCTTCCGAGC AGGGCATGCT CGCGCAGGTC GCCCGCGCTG AAAAATCCGG CGACCCGATC
GTTTTTCTCG GATGGGAGCC GCATCCGATG AACGCGAATT TCAAGCTCAC CTATCTATCC
GGTGGCGATG ACGTGTTCGG CCCCGACTAC GGTGGCGCCA CCGTGCATAC CAATGTGCGC
GCCGGCTACA CGACCGAATG CCCCAATGTC GGCAAGCTTC TCCAAAACCT CTCGTTTTCG
CTCCAGATGG AGAACGAGAT CATGGGCAAG ATCCTGAACG ATGGCGAAGA CCCGGAAAAG
GCTGCAGCTT CGTGGCTGAA GGACAATCCG CAAGCAATCG AACCGTGGCT TGCGGGGGTC
ACCACGAAGG ACGGCGGCGA TGGGCCGGCC GCCGTCAAGA GCGCGCTGGG CCTCTGA
 
Protein sequence
MIRTLSREFM LAGAVCMATL TAGPAFAAEP ESCGTVRFSD VGWTDITATT ATATTILEAL 
GYETDVKVLS VPVTYTSLKN KDIDVFLGNW MPTMEADIAP YREDKSVETV RENLAGAKYT
LATNAKGAEL GIKDFKDIAA HKEELDGKIY GIEPGNDGNR LIIDMVEKGT FDLKGFEVVE
SSEQGMLAQV ARAEKSGDPI VFLGWEPHPM NANFKLTYLS GGDDVFGPDY GGATVHTNVR
AGYTTECPNV GKLLQNLSFS LQMENEIMGK ILNDGEDPEK AAASWLKDNP QAIEPWLAGV
TTKDGGDGPA AVKSALGL