Gene Smed_3713 details


Gene Information

Locus tag: Smed_3713
Symbol:
ID: 5318526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 155064
End bp: 156074
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 57%
IMG OID: 640775526
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001312459
Protein GI: 150375863
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
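
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 155064
end_bp = 156074
gene_length_bp = 1011
protein_length_aa = 336


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the listed gene length follows from the coordinates, and the protein length follows from the gene length.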


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.252604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGCC TTTCTGGTCT TTTGACGTGT GTTTCGCAGT GCATACAGAA CCTAAGAAGA 
GGGAACATGA CAATGCTGAA ACAGATCACT GCAATGACCA CCGGACTGCT CATGACGATC
GGCGTCGCCA ATGCCGCTGA GCCGGAGAGC TGCAAGACCG TCCGCTTCTC CGATGTCGGC
TGGACCGACA TCACGTCTAC GACGGCTGTC GCCTCGACCA TTCTCGAAGC GCTCGGCTAT
GCGCCGGAAA GCAAGCTGCT ATCGATCCCC GTCACCTATG CGAGCATGAA GAACAAGGAC
ATCGACGTCT ATCTCGGCGA TTGGCAGCCG AGCATGGAAG CCGACCGCAA GTCTTTCCTA
GAGGACAAGT CGATCGAAGT GATCGGCCCG AACCTCACCG GCGCCAAATA TACATTCGCC
GTGCCGAAAT ACGTCGCGGA CGCAGGCGTC AAGGACATTT CCGATCTCCA GAAGTTCGCC
GACAAGTTCG GCCGGAAGAT CTACGGGATC GAACCCGGCA ACAACGGCAA CCGCATGATC
CTCGACATGA TCAATAAGGG TGACTTCGGG CTGACCGGCT GGGAGCTCGT CGAGTCATCC
GAGCAGGGCA TGCTTGCCGA GGTCGAAAGA GCAACAAAGG ACGAGCAGTG GATCGTCTTT
CTCGGATGGG CTCCGCATCC GATGAATACG CGCTACCGGA TCGATTATCT CTCGGGCGCG
GACGCCTATT TCGGCCCGAA TTACGGGGGC GCCGATATCT ATACCAACAT TCGGGCCGGC
TATGCGCAGG AATGTCCGAA TGTCGCGAGA TTTGTGACGA ATCTTCGCTT CACCCTGGAC
ATGGAAAACG AGATCATGAA CGGGATATTG AACGACGGAA AGGACCCGAA GGAAGCCGCA
GCCGATTGGT TGAAGGCTCA TCCGGATGCA GTCGCTCCGT GGCTTGAAGG TGTGACGACA
TACGATGGCG GGCCGGCAGC GGGCGCGGTC GAGATGGCGC TGAAAAGCTA G
 
Protein sequence
MSRLSGLLTC VSQCIQNLRR GNMTMLKQIT AMTTGLLMTI GVANAAEPES CKTVRFSDVG 
WTDITSTTAV ASTILEALGY APESKLLSIP VTYASMKNKD IDVYLGDWQP SMEADRKSFL
EDKSIEVIGP NLTGAKYTFA VPKYVADAGV KDISDLQKFA DKFGRKIYGI EPGNNGNRMI
LDMINKGDFG LTGWELVESS EQGMLAEVER ATKDEQWIVF LGWAPHPMNT RYRIDYLSGA
DAYFGPNYGG ADIYTNIRAG YAQECPNVAR FVTNLRFTLD MENEIMNGIL NDGKDPKEAA
ADWLKAHPDA VAPWLEGVTT YDGGPAAGAV EMALKS