Gene Smed_2164 details


Gene Information

Locus tag: Smed_2164
Symbol:
ID: 5323024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2233979
End bp: 2235424
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 60%
IMG OID: 640791102
Product: extracellular solute-binding protein
Protein accession: YP_001327832
Protein GI: 150397365
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
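
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2233979, 2235424)
print(length_bp)                  # 1446
print(protein_length(length_bp))  # 481
```

Both results match the Gene Length and Protein Length values listed in the record.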


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGAGA AGGAGAAAGA CCTCATTGGC GCATTCCTGC GCGGAGAGGT GGACCGTCGC 
GGCCTGTTGA AGGGCCTTGG CGCGGCAGGC CTGACGGCGG GCACCGCCGG CACCCTGTTC
AACATGATGT CGACCCAGGC CCTTGCTGCC GACTTCGACT GGAAGGCACA TTCCGGCAAG
TCGCTGAAAC TGTTGCTGAA CAAGCATCCT TACGCGGATG CGATGATTGC CAATCTGCAG
GCGTTCAAGG ACCTTACCGG TATCGAAGTC ACCTATGACG TCTTCCCGGA GGACGTCTAT
TTCGACAAGG TGACGGCGGC GCTTTCTTCC GGCTCGTCGG AATACGACGC CTTCATGACC
GGCGCTTACA TGACCTGGAC CTACGGGCCG GCCGGCTGGA TCACCGACCT CAATGAATGG
ATCAAAGATC CGTCGAAGAC CAATCCGCAA TATGGCTGGG ACGACTTCCT GCCGGGCGTC
AAGGCATCCT GCGCCTGGAA CGGTCAGCCG GGCGGCGCGC TCGGTTCGGA AGATGCCAAG
CAGTGGTGCA TTCCGTGGGG CTACGAGCAG AACAATCTCT CCTATAACCA GGAAATGTTC
GAAAAGGCCG GCGCCAGCGT TCCGAAGAAC CTCGATGAAC TCGTCGCTAC GGCGGCAAAG
CTCAACAAGG ATGTCGGCGG CGGTGTCTAC GGCATCGGCG TGCGTGGTTC CCGTTCCTGG
GCAACCATTC ATCCGGGTTT CCTCTCCGGC TACGCCAATT TCGGCCAGAA GGATCTGAAC
GTCTCGGAGG ACGGCAAGCT TTCGGCCGCG ATGAACACGG CGGAGTCCAA GTCCTTCCAC
GCCAAATGGG TGCAGATGAT CCAGGAAAGC GGCCCCAAGG ACTGGTCGAC CTATACCTGG
TATCAGGTCG GCACCGACCT CGGCGCCGGC GCTTCCGCCA TGATCTACGA CGCCGACATC
CTCGGCTATT TCATGAATGG CGGCGACAAC AAGATGGCCG GCAAGCTCGC TTACGCGCCC
TTTGCCGCCA ACCCTGAGGC GAAGGCTCCT ACGCCGAACA TCTGGATCTG GTCGCTGGCC
ATGTCCAATT TCGCGAAGGA TAAGGATGCG ACCTGGTATT TCCTGCAATG GGCATCGGGT
CTCGAGCACG CGATCTTCGG CGCAACCAAG ATGGACTTCG TCAACCCGGT CCGGGCATCC
GTCTGGAAGG ACGAGATCTT CCGGGAGCGG CTGAACAAGA GCTATCCCGG TTATGTGGAG
ATGCACGACG TTTCGGCGCC GGGCGCGAAG ATCCACTTCA CCGCCCAGCC TCTCTTCTTC
GATCTCACCA CCGAATGGGC GGCGACGCTG CAGAAGATGG TGGCGAAGGA AGTGCCGGTC
GACGAAGGTC TCGACAGGCT TGCCGAGAGC ATCAACCGGC AACTTGCGGA AGCCGGGCTC
GGCTGA
 
Protein sequence
MYEKEKDLIG AFLRGEVDRR GLLKGLGAAG LTAGTAGTLF NMMSTQALAA DFDWKAHSGK 
SLKLLLNKHP YADAMIANLQ AFKDLTGIEV TYDVFPEDVY FDKVTAALSS GSSEYDAFMT
GAYMTWTYGP AGWITDLNEW IKDPSKTNPQ YGWDDFLPGV KASCAWNGQP GGALGSEDAK
QWCIPWGYEQ NNLSYNQEMF EKAGASVPKN LDELVATAAK LNKDVGGGVY GIGVRGSRSW
ATIHPGFLSG YANFGQKDLN VSEDGKLSAA MNTAESKSFH AKWVQMIQES GPKDWSTYTW
YQVGTDLGAG ASAMIYDADI LGYFMNGGDN KMAGKLAYAP FAANPEAKAP TPNIWIWSLA
MSNFAKDKDA TWYFLQWASG LEHAIFGATK MDFVNPVRAS VWKDEIFRER LNKSYPGYVE
MHDVSAPGAK IHFTAQPLFF DLTTEWAATL QKMVAKEVPV DEGLDRLAES INRQLAEAGL
G
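
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons), and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, with the third
# base varying fastest; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as shown in this record.
first_row = "ATGTACGAGA AGGAGAAAGA CCTCATTGGC GCATTCCTGC GCGGAGAGGT GGACCGTCGC"
print(translate(first_row))  # MYEKEKDLIGAFLRGEVDRR
```

The output corresponds to the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` in the same way would allow the complete protein sequence to be compared against the record.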