Gene Smed_2456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2456 
Symbol 
ID5323317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2538036 
End bp2539391 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content61% 
IMG OID640791394 
Productextracellular solute-binding protein 
Protein accessionYP_001328123 
Protein GI150397656 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.674954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TGCTGAAAAA GGGCTTGCTC GCCGCGGCGC TGATGGCCTC GACCGCGCCT 
GCGGCCATGG CGCAGGAATG CGCGGATGCG GATCGCGTGC CGATCACCTG GTCGACGATT
GCCGGCTTCT ACACCGATGC GATGGCAGAG CTGGTATCGG GCTTCGAGGC AGGCCATTGC
GTGAAGGTCA ATGTCGTCAA CATAGACAAC TCACAGCTCT ACAACAAGCA GGTCATCGAG
ATGGTCGGCC AGACCGGTGC CTATGATGTC GTGACCCTCG AAACCTCGGA GAAGGCGGAA
TTCGCAGAGA ACGGTTTCAT CCTGCCGATG ACCGAATATT TCGCGGACAA GAAGGCGCAG
CTCGACGACG TCGCGCCGAC GCTCGCTGCT CTGACGACGC AGTACAAGGA TGATGTCTGG
GGTCTGCCTT ATTACACCTA TACGGCCGGA TATATCTATC GTGCCGACCT GTTCGACGAT
CCCACGGAAA AGGAGGCCTT CAAGAAGCGC TTCAACTACG ACCTGGCCGT GCCGACCACC
TGGGCGCAGC ACCGCGATAT CGCCGAGTTC TTCACGCGTA AGGCCGGTGA GACGCTGAAG
GGCGAAAAGC TCACCAAGGA CTTTTACGGC GTCGGCCTGA TGGCCGGTCC TTTCCCGGAA
ATCCAGGACG AGATGTCCGG CGTGCTCTGG TCGCAAGGCG CGGACTGGCT CACCGACGAG
GGCAAGGTGC CGGTCGATGC CGTCGAAAAG GCGATGAACG ACTATCTGGA ACTGGTCAAG
TATGCGCCGC CCGCCGCGCT CACCGTTACC TATGACGGCG TCATGAACCA GATGAAGGAC
GGGCAGATCG CCCAGACCTA TTCCTTCTTC CTCGACCAAT GGCCGAATGC AGTGCAGACC
GAAACCAGCG TGGCCGGCGC CAAGATGGGC GTCGCCGAGG CGCCGGAGAA GAAGGCCTAT
ATCGGCGGCT TCCTGCTGGC GGTTTCCGCA TCCTCCGCCC ACCCGAAGGA GGCGATGGAC
TTCGTCGCCC ACATCGGCGG ACATGATGCG CAGATGGAGT TCGCCAAGGC CGGCGGTACC
TCGACGCTGA TGAGCGTTCT TTCCGATCCG GCCTTTGCCG CGCCGGAAAG CCGCGGGAAG
ACCGGCCACT TTTCAACGCT CCTGGAGATC TTCGACTCGA TGAAGGGTTT CCGCTCGAAC
CTGTTCGATA CGCCGTTCGG CGCGAAGATC TACAACACGA TGCAGATCCC GCTGCAATCG
GCCGCTGCCG GTCAGATTTC GGCGCGCCAG GCCGCCGAAC GGCTTGCCGT CGAAGTCGAG
AAGATTTGCG GCGGCCCGTG CCCGATCGGC AAGTGA
 
Protein sequence
MKSMLKKGLL AAALMASTAP AAMAQECADA DRVPITWSTI AGFYTDAMAE LVSGFEAGHC 
VKVNVVNIDN SQLYNKQVIE MVGQTGAYDV VTLETSEKAE FAENGFILPM TEYFADKKAQ
LDDVAPTLAA LTTQYKDDVW GLPYYTYTAG YIYRADLFDD PTEKEAFKKR FNYDLAVPTT
WAQHRDIAEF FTRKAGETLK GEKLTKDFYG VGLMAGPFPE IQDEMSGVLW SQGADWLTDE
GKVPVDAVEK AMNDYLELVK YAPPAALTVT YDGVMNQMKD GQIAQTYSFF LDQWPNAVQT
ETSVAGAKMG VAEAPEKKAY IGGFLLAVSA SSAHPKEAMD FVAHIGGHDA QMEFAKAGGT
STLMSVLSDP AFAAPESRGK TGHFSTLLEI FDSMKGFRSN LFDTPFGAKI YNTMQIPLQS
AAAGQISARQ AAERLAVEVE KICGGPCPIG K