Gene Smed_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2064 
Symbol 
ID5322923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2114764 
End bp2116113 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content61% 
IMG OID640791001 
Productextracellular solute-binding protein 
Protein accessionYP_001327732 
Protein GI150397265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0548204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGC CAATAATTTC TTCGCCCAGA CCGCACTTGA GCAGACGGCA GGTCTTGCAG 
GGCGTGGCGG CCGCGGCCGG TGCCGGATTG GCGGGATTTC CCGGACAGCT CGGCGCAGCA
AGTCAGGTCA AGGAACTGAT CGTTCTAACC GGAACCACCC CATGGCTCCC GGCTTACCAG
AAGGCCGCAG CAGCCTACGA GGCGGAGAAG GGCATCAAGA TCACGTTCCG CGCATTTCCC
TATGGCGGAA TGCGTACCCA GATGACCAAC GCAATTCAGA GCAAGAACGC GGCCTTCGAC
GTCTTCCAGC TCGACGAACC CTGGACCGGC CAGTTCTACG ACAACGGATG GGTCAAACCG
CTCGAGGAGA TTATCGAGGG CTACAAGCTC GACCCGAACA TTCTGACCTA TGACAGCCTT
CCCCTCTGGG ACAAACAGCA GAGGCGCGGC AAAGCCGGCG GGAAGATCAT GGGCCTGCCG
ATAAACGGCA ACGTCGATCT ATTCGTCTAC CGAAAGGACA TCTATGAGAA GCTGGGTCTG
ACGGTTCCGA AGACCTGGGA TGAGGCGATC GAAAACGGCA AGAAGGCCGT CGAAGCAGGC
GAAGTCCGAT ATGGTTATGT CACCCGTGGC CAACCCACGG CCGGCGGGCA ATCGGTCAGC
TTCGAGTTCA TGCACGTGCT TTACGGCTTC GGCGGCGATT GGTTCAAGGC CGACGGCGCT
ACACTCGTCC CGACAATCAA CAATGACGCT GCAAAGACCG CGGCAGCGAC TTTCCGCCGG
CTTCTTGAGC TTGGGCCCTC ACGGTCTCAA ACGGTCGGTC AGGCGGACTG CATCGCCCTG
ATGCAGAGCG GCCAGGCTCT GCAGGGTCAC TTCGTTGCTG CCGGCATGCC CCAGCTCGAA
GACGAGACCC GTTCATCGGT CGTCGGCAAA TGCGGCTACA CCATCGTTCT GGCAGGATCA
CTCGACCATC CCGTTCCGGC AAGTGGTGTC TGGTCGCTTT GCGTCCCGGC GGATCAGGCG
CCCGAACGGC AGCTCGCGGC AGCCGAGTTC ATTATGTGGA TGCTGGACAA GAAGCAGCAG
GAAGCCTTTG CAGGCGCTGG CGGGATGCCG ACCCGCAAAG ACGTCGACGT GTCCGGAGCA
GGCGCATTGC GACCGATCAT GGAAGCCGCC AGGGATTCGG CGGCCCTCAC GCAGGGCGCC
ATCCGCTATG TCTTCGCAGC CCAGATGCTT GAAGCCGTCG AACCGGTCAT CGGTCAGATC
GGCTCCGGTG ACCTGGCCGT CGACGAAGGA CTGGACGAAC TGCAAGCGAA ACTCGCGGAG
ATCGCCAAGG CGAGCGGCTT CGCGAAATAG
 
Protein sequence
MIKPIISSPR PHLSRRQVLQ GVAAAAGAGL AGFPGQLGAA SQVKELIVLT GTTPWLPAYQ 
KAAAAYEAEK GIKITFRAFP YGGMRTQMTN AIQSKNAAFD VFQLDEPWTG QFYDNGWVKP
LEEIIEGYKL DPNILTYDSL PLWDKQQRRG KAGGKIMGLP INGNVDLFVY RKDIYEKLGL
TVPKTWDEAI ENGKKAVEAG EVRYGYVTRG QPTAGGQSVS FEFMHVLYGF GGDWFKADGA
TLVPTINNDA AKTAAATFRR LLELGPSRSQ TVGQADCIAL MQSGQALQGH FVAAGMPQLE
DETRSSVVGK CGYTIVLAGS LDHPVPASGV WSLCVPADQA PERQLAAAEF IMWMLDKKQQ
EAFAGAGGMP TRKDVDVSGA GALRPIMEAA RDSAALTQGA IRYVFAAQML EAVEPVIGQI
GSGDLAVDEG LDELQAKLAE IAKASGFAK