Gene Smed_1728 details

Gene Information

Locus tag: Smed_1728
Symbol:
ID: 5322586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1808944
End bp: 1810179
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 62%
IMG OID: 640790666
Product: extracellular solute-binding protein
Protein accession: YP_001327398
Protein GI: 150396931
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
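
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the final codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1808944, 1810179)
print(length)                  # 1236, matching the Gene Length field
print(protein_length(length))  # 411, matching the Protein Length field
```

Under these assumptions the reported Start bp, End bp, Gene Length, and Protein Length values agree with one another.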


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00458236
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.504509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAC GTTTTCTAGC CGCCGCTTTA GGCGCAACCG CTGCCTTGCC CTTCGGTGCG 
GCCAACGCGA CCGATCTCGA GGTCACGCAT TGGTGGACTT CCGGCGGTGA GGCGGCCGCG
GTTGCCGAGC TCGCGAAAGC TTTCGATGCA ACCGGCAACA AGTGGGTCGA CGGCGCGATC
GCCGGCTCCG GCGGAACGGC ACGCCCGATC ATGATCAGCC GCATTACCGG GGGCGATCCG
ATGGGCGCAA CCCAGTTCAA CCACGGCCGG CAGGCGGAAG AGCTGGTGCA GGCGGGCCTG
ATGCGCGACC TTACCGACAT CGCCACGCAG GAAAACTGGA AGGAGATCGT GAAGCCGTCG
AGCCTGCTTG ACTCCTGCAC GATCGAAGGG AAGATCTACT GCGCGCCTGT CAATATCCAT
TCCTGGCAGT GGCTGTGGCT CTCTAACGCC GCGTTCAAGA AGGCGGGCGT TGAAGTCCCG
AAAAACTGGG ACGAGTTCGT GGCCGCGGCT CCAGCGCTGG AGAAGGCCGG AATCGTTCCG
CTCGCCGTCG GCGGACAGCC GTGGCAGGCA AACGGTGCCT TCGACGTGCT GATGGTTGCG
ATCGCAGGCA AGGACACCTT TGAAAAGGTC TTTGCCGAGA AGGACGCCGA AGTGGCAGCC
GGACCGGAAA TTGCCAAGGT CTTCAAGGCA GCCGACGATG CCCGTCGCAT GTCGAAGGGC
ACCAACGTTC AGGACTGGAA CCAGGCGACG AATATGGTCA TCACCGGCAA GGCCGGTGGG
CAGATCATGG GCGACTGGGC CCAGGGCGAG TTCCAGCTCG CGGGACAGAA AGCCGGCGTC
GACTACACCT GCCTGCCGGG TCTCGGCGTG AACGAGGTGA TTTCGACGGG TGGCGATGCG
TTCTACTTCC CGCTTATCGA AGATGAGGAA AAGTCGAAGG CGCAGGGAGT GCTGGCATCG
ACCTTGCTTA AGCCGGAAAC GCAGGTGGCC TTCAACCTGA AGAAGGGCTC GCTGCCGGTG
CGCGGCGATG TCGATCTTGC GGCCGCCAAC GACTGCATGA AGAAGGGTCT CGATATCCTG
GCCAAGGGCA ATGTGATCCA GGGCACGGAT CAGCTTCTGT CGGCCGATAG CCAGAAGCAG
AAAGAGGACC TCTTCTCGGA GTTCTTCGCG AATCACTCAA TGACGCCGGA AGACGCGCAG
AAGCGTTTCG CCGACATCAT CGCGTCCGCG GATTGA
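
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence above. The sketch below is one plain way to do that in Python, assuming the sequence is pasted as text in the same 10-base blocks shown here; `gc_fraction` is a hypothetical helper name, and the rounding convention the database uses for its percentage is not stated in the record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()  # drop the block spacing and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Illustration on the first line of the gene sequence only; pass the full
# 1236 bp sequence to compare against the reported GC content.
first_line = "ATGAAATTAC GTTTTCTAGC CGCCGCTTTA GGCGCAACCG CTGCCTTGCC CTTCGGTGCG"
print(f"{gc_fraction(first_line):.1%}")
```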
 
Protein sequence
MKLRFLAAAL GATAALPFGA ANATDLEVTH WWTSGGEAAA VAELAKAFDA TGNKWVDGAI 
AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDIATQ ENWKEIVKPS
SLLDSCTIEG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIVP
LAVGGQPWQA NGAFDVLMVA IAGKDTFEKV FAEKDAEVAA GPEIAKVFKA ADDARRMSKG
TNVQDWNQAT NMVITGKAGG QIMGDWAQGE FQLAGQKAGV DYTCLPGLGV NEVISTGGDA
FYFPLIEDEE KSKAQGVLAS TLLKPETQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLDIL
AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NHSMTPEDAQ KRFADIIASA D