Gene Smed_4991 details


Gene Information

Locus tag: Smed_4991
Symbol:
ID: 5318712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1504933
End bp: 1506063
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID: 640776773
Product: hypothetical protein
Protein accession: YP_001313705
Protein GI: 150377109
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
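
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1504933
end_bp = 1506063
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, remainder, residues)
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.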


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.723293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00320461
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAACCTCG TCCTCAAGGG AATGACCTGG AACCACCCGC GCGGCTACGA CCCGATGGTG 
GCCTGCTCGC GGGCCTGGCA GGAGACGTCG GGCGTCGAGA TCCAATGGGA GAAGCGGTCG
CTTCAGGACT TCGAGACGTT TCCGGTGGAA GTTCTTGCAA GGGATTACGA CCTGATCGTC
ATAGATCATC CGCATGTCGG CCAGATCACC AGCGAAAACT GCCTGCTTCC GCTGGACGTG
CCCGGGCGTG AGGCCGATCG GCAGGCTCTC TCGGCGGCAA GTGTCGGCCC CTCCTATCGC
AGCTACGAAT GGAACGCCCG CCAATGGGCC TTTCCGATCG ATGCCGCGAC CCAGGTGCAG
GCCTGGCGGC CGGATCGGAC CGAGCGCCTC CGGACCTGGC GGGAGGTGCT GGACCTGGCG
CGTTCAGGCG GCGTCGTTCT GCCGCTTCGC CCGCCCCATT CGCTGATGAG CTTCTTCACC
CTCTGCGGCA ATCTGGGGCG GCCCTGCCGC AGCAACGGGC AGGGCGAGCT CGTCGATGCC
GAAACGGGTG CCGCTGCAAT CGAGCTGCTG AAGGAGATCG CAGCCCTGGT CGACCCGGAC
TGCTTCGACA TGGATCCGAT CGCAGCCTTC GAGGCGATGG CGGAAAAGGG GTCCGCCTTT
GCCTGCGCGC CGCTCATCTA CGGCTATGTC AGCTACTCGA TGGCGGGCTT TCGGCCGGCG
CTCATCCGCT TCGGCGATAT TCCTGAAATC GGCGCGGCAG GACCGGTCGG CTCGGCTCTC
GGCGGGACGG GCATTGCGGT ATCCGCCTTT TCGAAGGCGC CGGAGCAGGC GATCGATTTT
GCCTATTGGG TGGCGAGCGG CGACGTGCAG CGGGGTATCT ACGCCGCCTG CGGCGGCCAG
CCCGGCCATG GCGCCGCCTG GCAGGACGAG ACGGTCAATG CGGCGACGCA TGATTTCTAC
CGCGCGACCC GGGCGACGCT CGAGGCTGCC TGGCTCCGGC CGCGCCATGA CGGCTATATG
GCATTCCAGC AGGCGGGTTC GGATCGCCTG AACGAAGGGC TCAAGCGCGG CGAGAGGCCA
AGCCTCGTGG CCGAAGAACT CAATCGGCTG TTCTGCGAGA GTTTTCGCTG A
 
Protein sequence
MNLVLKGMTW NHPRGYDPMV ACSRAWQETS GVEIQWEKRS LQDFETFPVE VLARDYDLIV 
IDHPHVGQIT SENCLLPLDV PGREADRQAL SAASVGPSYR SYEWNARQWA FPIDAATQVQ
AWRPDRTERL RTWREVLDLA RSGGVVLPLR PPHSLMSFFT LCGNLGRPCR SNGQGELVDA
ETGAAAIELL KEIAALVDPD CFDMDPIAAF EAMAEKGSAF ACAPLIYGYV SYSMAGFRPA
LIRFGDIPEI GAAGPVGSAL GGTGIAVSAF SKAPEQAIDF AYWVASGDVQ RGIYAACGGQ
PGHGAAWQDE TVNAATHDFY RATRATLEAA WLRPRHDGYM AFQQAGSDRL NEGLKRGERP
SLVAEELNRL FCESFR
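
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial code listed in the record), GTG is one of the permitted initiation codons, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 and last 21 bases of the gene sequence only. The codon table is deliberately partial (just the codons that occur in these two fragments), `START_CODONS` is a subset of the table 11 start codons, and the `translate` helper is a hypothetical name written for this illustration, not a library function.

```python
# First 30 and last 21 bases of the gene sequence listed above (spaces removed).
first_30 = "GTGAACCTCGTCCTCAAGGGAATGACCTGG"
last_21 = "TTCTGCGAGAGTTTTCGCTGA"

# Partial standard codon table: only the codons present in the two fragments.
CODON_TABLE = {
    "GTG": "V", "AAC": "N", "CTC": "L", "GTC": "V", "AAG": "K",
    "GGA": "G", "ATG": "M", "ACC": "T", "TGG": "W", "TTC": "F",
    "TGC": "C", "GAG": "E", "AGT": "S", "TTT": "F", "CGC": "R",
    "TGA": "*",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiation codon at the start of a CDS is read as methionine.
    if is_cds_start and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


n_term = translate(first_30, is_cds_start=True)
c_term = translate(last_21)
print(n_term)  # MNLVLKGMTW
print(c_term)  # FCESFR*
```

The two results agree with the first ten and last six residues of the protein sequence above, with the trailing `*` standing for the TGA stop codon.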