Gene Smed_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3345 
Symbol 
ID5324229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3545188 
End bp3546408 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID640792296 
Productphosphopentomutase 
Protein accessionYP_001329001 
Protein GI150398534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1015] Phosphopentomutase 
TIGRFAM ID[TIGR01696] phosphopentomutase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCG CCTTCCTTTT CGTCCTCGAT TCCTTCGGCA TCGGCAATGC GCCGGACGCG 
GGGGCTTTCG GCGATCTCGG CGCCGATACG CTCGGACACA TAGCGGAATT CTGCGCGGCC
GGGGCCGCCG ACCGGGCGGG CCTCAGGGAA GGGCCGCTCA ATCTGCCAAA CATGTCAGCG
CTGGGCCTCA TGCATGCGGC GCGGCTGGCG ACCGGTCGGC TGCCCGCCGG CATGGCGCTG
CCGGAGCGCG TTTACGGCGT CTACGGCGCC GCCAGTGAAG TCTCGCGCGG CAAGGACACG
CCGTCCGGCC ATTGGGAAAT CGCCGGCACG CCGGTGACCT TCGACTGGGG CTATTTCCCC
GCGGACGGCG ACGCCTTCCC TCCCGAACTC GTCGAGGCGA TCTGTCGCGA GGGCGACGTG
CCCGGCATTC TCGGCAATTG CCATGCCTCG GGCACCGATA TCATCGCGCG TCTCGGCGAA
GAGCATATGC GGACCGGCAA GCCGATCTGC TATACCTCGT CGGACTCGGT CTTTCAGATT
GCCGCACACG AACAGACCTT CGGTCTGGAG CGTCTGCAGG ACCTTTGCGC GGTCGTCCGC
CGGCTCGTCG ACGAGTACAA TATCGGCCGT GTGATCGCTC GCCCGTTCGT CGGCAGCGAT
CCGGGCAGCT TCACGCGCAC CGGCAATCGG CGGGATTATT CGGTGCTGCC GCCGGCACCG
ACCGTTCTCG ACCGACTGAA GGAGGCCGGG CGAACAGTGC ACGCAATCGG CAAGATCGCC
GACATCTTCG CGCATCAGGG TGTAACCAGG CTTACCAAGG CCAACGGCAA CATGGCTCTG
TTCGACGCAA GCCTGGCGGC GATCGACGAG GCCGAGGACG GCGCGCTCAT CTTCACCAAT
TTCGTCGATT TCGATATGCT CTACGGTCAT CGCCGCGACG TGGCCGGCTA TGCCGCAGCG
CTCGAAGCCT TCGATGCACG CCTTCCCGAT CTCGACCGCC GCCTGAAGCC CGGCGACATG
GTCATCCTGA CTGCCGACCA TGGCTGCGAC CCGACCTGGC GCGGCACCGA CCACACCCGC
GAGCGCGTGC CCGTTCTGAT GTTCGGACCG ACGCTTCGGA GCCGCTCCGT CGGTATTGTC
GGGAGCTTCG CACATATCGG TGAAACCGTT GCAAGTCATC TCGGAATTGA CCCCGGCCCG
CATGGGAGGA GCCTCATTTG A
 
Protein sequence
MARAFLFVLD SFGIGNAPDA GAFGDLGADT LGHIAEFCAA GAADRAGLRE GPLNLPNMSA 
LGLMHAARLA TGRLPAGMAL PERVYGVYGA ASEVSRGKDT PSGHWEIAGT PVTFDWGYFP
ADGDAFPPEL VEAICREGDV PGILGNCHAS GTDIIARLGE EHMRTGKPIC YTSSDSVFQI
AAHEQTFGLE RLQDLCAVVR RLVDEYNIGR VIARPFVGSD PGSFTRTGNR RDYSVLPPAP
TVLDRLKEAG RTVHAIGKIA DIFAHQGVTR LTKANGNMAL FDASLAAIDE AEDGALIFTN
FVDFDMLYGH RRDVAGYAAA LEAFDARLPD LDRRLKPGDM VILTADHGCD PTWRGTDHTR
ERVPVLMFGP TLRSRSVGIV GSFAHIGETV ASHLGIDPGP HGRSLI