Gene Smed_4631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4631 
Symbol 
ID5319266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1139418 
End bp1140422 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content64% 
IMG OID640776429 
Productdeoxyribose-phosphate aldolase 
Protein accessionYP_001313361 
Protein GI150376765 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0274] Deoxyribose-phosphate aldolase 
TIGRFAM ID[TIGR00126] deoxyribose-phosphate aldolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.698385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.34325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAAGACC TTGCTCTGAA AAGCACGGCG GTGCACCCCG CCGGCGCCGG CGATAACTGG 
CCGCGCAACG ACGGTGTCGA ACTCGACCTT TCCTGGGTCC TGGATCAGCG CGTCAATCTT
TCCGCGGCCG AACGCCGCGT TCAGACGCTG CCGGGCCGGC GCACCGTCAA GAAAGATGCG
CAGGCCGCCT GGCTCTTGAA AGCCGTCACA TGCATCGATC TGACGACGCT CAGTGGCGAC
GACACGACCG AACGCGTCAA GCGCCTGTGC GCCAAGGCGC GTCAGCCGGT GCGCCAGGAC
ATTCTCGACG CACTTGGTAT GGGCGATCGC GGCATCACGA CCGGGGCGAT CTGCGTCTAT
CACCGCTTCG TTTCGACTGC GGTGGATGCC CTGGAGGGCT CGGGCATTCC GGTCGCGGCA
GTGTCCACAG GCTTTCCGGC CGGCCTCGTG CCGCATGACG TCAAGCTCAG GGAAATCGAA
GCTTCGGTTG CCGACGGTGC CCGGGAGATC GACATCGTCA TCACGCGCGA GCATGTTCTG
ACCGGCAACT GGCAGGCGCT CTACGACGAA ATGCGGGATT TTCGTGCGGC CTGCGGCGAC
GCCCATGTCA AGGCCATCCT CGCGACCGGC GACCTGAAGA CGCTCCGCAA CGTCGCGCGC
GCTTCGCTTG TCTGTATGAT GGCGGGGGCC GACTTTATCA AGACATCGAC GGGAAAGGAG
GGGGTGAACG CGACGCTGCT CGTCACCCTC GCCATGCTCA GGATGATCCG GGCCTACGAG
GAGCGGACAG GATTGAAGGT CGGTTACAAG CCGGCCGGAG GAATCTCCGC CGCCAAGGAC
GTGTTGAACT ACCAGTTCCT CATGAGAGAG GAACTCGGGC GAGATTGGCT CGAGCCCGAT
CTTTTCCGCG TCGGGGCATC GAGCCTGCTC GGCGATATCG AGCGCCAGCT CGAGCATCAC
GTGAGCGGTG CCTATTCGGC CCTGAACAGA CACCCGATAG GATGA
 
Protein sequence
MEDLALKSTA VHPAGAGDNW PRNDGVELDL SWVLDQRVNL SAAERRVQTL PGRRTVKKDA 
QAAWLLKAVT CIDLTTLSGD DTTERVKRLC AKARQPVRQD ILDALGMGDR GITTGAICVY
HRFVSTAVDA LEGSGIPVAA VSTGFPAGLV PHDVKLREIE ASVADGAREI DIVITREHVL
TGNWQALYDE MRDFRAACGD AHVKAILATG DLKTLRNVAR ASLVCMMAGA DFIKTSTGKE
GVNATLLVTL AMLRMIRAYE ERTGLKVGYK PAGGISAAKD VLNYQFLMRE ELGRDWLEPD
LFRVGASSLL GDIERQLEHH VSGAYSALNR HPIG