Gene Smed_0684 details


Gene Information

Locus tag: Smed_0684
Symbol:
ID: 5321521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 735437
End bp: 736435
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 63%
IMG OID: 640789621
Product: glycerophosphoryl diester phosphodiesterase
Protein accession: YP_001326375
Protein GI: 150395908
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID:
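
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record (coordinates assumed 1-based and inclusive).
start_bp = 735437
end_bp = 736435

# Inclusive span of the gene on the replicon, in base pairs.
gene_length = end_bp - start_bp + 1

# Number of complete codons in the coding sequence.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, codons, protein_length)  # 999 333 332
```

Under these assumptions the computed values match the reported Gene Length (999 bp) and Protein Length (332 aa).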


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.35489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TCGCCGCGGT TGCGCTCGTC GTCGTCGCCA TCCTATGGCT TCTCAACACG 
TCGCTTCTGG CGACGCCGCC CTCCGGCGGC GCAAAGATCC TCGCTCATCG CGGCGTTCAT
CAGGTCTTCA ATTCGGAAGG TCTCGACAAT GAAACCTGCA CCGCCGAGCG CATCGAGGCG
CCGCGCCACA GCTATCTCGA AAACACCATT GCTTCCATGC GCGCCGCAAT GGAGAGCGGC
GCCGACGTCG TCGAGCTGGA TGTCCACCTG ACACCGGATC GCCAGTTCGC CGTGTTTCAC
GATTGGACAC TCGATTGCCG GACGAATGGG AGAGGCGTGA CCCAGGATAC GCCGATGTCC
GAACTGAAGA CGCTGGACAT CGGCTACGGT TATACGGCCG ATGGCGGCAA GTCCTTCCCC
TTCCGCGGCC AGGGAGCTGG CCAGATGCCG ACGCTGACAG AGGTCTTCAA GGCCCTGCCC
GAGGGCCGCT TTCTGATCAA TTTCAAGAGC GAGCGGCGGG AGGAAGGCGC GACCCTCGCC
GTTCTTTTGC GTTTCCACCC GGAGTGGCGC AAACAGGTGT TCGGCGTCTA TGGCGGCACT
GCCCCCACAC AGGAAACCCT GAGGCTGGTC CCAGGCATCA GGGGATACGA CCGACAATCC
ACACTCGCCT GCCTTGGCCG CTATGCCGCC TATGGCTGGA CGGGTATCGT ACCGGAAGCC
TGCCGCGACA CGCTGATCAT AGTGCCCGGC AATTATGCGC CATTCCTGTG GGGGTGGCCG
GACCGGTTCG CGGCCCGTAT GCAAGCCGCC GGCAGCGAAA TCATCCTGCT GGGGCCCTAT
AAAGGCGGCG ACTTCACCAC GGGCATCGAC AGCGCGGATG ATCTCGCTTT CGTTCCCGAA
GGTTTTTCCG GCTATGTCTG GACCAACAGG GCGGAAACGA TCGCACCTCT CTTCGGCAGG
CGATCCGGAG CCGGGAGCGA CCAGGCCAAC CGCCAGTGA
 
Protein sequence
MKKIAAVALV VVAILWLLNT SLLATPPSGG AKILAHRGVH QVFNSEGLDN ETCTAERIEA 
PRHSYLENTI ASMRAAMESG ADVVELDVHL TPDRQFAVFH DWTLDCRTNG RGVTQDTPMS
ELKTLDIGYG YTADGGKSFP FRGQGAGQMP TLTEVFKALP EGRFLINFKS ERREEGATLA
VLLRFHPEWR KQVFGVYGGT APTQETLRLV PGIRGYDRQS TLACLGRYAA YGWTGIVPEA
CRDTLIIVPG NYAPFLWGWP DRFAARMQAA GSEIILLGPY KGGDFTTGID SADDLAFVPE
GFSGYVWTNR AETIAPLFGR RSGAGSDQAN RQ
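
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the database's own method. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the tables are understood to differ only in which codons may act as start codons). The names `clean`, `gc_content`, and `translate` are hypothetical helpers introduced here for illustration.

```python
# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAAAAGA TCGCCGCGGT TGCGCTCGTC"
print(translate(first_blocks))  # MKKIAAVALV
```

Pasting the full gene sequence into `translate` and `gc_content` allows the result to be compared against the protein sequence and the GC content listed in the record.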