Gene Smed_4663 details

Gene Information

Locus tag: Smed_4663
Symbol:
ID: 5319338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1173682
End bp: 1174821
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 64%
IMG OID: 640776461
Product: phosphonate metabolism protein PhnM
Protein accession: YP_001313393
Protein GI: 150376797
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3454] Metal-dependent hydrolase involved in phosphonate metabolism
TIGRFAM ID: [TIGR02318] phosphonate metabolism protein PhnM
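
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 1173682
END_BP = 1174821
REPORTED_GENE_LENGTH = 1140      # bp
REPORTED_PROTEIN_LENGTH = 379    # aa


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))                  # 1140
    print(expected_protein_length(REPORTED_GENE_LENGTH))  # 379
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.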


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00935004
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCAAAG AACAGGTTTT GACCAACGCC CGCATCGTCT TGGAGGACCG GATTGTCGAT 
GGCTCGGTGC TGATCCGCGA CGGCAAGATC GCCGACATTT CCGAAGGAGC CAGTGCTGCG
GGCGAAGATT TCGAGGGCGA CTACCTGCTG CCCGGCCTCG TCGAGCTGCA CACCGACCAT
CTCGAGGCGC ATTATTCTCC GCGTCCCGGC GTGCGCTGGC TGAAGATCGC GGCGATCCAG
GCCCATGATG CCCAGGTCGT CACCTCGGGG ATCACCACCG TCTTCGATTG CCTGCGCCTG
GGCTCGGACG AGGATAGCGG CTTCCCGAAG GGCGAGATGC GCAGCATGGC CGATGCGCTC
GCCCAGGCCA AGCAAGAGGG CCGACTGAGG GCCGATCACC TCATTCACCT GCGCTGCGAA
GTGTCGACCG CCGACGTGCT GGAGCATTAC GAGGACTTCC GGAGCGATCC GCAGGTCCGC
CTCGTTTCGC TGATGGATCA CGCACCCGGT CAACGTCAGT TCCAGACGAT GGACCAATAT
ACGCTCTACT ACAAAACCAA GCGCGGTCTC ACCGATGATG CCTTCGCTTC GTTCATCGAG
CGCCAGCAGG CATTGTCGGC GCGCTATGCG ACACCGCACC GAACCGCCCT TGCGAAAGCT
TGCGCCGAAC GCGGCATCAC GATTGCCAGC CACGACGACG CCACGATCGA GCATGTCGAC
GAGTCGATCG GCTATGGTAT TCGCCTTGCG GAGTTTCCGA CGAGCTTCGA GGCGGCGGAA
GCTTCCCATC GGGCAGGTCT CAGCGTGCTG ATGGGCGCAC CCAATATCGT GCGCGGCAAA
TCGCATTCCG GCAACATCGC CGCGCGTGAT CTCGCCGAGC GCGGCGTCCT CGACGTACTG
TCTTCGGATT ACGTGCCGTT CAGCCTCATT CATGCGCCTT TCATACTCGC GGATGAGGTC
GAATCGATCG GCCTGCCCGA AGCCATCGCA ATGGTGACGG CCACTCCGGC GCGCACGGTC
GGTCTCGATG ACCGCGGCCG GATCGCCGTT GGCCTCCGTG CCGATCTCGC ACGCGTCCGC
AGGCCGGAGG GCATACCGGT CGTCCGCTCC GTCTGGCGCG AAGGTCGGCG TGTCGCATGA
 
Protein sequence
MPKEQVLTNA RIVLEDRIVD GSVLIRDGKI ADISEGASAA GEDFEGDYLL PGLVELHTDH 
LEAHYSPRPG VRWLKIAAIQ AHDAQVVTSG ITTVFDCLRL GSDEDSGFPK GEMRSMADAL
AQAKQEGRLR ADHLIHLRCE VSTADVLEHY EDFRSDPQVR LVSLMDHAPG QRQFQTMDQY
TLYYKTKRGL TDDAFASFIE RQQALSARYA TPHRTALAKA CAERGITIAS HDDATIEHVD
ESIGYGIRLA EFPTSFEAAE ASHRAGLSVL MGAPNIVRGK SHSGNIAARD LAERGVLDVL
SSDYVPFSLI HAPFILADEV ESIGLPEAIA MVTATPARTV GLDDRGRIAV GLRADLARVR
RPEGIPVVRS VWREGRRVA
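
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). Only the first 60 bases of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments,
# which translation table 11 shares).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the grouped display."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 60 bases of the gene sequence listed above.
    first_line = ("ATGCCCAAAG AACAGGTTTT GACCAACGCC "
                  "CGCATCGTCT TGGAGGACCG GATTGTCGAT")
    print(translate(first_line))  # MPKEQVLTNARIVLEDRIVD
    print(round(gc_content(first_line), 2))
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would give the complete translation and a GC fraction to compare against the reported GC content.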