Gene Smed_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3966 
Symbol 
ID5319064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp416464 
End bp418182 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content63% 
IMG OID640775775 
Productmetallophosphoesterase 
Protein accessionYP_001312708 
Protein GI150376112 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAC GTGCGCTGCC GGCAGTCGCC GTGATCGCCG ACGCGCATTT CCATGACCTC 
GAAGCAGATT TCGGCTTCGA CCGGGTGGAG GTGGAGGGCC GGAAAATCAC GATGCGCAGC
TGGGCAGAGA CCCGGAAATC GACCCGCGTC TTCAACGAAA GCGCGGACGC GTTCCTGGCA
GCGCTCGCGG AAGTGCGCCG GCGGGGCATT CGCCACGTCG TCCTGCTCGG GGATTACACG
GATGATGGCC AGCGCGCCAC CACCGGCGCG CTCCGGAACA TTCTCGACGA GCATTCCGCC
TTCGGCATGT CGTTCTATGC GCTCCCCGGC AATCACGACA TCTTCGGTCC GCAGGGCAGG
CATCACACCA AGCAATTCCT CGACTGCGCA GGTCGGGGCA TTCTCGTCAC CAGCGACGTG
AAGCGCGCCG GCAGCGGCGT TACGGTCAGT GATCGCATGT ATTGTGAAGG GTACCCGGCC
GGCCTCGATC CGATGGCTGG CTTCGGCTAC TTCCGCAAGC CGGAATATCT CCATTGGGAG
ACCCCGTTCG GTATGTCCGA TGCCGCGGAG GACCGCGAAT ACGAGGTTGC ATCGCCGGAC
GGCAGGAACC GTTACAACAT GATGGACGCG TCCTACCTGG TCGAGCCCGA GCCCGGTCTA
TGGCTGCTGA TGATCGACGC CAATGTCTTC GAGCCCGTGA ACGGGGTCTA TGAATGGGGC
GACGAGGCGG CCTTCATCGA CAGCACCTCG GGCGGATGGA ACGCGATGCT CCGCTGTAAG
CCTTTCGTCA TCCCGTGGAT TGCCGATGTC TGCGCACGGG CAGAAAGGCT CGGAAAAACA
CTGCTCGCCT TTTCGCACTA TCCGGTGCTC GACTCTTTCG ACGGTGCGAC CGGCGCCGAG
GGGGCGCTTT TCGGCGAGAC CAACATTGCC CGGCGCACAC CGCGAAGGGC AGTCGAACGC
GCGCTGCTGG CGGCCGGACT GTCGCTCCAT TTCAGCGGGC ATCTTCATGT GGAAGGCGTC
ACGCGCCGCG GCAGCGGCGA CCGATCACTG ACGAATGTCG CGGTCCCATC GCTCGTCGCC
TTTCCGCCGG CCTTCAAAAT CGCCCATCCG GGAGAGGGGA ACGTTGCAGT CGAGACGGTG
GAATTGTCGG GATTACCGGT CCATCCGCGG CTCCGGAACG CCTATGAGCG AGAGGCCGCC
CTGCTTGGTG AAGAACCGGA CGACGCCTTC TCAGCCCCCA GCTACGGAGC ATTTCTCAGG
GCGCACAAGC GGGCGCTGGT CAGCCACCGA TATTTCCCGG AGGAGTGGCC CCCGGCAATA
GTCGAGCGGG TGGCGGACCT CACACTGGAA GAGATCGCGT GTCTCTTCAC TGGCGAGTCC
GCCGGCAGCG CGCCAAAGCT CTCAGCTCTG GGGCAGGCTT CGGCAATCGA CATTGCCGAG
CTCGGACGCC TGCCAATGAT AGAACTCGTC GCCGACTGGT ATTGTCTCCG GCAGGGGGCA
TCGCTCGCTT TGGCGCATAT CGAAGAGTCG CGGCTGCCTC TCTATCGGTT TCTCGCCGAC
CGGTTCGGAT GCGAGCCACA GACCTGCCAC GACAGTCCGG AGAAAGGCTT CGTCGCCATA
TTCCTGGGAG CATTGGGTCT CTTTCTCGAA CGCGCGGGCA ATAGTCAGGC GCACATCGTC
GTCCAGTCAA ATCCCGCCCG GCAAGACGCT TCGGCGTGA
 
Protein sequence
MLKRALPAVA VIADAHFHDL EADFGFDRVE VEGRKITMRS WAETRKSTRV FNESADAFLA 
ALAEVRRRGI RHVVLLGDYT DDGQRATTGA LRNILDEHSA FGMSFYALPG NHDIFGPQGR
HHTKQFLDCA GRGILVTSDV KRAGSGVTVS DRMYCEGYPA GLDPMAGFGY FRKPEYLHWE
TPFGMSDAAE DREYEVASPD GRNRYNMMDA SYLVEPEPGL WLLMIDANVF EPVNGVYEWG
DEAAFIDSTS GGWNAMLRCK PFVIPWIADV CARAERLGKT LLAFSHYPVL DSFDGATGAE
GALFGETNIA RRTPRRAVER ALLAAGLSLH FSGHLHVEGV TRRGSGDRSL TNVAVPSLVA
FPPAFKIAHP GEGNVAVETV ELSGLPVHPR LRNAYEREAA LLGEEPDDAF SAPSYGAFLR
AHKRALVSHR YFPEEWPPAI VERVADLTLE EIACLFTGES AGSAPKLSAL GQASAIDIAE
LGRLPMIELV ADWYCLRQGA SLALAHIEES RLPLYRFLAD RFGCEPQTCH DSPEKGFVAI
FLGALGLFLE RAGNSQAHIV VQSNPARQDA SA