Gene EcSMS35_4561 details


Gene Information

Locus tag: EcSMS35_4561
Symbol: phnM
ID: 6144637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4661579
End bp: 4662715
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 61%
IMG OID: 641619377
Product: phosphonate metabolism protein PhnM
Protein accession: YP_001746489
Protein GI: 170682245
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3454] Metal-dependent hydrolase involved in phosphonate metabolism
TIGRFAM ID: [TIGR02318] phosphonate metabolism protein PhnM
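
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4661579
end_bp = 4662715
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 1137
print(codons)               # 379
print(expected_protein_aa)  # 378
```

Under these assumptions the span equals the stated gene length, and the codon count minus one equals the stated protein length.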


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTATCA ATAACGTTAA GCTGGTGCTG GAAAACGAGG TGGTGCACGG TTCGCTGGAG 
GTGCAGGATG GCGAAATCCG CGCCTTTGCC GAAAGCCAGA GCCGCCTGCC AGAAGCGATG
GATGGCGAAG GCGGCTGGCT GCTGCCGGGG CTGATTGAGC TGCATACCGA TAATCTGGAT
AAATTCTTCA CCCCGCGCCC GAAGGTTGAC TGGCCCGCCC ACTCGGCGAT GAGTAGCCAC
GACGCGCTGA TGGTGGCGAG CGGCATCACC ACCGTGCTGG ACGCGGTGGC GATTGGCGAC
GTGCGCGACG GCGGCGATCG GCTGGAGAAT CTGGAGAAGA TGATCAACGC CATCGAAGAG
ACGCAGAAAC GCGGCGTCAA CCGCGCCGAG CACCGTCTGC ATCTGCGCTG CGAACTGCCG
CATCACACCA CGCTGCCGCT GTTTGAAAAA CTGGTGCAGC GCGAACCGGT GACGCTGGTG
TCGCTGATGG ATCACTCGCC GGGCCAGCGC CAGTTCGCCA ATCGCGAGAA GTATCGCGAA
TATTATCAGG GCAAATATTC CCTCACCGAT GCGCAGATGC AGCAGTACGA AGAGGAACAG
CTGGCGCTTG CCGCACGCTG GTCACAGCCT AATCGCGAAA CTATCGCCGC AATGTGCCGC
GCGCGACACA TTGCCCTCGC CAGCCACGAT GACGCCACGC ACGCCCACGT TGCCGAATCC
CACCAGCTTG GCAGCGTGAT CGCCGAATTT CCCACCACGT TCGAAGCGGC AGAAGCCTCG
CGCAAGCATG GCATGAACGT GCTGATGGGC GCGCCGAATA TCGTGCGCGG CGGTTCGCAC
TCCGGCAACG TGGCGGCCAG TGAACTGGCG CAGCTTGGCC TGCTGGATAT CCTCTCTTCC
GACTACTACC CCGCCAGCCT GCTCGATGCG GCATTTCGCG TCGCCGATGA CGAGAGCAAC
CGCTTTACGC TGCCGCAGGC GGTGAAGCTG GTGACTAAAA ATCCGGCGCA GGCGCTTAAT
CTTCAGGATC GCGGGGTGAT TGGCGAGGGC AAACGCGCCG ACCTGGTGCT GGCGCATCGC
AAGGGCAACC ACATTCATAT CGACCACGTC TGGCGTCAGG GTAAAAGGGT GTTCTGA
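
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as GC content. The sketch below uses only the Python standard library; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool. For brevity it works on the first and last rows shown above; pasting the full sequence into `clean_sequence` would be needed to compare against the GC content figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = clean_sequence(
    "ATGATTATCA ATAACGTTAA GCTGGTGCTG GAAAACGAGG TGGTGCACGG TTCGCTGGAG"
)
last_row = clean_sequence(
    "AAGGGCAACC ACATTCATAT CGACCACGTC TGGCGTCAGG GTAAAAGGGT GTTCTGA"
)

print(first_row[:3])   # start codon: ATG
print(last_row[-3:])   # stop codon: TGA
print(len(first_row), len(last_row))  # 60 57

# GC percentage of the first row only (not the whole gene).
print(round(gc_percent(first_row), 1))
```

The 17 full rows of 60 bases between these two rows, plus the 60 and 57 bases counted here, account for the 1137 bp gene length.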
 
Protein sequence
MIINNVKLVL ENEVVHGSLE VQDGEIRAFA ESQSRLPEAM DGEGGWLLPG LIELHTDNLD 
KFFTPRPKVD WPAHSAMSSH DALMVASGIT TVLDAVAIGD VRDGGDRLEN LEKMINAIEE
TQKRGVNRAE HRLHLRCELP HHTTLPLFEK LVQREPVTLV SLMDHSPGQR QFANREKYRE
YYQGKYSLTD AQMQQYEEEQ LALAARWSQP NRETIAAMCR ARHIALASHD DATHAHVAES
HQLGSVIAEF PTTFEAAEAS RKHGMNVLMG APNIVRGGSH SGNVAASELA QLGLLDILSS
DYYPASLLDA AFRVADDESN RFTLPQAVKL VTKNPAQALN LQDRGVIGEG KRADLVLAHR
KGNHIHIDHV WRQGKRVF