Gene EcHS_A4336 details

Gene Information

Locus tag: EcHS_A4336
Symbol: phnM
ID: 5591595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4340652
End bp: 4341788
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 61%
IMG OID: 640923434
Product: phosphonate metabolism protein PhnM
Protein accession: YP_001460879
Protein GI: 157163561
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3454] Metal-dependent hydrolase involved in phosphonate metabolism
TIGRFAM ID: [TIGR02318] phosphonate metabolism protein PhnM
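
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is only a consistency check, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4340652
end_bp = 4341788

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive 3 bp codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1137 0 378
```

The results agree with the listed gene length of 1137 bp and protein length of 378 aa.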


Plasmid Coverage information

Num covering plasmid clones: 77
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATCA ATAACGTTAA GCTGGTGCTG GAAAACGAGG TGGTGCACGG TTCGCTGGAG 
GTGCAGGATG GCGAAATCCG CGCCTTTGCC GAAAGCCAGA GCCGCCTGCC AGAAGCGATG
GACGGCGAAG GTGGCTGGCT ACTACCCGGC CTGATTGAGC TGCATACCGA CAACCTCGAT
AAATTCTTCA CCCCGCGCCC GAAAGTCGAC TGGCCCGCCC ATTCGGCGAT GAGCAGCCAC
GACGCGCTGA TGGTGGCAAG CGGCATCACC ACCGTGCTGG ACGCGGTGGC GATTGGCGAC
GTGCGCGACG GCGGCGATCG GCTGGAGAAT CTGGAGAAGA TGATCAACAC CATCGAAGAG
ACGCAGAAAC GCGGCGTCAA CCGCGCCGAG CACCGCCTGC ACCTGCGCTG CGAACTGCCG
CATCACACCA CACTGCCGCT GTTTGAAAAA CTGGTTCAGC GCGAACCGGT GACGCTGGTG
TCGCTGATGG ACCACTCACC GGGCCAGCGC CAGTTCGCTA ACCGCGAGAA GTATCGCGAA
TATTATCAGG GCAAATACTC CCTCACCGAT GCGCAGATGC AGCAGTACGA AGAAGAGCAA
TTGGCGCTCG CCGCACGCTG GTCGCAGCCG AATCGCGAAT CCATCGCCGC CCTGTGCCGC
GCGCGACAAA TTGCGCTCGC CAGCCACGAT GACGCCACAC ACGCCCACGT TGCCGAATCC
CACCAACTTG GCAGCGTGAT CGCCGAATTT CCCACCACGT TCGAAGCGGC GGAAGCCTCG
CGTAAGCATG GCATGAACGT GCTGATGGGT GCGCCAAATA TCGTGCGTGG CGGCTCGCAC
TCCGGCAATG TGGCGGCCAG TGAACTGGCG CAGCTTGGTC TGCTGGATAT CCTCTCTTCC
GACTACTACC CCGCCAGCCT GCTGGATGCG GCGTTCCGCG TCGCCGATGA CGAGAGCAAC
CGCTTTACGC TACCGCAGGC GGTGAGGCTG GTGACCAAAA ATCCGGCGCA GGCGCTGAAT
CTTCAGGATC GCGGGGTGAT TGGCGAGGGT AAACGCGCTG ACCTGGTGCT GGCGCATCGC
CAGGGCAATC ACATTCATAT CGACCACGTC TGGCGTCAGG GTAAAAGGGT GTTCTGA
 
Protein sequence
MIINNVKLVL ENEVVHGSLE VQDGEIRAFA ESQSRLPEAM DGEGGWLLPG LIELHTDNLD 
KFFTPRPKVD WPAHSAMSSH DALMVASGIT TVLDAVAIGD VRDGGDRLEN LEKMINTIEE
TQKRGVNRAE HRLHLRCELP HHTTLPLFEK LVQREPVTLV SLMDHSPGQR QFANREKYRE
YYQGKYSLTD AQMQQYEEEQ LALAARWSQP NRESIAALCR ARQIALASHD DATHAHVAES
HQLGSVIAEF PTTFEAAEAS RKHGMNVLMG APNIVRGGSH SGNVAASELA QLGLLDILSS
DYYPASLLDA AFRVADDESN RFTLPQAVRL VTKNPAQALN LQDRGVIGEG KRADLVLAHR
QGNHIHIDHV WRQGKRVF
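
The gene and protein sequences above can be checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in the permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (the NCBI layout).
# Table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as printed in this record.
first_row = "ATGATTATCA ATAACGTTAA GCTGGTGCTG GAAAACGAGG TGGTGCACGG TTCGCTGGAG"
print(translate(first_row))  # prints: MIINNVKLVLENEVVHGSLE
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the listed GC content.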