Gene EcolC_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3931 
Symbol 
ID6064421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4316743 
End bp4317879 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content61% 
IMG OID641603344 
Productphosphonate metabolism protein PhnM 
Protein accessionYP_001726859 
Protein GI170021905 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3454] Metal-dependent hydrolase involved in phosphonate metabolism 
TIGRFAM ID[TIGR02318] phosphonate metabolism protein PhnM 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATCA ATAACGTTAA GCTGGTGCTG GAAAACGAGG TGGTAAGCGG TTCGCTGGAG 
GTGCAGAACG GCGAAATCCG CGCCTTTGCC GAAAGCCAGA GCCGCCTGCC GGAGGCGATG
GACGGCGAAG GCGGCTGGCT GCTGCCGGGG CTGATTGAGC TGCATACCGA TAATCTGGAT
AAATTCTTCA CCCCGCGCCC GAAAGTTGAC TGGCCTGCCC ACTCGGCGAT GAGCAGCCAC
GACGCGCTGA TGGTGGCGAG CGGCATCACC ACCGTACTGG ATGCCGTGGC AATTGGCGAC
GTGCGCGACG GCGGCGATCG GCTGGAGAAT CTGGAGAAGA TGATCAACGC CATCGAAGAG
ACACAGAAAC GCGGCGTCAA CCGCGCCGAG CACCGTCTGC ATCTGCGCTG CGAACTGCCG
CATCACACCA CGCTGCCGCT GTTTGAAAAA CTGGTGCAGC GCGAGCCGGT GACGCTGGTG
TCGCTGATGG ACCACTCGCC GGGCCAGCGC CAGTTCGCCA ACCGCGAGAA GTATCGCGAA
TATTATCAGG GCAAATACTC CCTCACTGAT GCGCAGATGC AGCAGTACGA AGAAGAGCAA
CTGGCGCTCG CCGCACGCTG GTCGCAGCCG AATCGCGAAT CCATCGCCGC CCTGTGCCGC
GCGCGAAAAA TTGCGCTTGC CAGCCACGAT GACGCCACCC ACGCCCAAGT TGCTGAATCT
CACCAGCTTG GCAGCGTGAT CGCCGAATTT CCCACCACGT TCGAAGCGGC GGAAGCCTCG
CGCAAGCATG GCATGAACGT GCTGATGGGC GCGCCGAATA TTGTGCGCGG CGGCTCGCAC
TCCGGCAACG TGGCGGCCAG TGAACTGGCG CAGCTTGGCC TGCTGGATAT CCTCTCTTCC
GACTACTACC CCGCCAGCCT GCTCGATGCG GCATTTCGCG TCGCCGATGA CGAGAGCAAC
CGCTTTACGC TGCCGCAGGC GGTGAAGCTG GTGACTAAAA ATCCAGCGCA GGCGCTTAAT
CTCCAGGATC GCGGGGTGAT TGGCGAGGGC AAACGCGCGG ACCTGGTGCT GGCGCATCGC
AAGGGCAATC ACATTCATAT CGACCACGTC TGGCGTCAGG GTAAAAGGGT GTTCTGA
 
Protein sequence
MIINNVKLVL ENEVVSGSLE VQNGEIRAFA ESQSRLPEAM DGEGGWLLPG LIELHTDNLD 
KFFTPRPKVD WPAHSAMSSH DALMVASGIT TVLDAVAIGD VRDGGDRLEN LEKMINAIEE
TQKRGVNRAE HRLHLRCELP HHTTLPLFEK LVQREPVTLV SLMDHSPGQR QFANREKYRE
YYQGKYSLTD AQMQQYEEEQ LALAARWSQP NRESIAALCR ARKIALASHD DATHAQVAES
HQLGSVIAEF PTTFEAAEAS RKHGMNVLMG APNIVRGGSH SGNVAASELA QLGLLDILSS
DYYPASLLDA AFRVADDESN RFTLPQAVKL VTKNPAQALN LQDRGVIGEG KRADLVLAHR
KGNHIHIDHV WRQGKRVF