Gene EcolC_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0426 
Symbolfmt 
ID6067707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp459769 
End bp460716 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content53% 
IMG OID641599825 
Productmethionyl-tRNA formyltransferase 
Protein accessionYP_001723431 
Protein GI170018477 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0223] Methionyl-tRNA formyltransferase 
TIGRFAM ID[TIGR00460] methionyl-tRNA formyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.459204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00147266 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCAGAAT CACTACGTAT TATTTTTGCG GGTACACCTG ACTTTGCAGC GCGTCATCTC 
GACGCGCTGT TGTCTTCTGG TCATAACGTC GTTGGCGTGT TCACCCAGCC AGACCGACCG
GCAGGACGCG GTAAAAAACT GATGCCCAGC CCGGTTAAAG TTCTGGCTGA GGAAAAAGGT
CTGCCCGTTT TTCAACCTGT TTCCCTGCGT CCACAAGAAA ACCAGCAACT GGTCGCCGAA
CTGCAGGCTG ATGTTATGGT CGTCGTCGCC TATGGTTTAA TTCTGCCGAA AGCAGTGCTG
GAGATGCCGC GTCTTGGCTG TATCAACGTT CATGGTTCAC TGCTGCCACG CTGGCGCGGT
GCTGCACCAA TCCAACGCTC ACTATGGGCG GGTGATGCAG AAACTGGTGT GACCATTATG
CAAATGGATG TCGGTTTAGA CACCGGTGAT ATGCTCTATA AGCTCTCCTG CCCGATTACT
GCAGAAGATA CCAGTGGTAC GCTGTACGAC AAGCTGGCAG AGCTTGGCCC ACAAGGGCTT
ATCACCACGT TGAAACAACT GGCAGACGGC ACGGCGAAAC CAGAAGTTCA GGACGAAACT
CTTGTCACTT ACGCCGAGAA GTTGAGTAAA GAAGAAGCGC GTATTGACTG GTCACTTTCG
GCAGCACAGC TTGAACGCTG CATTCGCGCT TTCAATCCAT GGCCAATGAG CTGGCTGGAA
ATTGAAGGAC AGCCGGTTAA AGTCTGGAAA GCATCGGTCA TTGATACGGC AACCAACGCT
GCACCAGGAA CGATCCTTGA AGCCAACAAA CAAGGCATTC AGGTTGCGAC TGGTGATGGC
ATCCTGAACC TGCTCTCGTT ACAACCTGCG GGTAAGAAAG CGATGAGCGC GCAAGACCTC
CTGAACTCTC GTCGGGAATG GTTTGTTCCG GGCAACCGTC TGGTCTGA
 
Protein sequence
MSESLRIIFA GTPDFAARHL DALLSSGHNV VGVFTQPDRP AGRGKKLMPS PVKVLAEEKG 
LPVFQPVSLR PQENQQLVAE LQADVMVVVA YGLILPKAVL EMPRLGCINV HGSLLPRWRG
AAPIQRSLWA GDAETGVTIM QMDVGLDTGD MLYKLSCPIT AEDTSGTLYD KLAELGPQGL
ITTLKQLADG TAKPEVQDET LVTYAEKLSK EEARIDWSLS AAQLERCIRA FNPWPMSWLE
IEGQPVKVWK ASVIDTATNA APGTILEANK QGIQVATGDG ILNLLSLQPA GKKAMSAQDL
LNSRREWFVP GNRLV