Gene EcSMS35_3583 details

Gene Information

Locus tag: EcSMS35_3583
Symbol: fmt
ID: 6145017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3661449
End bp: 3662396
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 53%
IMG OID: 641618410
Product: methionyl-tRNA formyltransferase
Protein accession: YP_001745550
Protein GI: 170682749
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0223] Methionyl-tRNA formyltransferase
TIGRFAM ID: [TIGR00460] methionyl-tRNA formyltransferase
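
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3661449, 3662396)
print(length, protein_length_aa(length))  # 948 315
```

Under these assumptions, 3662396 - 3661449 + 1 = 948 bp and 948 / 3 - 1 = 315 aa, which agrees with the Gene Length and Protein Length fields.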


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.291651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0987748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGAAT CACTACGTAT TATTTTTGCG GGTACACCTG ACTTTGCAGC GCGTCATCTC 
GACGCGCTGT TGTCTTCTGG TCATAACGTC GTTGGCGTGT TCACCCAGCC AGACCGACCG
GCAGGACGCG GTAAAAAACT GATGCCCAGC CCGGTTAAAG TTCTGGCTGA GGAAAAAGGT
CTGCCCGTTT TTCAACCGGT TTCCCTGCGT CCACAAGAAA ACCAGCAACT GGTCGCCGAT
CTACAGGCTG ATGTTATGGT TGTCGTCGCC TATGGCTTAA TTCTGCCTAA AGCCGTGCTG
GAGATGCCGC GTCTTGGCTG TATCAACGTT CATGGTTCAC TGCTGCCACG CTGGCGCGGT
GCTGCACCAA TCCAACGCTC ACTATGGGCG GGTGATGCAG AAACTGGTGT GACCATTATG
CAAATGGATG TTGGTTTAGA CACCGGTGAC ATGCTCTATA AGCTCTCCAG CCCGATTACT
GCAGAAGATA CCAGTGGTAC GCTGTACGAC AAGCTGGCAG AGCTTGGCCC ACAAGGGCTT
ATCACCACAT TGAAACAATT GGCAGACGGC ACGGCGAAAC CAGAAGTTCA GGACGAAACT
CTTGTCACTT ACGCAGAGAA GTTGAGTAAA GAAGAAGCGC GTATTGACTG GTCACTTTCG
GCAGCACAGC TTGAACGCTG CATTCGCGCT TTCAATCCAT GGCCAATGAG CTGGCTGGAA
ATTGAAGGCC AGCCTGTTAA GGTCTGGAAA GCGTCGGTCA TTGATACGGT GACAAAGTCT
GCGCCAGGAA CAATCCTTGA AGCCAGCAAA CAAGGCATTC AGGTCGCGAC CGGTGATGGC
ATCCTGAATC TGCTCTCGTT GCAACCTGCG GGTAAGAAAG CGATGAGCGC ACAAGACCTC
CTGAATTCTC GTCGGGAATG GTTTGTTCCG GGCAACCGTC TGGCCTGA
 
Protein sequence
MSESLRIIFA GTPDFAARHL DALLSSGHNV VGVFTQPDRP AGRGKKLMPS PVKVLAEEKG 
LPVFQPVSLR PQENQQLVAD LQADVMVVVA YGLILPKAVL EMPRLGCINV HGSLLPRWRG
AAPIQRSLWA GDAETGVTIM QMDVGLDTGD MLYKLSSPIT AEDTSGTLYD KLAELGPQGL
ITTLKQLADG TAKPEVQDET LVTYAEKLSK EEARIDWSLS AAQLERCIRA FNPWPMSWLE
IEGQPVKVWK ASVIDTVTKS APGTILEASK QGIQVATGDG ILNLLSLQPA GKKAMSAQDL
LNSRREWFVP GNRLA
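
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the pipeline used to produce this record: it assumes NCBI translation table 11 (as stated in the Translation table field), which uses the standard amino acid assignments and reads an alternative start codon such as GTG as methionine when it is the first codon. The names `CODON_TABLE`, `START_CODONS_11`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGTCAGAAT CACTACGTAT TATTTTTGCG"))  # MSESLRIIFA
```

The first codon, GTG, would normally encode valine; treating it as an initiator gives the leading M of the protein sequence. Passing the full gene sequence to the same function should reproduce the protein sequence listed above, under the stated assumptions.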