Gene EcHS_A3481 details

Gene Information

Locus tag: EcHS_A3481
Symbol: fmt
ID: 5595059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3478557
End bp: 3479504
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 53%
IMG OID: 640922598
Product: methionyl-tRNA formyltransferase
Protein accession: YP_001460079
Protein GI: 157162761
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0223] Methionyl-tRNA formyltransferase
TIGRFAM ID: [TIGR00460] methionyl-tRNA formyltransferase
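
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record.
start_bp = 3478557
end_bp = 3479504
gene_length_bp = 948
protein_length_aa = 315

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = span_bp // 3
coded_residues = codons - 1

print(span_bp, codons, coded_residues)  # 948 316 315
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.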


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.000745387
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCAGAAT CACTACGTAT TATTTTTGCG GGTACACCTG ACTTTGCAGC GCGTCATCTC 
GACGCGCTGT TGTCTTCTGG TCATAACGTC GTTGGCGTGT TCACCCAGCC AGACCGACCG
GCAGGACGCG GTAAAAAACT GATGCCCAGC CCGGTTAAAG TTCTGGCTGA GGAAAAAGGT
CTGCCCGTTT TTCAACCTGT TTCCCTGCGT CCACAAGAAA ACCAGCAACT GGTCGCCGAA
CTGCAGGCTG ATGTTATGGT CGTCGTCGCC TATGGTTTAA TTCTGCCGAA AGCAGTGCTG
GAGATGCCGC GTCTTGGCTG TATCAACGTT CATGGTTCAC TGCTGCCACG CTGGCGCGGT
GCTGCACCAA TCCAACGCTC ACTATGGGCG GGTGATGCAG AAACTGGTGT GACCATTATG
CAAATGGATG TCGGTTTAGA CACCGGTGAT ATGCTCTATA AGCTCTCCTG CCCGATTACT
GCAGAAGATA CCAGTGGTAC GCTGTACGAC AAGCTGGCAG AGCTTGGCCC ACAAGGGCTT
ATCACCACGT TGAAACAACT GGCAGACGGC ACGGCGAAAC CAGAAGTTCA GGACGAAACT
CTTGTCACTT ACGCCGAGAA GTTGAGTAAA GAAGAAGCGC GTATTGACTG GTCACTTTCG
GCAGCACAGC TTGAACGCTG CATTCGCGCT TTCAATCCAT GGCCAATGAG CTGGCTGGAA
ATTGAAGGAC AGCCGGTTAA AGTCTGGAAA GCATCGGTCA TTGATACGGC AACCAACGCT
GCACCAGGAA CGATCCTTGA AGCCAACAAA CAAGGCATTC AGGTTGCGAC TGGTGATGGC
ATCCTGAACC TGCTCTCGTT ACAACCTGCG GGTAAGAAAG CGATGAGCGC GCAAGACCTC
CTGAACTCTC GTCGGGAATG GTTTGTTCCG GGCAACCGTC TGGTCTGA
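
The GC content listed in the record is a percentage over the whole gene. A minimal sketch of the usual definition (G plus C bases divided by total bases) follows. The function name `gc_content` is hypothetical, and the database may compute or round its figure differently. The demonstration uses only the first 60-base row of the gene sequence, so its result is not expected to match the gene-wide value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence (60 bases), for demonstration only.
first_row = "GTGTCAGAAT CACTACGTAT TATTTTTGCG GGTACACCTG ACTTTGCAGC GCGTCATCTC"
print(round(gc_content(first_row), 1))
```

Passing the full 948-base sequence instead of `first_row` gives the gene-wide percentage for comparison with the record.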
 
Protein sequence
MSESLRIIFA GTPDFAARHL DALLSSGHNV VGVFTQPDRP AGRGKKLMPS PVKVLAEEKG 
LPVFQPVSLR PQENQQLVAE LQADVMVVVA YGLILPKAVL EMPRLGCINV HGSLLPRWRG
AAPIQRSLWA GDAETGVTIM QMDVGLDTGD MLYKLSCPIT AEDTSGTLYD KLAELGPQGL
ITTLKQLADG TAKPEVQDET LVTYAEKLSK EEARIDWSLS AAQLERCIRA FNPWPMSWLE
IEGQPVKVWK ASVIDTATNA APGTILEANK QGIQVATGDG ILNLLSLQPA GKKAMSAQDL
LNSRREWFVP GNRLV
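
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial code), GTG can serve as an initiator codon, and an initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and `START_CODONS` is a simplified subset of the initiators that table 11 allows.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# assignments as the standard code for internal codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified subset of bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon at position 1 is translated as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCAGAAT CACTACGTAT TATTTTTGCG"))  # MSESLRIIFA
```

The output matches the first ten residues of the protein sequence. Without the initiator rule, the first residue would be read as V.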