Gene ECH74115_1259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1259 
Symbol 
ID6969355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1267473 
End bp1268600 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content51% 
IMG OID643385249 
Producthypothetical protein 
Protein accessionYP_002269744 
Protein GI209399390 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2822] Predicted periplasmic lipoprotein involved in iron transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.285306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.564709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTA ACTTCCGCCG TAACGCATTG CAGTTGAGCG TGGCTGCGCT GTTTTCTTCT 
GCTTTTATGG CTAACGCCGC TGATGTGCCG CAGGTCAAAG TGACCGTGAC GGATAAGCAG
TGCGAACCGA TGACCATTAC GGTTAACGCC GGGAAAACAC AGTTCATTAT TCAGAACCAC
AGCCAGAAGG CGCTGGAATG GGAGATCCTC AAAGGCGTGA TGGTGGTGGA AGAGCGGGAA
AATATCGCCC CTGGCTTTAG CCAGAAAATG ACGGCGAATT TACAGCCTGG CGAATACGAT
ATGACCTGCG GTCTGCTGAC TAACCCGAAA GGGAAGTTGA TCGTCAAAGG TGAGGCAACG
GCGGATGCGG CGCAAAGTGA TGCGCTGTTA AGTCTTGGTG GTGCAATTAC TGCATATAAA
GCGTATGTCA TGGCGGAAAC CACACAGCTG GTGACCGACA CCAAAGCCTT TACCGACGCG
ATTAAAGCAG GCGATATCGA AAAAGCGAAA GCACTGTATG CGCCGACGCG CCAGCACTAT
GAGCGCATTG AACCGATTGC TGAACTGTTC TCCGATCTGG ATGGCAGCAT TGACGCCCGT
GAAGATGATT ACGAGCAAAA AGCCGCTGAT CCAAAATTCA CCGGTTTCCA CCGTCTGGAA
AAAGCATTGT TTGGCGACAA CACCACCAAA GGCATGGATC AGTACGCTGA CCAGCTTTAT
ACCGATGTGG TCGATTTGCA AAAACGCATC AGTGAACTGG CTTTCCCACC TTCAAAAGTG
GTCGGCGGTG CAGCCGGACT GATTGAGGAA GTGGCAGCCA GCAAAATCAG CGGTGAAGAA
GATCGCTACA GCCACACCGA TCTGTGGGAT TTCCAGGCTA ACGTTGAAGG CTCGCAGAAA
ATTGTCGATC TGCTGCGTCC ACAACTGCAA AAAGCTAACC CGGAACTGTT GGCAAAAGTC
GATGCCAACT TTAAAAAGGT CGATACCATT CTGGCGAAAT ACCGTACTAA AGACGGTTTT
GAAACCTACG ACAAATTGAC CGATGCCGAC CGGAATGCAC TGAAAGGACC GATTACTGCG
CTGGCGGAAG ATCTGGCGCA ACTTCGCGGT GTGCTGGGAT TGGATTAA
 
Protein sequence
MTINFRRNAL QLSVAALFSS AFMANAADVP QVKVTVTDKQ CEPMTITVNA GKTQFIIQNH 
SQKALEWEIL KGVMVVEERE NIAPGFSQKM TANLQPGEYD MTCGLLTNPK GKLIVKGEAT
ADAAQSDALL SLGGAITAYK AYVMAETTQL VTDTKAFTDA IKAGDIEKAK ALYAPTRQHY
ERIEPIAELF SDLDGSIDAR EDDYEQKAAD PKFTGFHRLE KALFGDNTTK GMDQYADQLY
TDVVDLQKRI SELAFPPSKV VGGAAGLIEE VAASKISGEE DRYSHTDLWD FQANVEGSQK
IVDLLRPQLQ KANPELLAKV DANFKKVDTI LAKYRTKDGF ETYDKLTDAD RNALKGPITA
LAEDLAQLRG VLGLD