Gene Information |
Locus tag | ECH74115_1259 |
Symbol | |
ID | 6969355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1267473 |
End bp | 1268600 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385249 |
Product | hypothetical protein |
Protein accession | YP_002269744 |
Protein GI | 209399390 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2822] Predicted periplasmic lipoprotein involved in iron transport |
TIGRFAM ID | |
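The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The record does not state these conventions; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1267473
end_bp = 1268600
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the final codon is the stop,
# which is not counted in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1128 0 375
```

Under those assumptions the span equals the listed gene length, and 1128 / 3 - 1 gives the listed 375 aa.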
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.285306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.564709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTA ACTTCCGCCG TAACGCATTG CAGTTGAGCG TGGCTGCGCT GTTTTCTTCT GCTTTTATGG CTAACGCCGC TGATGTGCCG CAGGTCAAAG TGACCGTGAC GGATAAGCAG TGCGAACCGA TGACCATTAC GGTTAACGCC GGGAAAACAC AGTTCATTAT TCAGAACCAC AGCCAGAAGG CGCTGGAATG GGAGATCCTC AAAGGCGTGA TGGTGGTGGA AGAGCGGGAA AATATCGCCC CTGGCTTTAG CCAGAAAATG ACGGCGAATT TACAGCCTGG CGAATACGAT ATGACCTGCG GTCTGCTGAC TAACCCGAAA GGGAAGTTGA TCGTCAAAGG TGAGGCAACG GCGGATGCGG CGCAAAGTGA TGCGCTGTTA AGTCTTGGTG GTGCAATTAC TGCATATAAA GCGTATGTCA TGGCGGAAAC CACACAGCTG GTGACCGACA CCAAAGCCTT TACCGACGCG ATTAAAGCAG GCGATATCGA AAAAGCGAAA GCACTGTATG CGCCGACGCG CCAGCACTAT GAGCGCATTG AACCGATTGC TGAACTGTTC TCCGATCTGG ATGGCAGCAT TGACGCCCGT GAAGATGATT ACGAGCAAAA AGCCGCTGAT CCAAAATTCA CCGGTTTCCA CCGTCTGGAA AAAGCATTGT TTGGCGACAA CACCACCAAA GGCATGGATC AGTACGCTGA CCAGCTTTAT ACCGATGTGG TCGATTTGCA AAAACGCATC AGTGAACTGG CTTTCCCACC TTCAAAAGTG GTCGGCGGTG CAGCCGGACT GATTGAGGAA GTGGCAGCCA GCAAAATCAG CGGTGAAGAA GATCGCTACA GCCACACCGA TCTGTGGGAT TTCCAGGCTA ACGTTGAAGG CTCGCAGAAA ATTGTCGATC TGCTGCGTCC ACAACTGCAA AAAGCTAACC CGGAACTGTT GGCAAAAGTC GATGCCAACT TTAAAAAGGT CGATACCATT CTGGCGAAAT ACCGTACTAA AGACGGTTTT GAAACCTACG ACAAATTGAC CGATGCCGAC CGGAATGCAC TGAAAGGACC GATTACTGCG CTGGCGGAAG ATCTGGCGCA ACTTCGCGGT GTGCTGGGAT TGGATTAA
|
Protein sequence | MTINFRRNAL QLSVAALFSS AFMANAADVP QVKVTVTDKQ CEPMTITVNA GKTQFIIQNH SQKALEWEIL KGVMVVEERE NIAPGFSQKM TANLQPGEYD MTCGLLTNPK GKLIVKGEAT ADAAQSDALL SLGGAITAYK AYVMAETTQL VTDTKAFTDA IKAGDIEKAK ALYAPTRQHY ERIEPIAELF SDLDGSIDAR EDDYEQKAAD PKFTGFHRLE KALFGDNTTK GMDQYADQLY TDVVDLQKRI SELAFPPSKV VGGAAGLIEE VAASKISGEE DRYSHTDLWD FQANVEGSQK IVDLLRPQLQ KANPELLAKV DANFKKVDTI LAKYRTKDGF ETYDKLTDAD RNALKGPITA LAEDLAQLRG VLGLD
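The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not handle), and it strips the spaces that separate the 10-base blocks. The function name `translate` and the variable names are hypothetical.

```python
# Build the standard codon table; bases are ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
head = "ATGACCATTA ACTTCCGCCG TAACGCATTG"
print(translate(head))  # MTINFRRNAL
```

Passing the full gene sequence to `translate` should yield the listed protein, provided the sequence is copied without loss. The result can be compared against the protein sequence after removing its block spaces in the same way.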