Gene ECH74115_A0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_A0019 
Symbol 
ID6966510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011351 
Strand
Start bp14272 
End bp15894 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content39% 
IMG OID643384050 
Productrepeated sequence found in lipoprotein LPP 
Protein accessionYP_002268529 
Protein GI209395645 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAA CCAAATCTAT TATAGCTGTT TTAGTAGGAC TGTTTGCTGG TAGTGCATAT 
GCAGATTGTG AGTATTACCT TACAGGTAAT AAGGAAACTC TTAACATTTC GGATTGTGTT
GATGGTGTGA AAGACATTGC GAATAATGCC TTGTCAAATT CACAGAATGC ACAAAATGTT
GCAAATGGTG CTGCCTCTAC CGCCCAAAAT GCACAAACAA CCGCAAATGC GGCTAATTCT
GCTGCGCAGA ACGCACAGAA CACGGCTAAT ACGGCACAAA ATACAGCAAA CTCTGCTAAT
TCTATTGCAC AAAATGCTCA ATCGACAGCT AACCAGGCAA TCAAAGATGC GGCAGAAAAA
TCTGAGGCAG CAGAAAAGAA CGCAAACAAC TATACAGACA ATAAAATTAC TGATGTTAAA
AATGAACTGA ATACAAATAT TGATAGTGCT AAAAACGACG CGATCAGTTC ATCTAACAGT
TACACTGACA ACAAAATCAG TGATACGAAG ACAGAACTGA ATGCCAATAT TGATAAGGCG
AAGAATGACG CGATCAGTTC ATCTAACAGC TACACTGACA GCAAAATCAG TGATACGAAG
ACAGAACTGA ATGCCAATAT TGATAAGGCG AAGAATGACG CGATCAGTTC ATCTAACAGC
TACACTGACA GCAAAATCAG TGATACGAAG ACAGAACTGA ATGCCAATAT TGATAAGGCG
AAGAATGACG CGATCAGTTC ATCTAACAGC TACACTGACA GCAAAATCAG TGATACGAAG
ACAGAACTGA ATGCCAATAT TGATAAGGCG AAGAATGACG CGATCAGTTC ATCTAACAGC
TACACTGACA GCAAAATCAG TGATACGAAG ACAGAACTGA ATGCCAATAT TGATAAGGCG
AAGAATGACG CGATCAGTTC ATCTAACAGC TACACTGACA GCAAAATCAG TGATACGAAG
ACAGAACTGA ATGCCAATAT TGATAAGGCG AAGAATGACG CAATCAGTTC ATCTAACAGC
TACACTGACA GCAAAATCAG TGATACGAAG ACAGAATTGA ATGCCAATAT TGATAAGGCA
AAGAATGATG CGATCAGCTC ATCCAACAGT TACACAGACA GTAAAATCAG TGATACGAAA
ACAGAGCTGA ATACCAATAT CAACAATGCC AAAAATGAGT CAATTAGCAC ATCAAAAAAC
TATACGGATA AGAAATATCA ACAAGGTATT AGTTATACAA ATGAAAAATA CGAGCAAAGC
ATACAGTATG CTCAAAATGC GGCTGATAAA GCTGAACAAA ATGCGAATAA CTACACTGAT
AACCGATTTA ATCAGTTAAA CAATCAGTCA AATCAGCGAT TTGAACAATT GAATAAAAAG
ATTGAGCGTG CTGAAAAACG TCTTAATGCC GGTATTGCAG GTGTCGCCGC AATATCTTCA
ATTCCATATG TAGCTGAGAA TAATTTTTCA TATGGTGTTG GGCTTGGTAA TTATCAGAAT
GGAAACGCAA TTGCTGCGGG TATTCAATAT AAAACATCAG CAAATACAAA TGTGCGCCTT
AACGTCTCAT GGGATTCATC TCATAACACT GTTCTTGGTG CAGGTTTTGC GGGTGGCTGG
TAA
 
Protein sequence
MKVTKSIIAV LVGLFAGSAY ADCEYYLTGN KETLNISDCV DGVKDIANNA LSNSQNAQNV 
ANGAASTAQN AQTTANAANS AAQNAQNTAN TAQNTANSAN SIAQNAQSTA NQAIKDAAEK
SEAAEKNANN YTDNKITDVK NELNTNIDSA KNDAISSSNS YTDNKISDTK TELNANIDKA
KNDAISSSNS YTDSKISDTK TELNANIDKA KNDAISSSNS YTDSKISDTK TELNANIDKA
KNDAISSSNS YTDSKISDTK TELNANIDKA KNDAISSSNS YTDSKISDTK TELNANIDKA
KNDAISSSNS YTDSKISDTK TELNANIDKA KNDAISSSNS YTDSKISDTK TELNANIDKA
KNDAISSSNS YTDSKISDTK TELNTNINNA KNESISTSKN YTDKKYQQGI SYTNEKYEQS
IQYAQNAADK AEQNANNYTD NRFNQLNNQS NQRFEQLNKK IERAEKRLNA GIAGVAAISS
IPYVAENNFS YGVGLGNYQN GNAIAAGIQY KTSANTNVRL NVSWDSSHNT VLGAGFAGGW