Gene ECH74115_2258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2258 
Symbol 
ID6967239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2142532 
End bp2144382 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content56% 
IMG OID643386142 
Producthypothetical protein 
Protein accessionYP_002270629 
Protein GI209399622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0000000616827 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG 
CTGACACACA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT
TATACCCTGA TGCAGGCGAT TACAGCAGAA GGTGATGTGG TGGTCAGTGG TGCAACTGAG
CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA
GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC
CGCAGTACAG TGACGCCGGG TGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCCGAC
CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC
AAAGGGCAGT ACGGCTGTGT CGGCCAGGGG TTACATATTG CCAAAAAACT GCTCCCGTAT
ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTTACC
CAGGGCGCGG AGGGGACATT CAGCGAGTCC ACGGGGGCCA GCCAGGATTC GGCACGCTGG
GGGGTGGGCA AGCCGTTATA TCAGGATCTG ATTTCCCGCA CCAAAGCGGC ATTGCAGAAA
AATCCCAAAA ACGTTCTGCT GGCCGTCTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC
GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGGCACAGTT TCGTGCTGAC
CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT
GGTGACACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC
GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC
GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT
TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC
AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA
GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC
GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG
GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA
GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG
GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA
GGCGCGCTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT
CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTTACTCAG
ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG
GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT
GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC
AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT
GGCGTTGAGA TAGAAAGTCT GGTGCTGGAG ATAAATGCAC CGGCAGCATA A
 
Protein sequence
MSIKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAITAE GDVVVSGATE 
PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD
HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT
QGAEGTFSES TGASQDSARW GVGKPLYQDL ISRTKAALQK NPKNVLLAVC WMQGEFDMSA
ATHAQQPALF TAMLAQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG
GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF
SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP
AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS
GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG
EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY
GVEIESLVLE INAPAA