Gene ECH74115_2188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2188 
Symbol 
ID6971236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2094628 
End bp2096478 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content56% 
IMG OID643386081 
Producthypothetical protein 
Protein accessionYP_002270570 
Protein GI209399696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.162911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000018311 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG 
CTGACACACA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT
TATACCCTGA TGCAGGCGAT TACAGCAGAA GGTGATGTGG TGGTCAGTGG TGCAACTGAG
CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA
GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC
CGCAGTACAG TGACGCCGGG TGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCCGAC
CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC
AAAGGGCAGT ACGGCTGTGT CGGCCAGGGG TTACATATTG CCAAAAAACT GCTCCCGTAT
ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTTACC
CAGGGCGCGG AGGGGATATT CAGCGAGTCC ACGGGGGCCA GCCAGGATTC GGCGCGCTGG
GGTGTGGGTA AACCGTTATA TCAGGACCTG ATTGCGCGCA CCAAAGCTGC ATTACAGAAG
AACCCGAAAA ATGTGTTGCT GGCGGTGTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC
GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC
CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT
GGTGACACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC
GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC
GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT
TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC
AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA
GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC
GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG
GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA
GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG
GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA
GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA CGCTCCCGTT
CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG
ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG
GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT
GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC
AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT
GGCGTTGAGA TAGAAAGTCT GATGCTGGAG ATAAATGCAC CGGCAGCATA A
 
Protein sequence
MSIKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAITAE GDVVVSGATE 
PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD
HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT
QGAEGIFSES TGASQDSARW GVGKPLYQDL IARTKAALQK NPKNVLLAVC WMQGEFDMSA
ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG
GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF
SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP
AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS
GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG
EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY
GVEIESLMLE INAPAA