Gene ECH74115_1168 details

Gene Information

Locus tag: ECH74115_1168
Symbol:
ID: 6972429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1183499
End bp: 1185349
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 55%
IMG OID: 643385167
Product: hypothetical protein
Protein accession: YP_002269663
Protein GI: 209399945
COG category:
COG ID:
TIGRFAM ID:
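
The gene length and protein length in the table above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1183499
END_BP = 1185349


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1851
print(protein_length(length_bp))  # 616
```

Under these assumptions the results agree with the listed Gene Length (1851 bp) and Protein Length (616 aa).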


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.44098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG 
CTGACACACA AACTGAAAGA GGGCTGGCAG CCATACGGCG GACCGGTTGC CATTACGCCG
TACACACTGA TGCAGGCGGT GGCTATTGAA GGAGATCCAC AGGTCGGCCC TTCATCTAAG
CCGGACTGGT TCTACGTGGT TGTGCTTGCC GGACAGTCCA ACGGCATGGC CTACGGTGAA
GGGCTTCCGT TACCGGATTC TTACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC
CGCAGCACGG TAACTCCGGG TGGAGAGAGT TGTACGTATA ACGACATCAT TCCGGCCGAC
CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC
AAAGGGCAGT ACGGCTGTGT CGGCCAGGGC TTACATATTG CCAAAAAACT GCTTCCGTAT
ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC TGCATTCACC
CAGGGCGCTG AGGGGACATT CAGTGCGGAC ACGGGGGCCA GCCAGGATTC GGCACGCTGG
GGTGTGGGTA AACCGTTATA TCAGGACCTG ATTGCGCGCA CTAAAGCTGC ATTACAGAAG
AACCCGAAAA ATGTGTTGCT GGCGGTGTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC
GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC
CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT
GGTGATACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC
GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC
GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT
TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC
AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA
GCCGGGCGCA CCTCCGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC
GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG
GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA
GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG
GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA ACTGTAAGTT CCGCCTGTCA
GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT
CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG
ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG
GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT
GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC
AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT
GGCGTTGAGA TAGAAAGTCT GGTGCTGGAG ATAAATGCAC CGGCAGCATA A
 
Protein sequence
MAFKHYDVVR AASPSDLAEK LTHKLKEGWQ PYGGPVAITP YTLMQAVAIE GDPQVGPSSK 
PDWFYVVVLA GQSNGMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGES CTYNDIIPAD
HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT
QGAEGTFSAD TGASQDSARW GVGKPLYQDL IARTKAALQK NPKNVLLAVC WMQGEFDMSA
ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG
GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF
SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP
AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLNCKFRLS
GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG
EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY
GVEIESLVLE INAPAA
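
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, how a GC percentage could be computed from it, and how it could be translated. It is not the method the database used. The helper names are hypothetical, and the codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons).

```python
BASES = "TCAG"
# Standard codon-to-amino-acid mapping, ordered by BASES at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Unrecognised codons become 'X'; a trailing partial codon is ignored.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGCATTTA AACACTATGA TGTTGTCAGG")
print(translate(first_blocks))  # MAFKHYDVVR
```

The translated residues match the first block of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.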