Gene ECH74115_3879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3879 
Symbol 
ID6969705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3588360 
End bp3590213 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content55% 
IMG OID643387657 
Producthypothetical protein 
Protein accessionYP_002272106 
Protein GI209399258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG 
CTGACACACA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT
TATACCCTGA TGCAGGTGAT TACAGCAGAA GGTGATGTGG TGGTCAGTGG TGCAACTGAG
CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA
GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC
CGCAGTACAG TTACGCCGGG TGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCCGAC
CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC
AAAGGGCAGT ACGGCTGTGT CGGCCAGGGC TTACATATTG CCAAAAAACT GCTTCCGTAT
ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTCACC
CAGGGCGCGG AGGGGACATT CAGTGCGGAC GCGGGGGCCA GCCAGGATTC GGCGCGCTGG
GGTGTGGGTA AACCGTTATA TCAGGACCTG ATTGCGCGCA CTAAAGCTGC ATTACAGAAG
AACCCGAAAA ATGTGTTGCT GGCGGTGTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC
GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGGCACAGTT TCGTGCTGAC
CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT
GGTGACACGA CGTATTACTG GAAAAATACC TACGGCACCC AGTACAACAC CATTTACGGG
GCGTACAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC
GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT
TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC
AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTATGG CAACCGCTAT TCTGAACGCA
GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC
GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG
GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA
GATGGTGTAT TTAAGATCAC CAAGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG
GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA
GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT
CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG
ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG
GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT
GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC
AGTATGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT
GGCGTTGAGA TAGAAAGTCT GGTGCTGGAG ATAAATGCAC CGGCATCATC ATAA
 
Protein sequence
MTFKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQVITAE GDVVVSGATE 
PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD
HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT
QGAEGTFSAD AGASQDSARW GVGKPLYQDL IARTKAALQK NPKNVLLAVC WMQGEFDMSA
ATHAQQPALF TAMLAQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YGTQYNTIYG
AYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF
SSWARRSIIP DRMATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP
AAGEAAAQGW SIKDGGIQLS DGVFKITKQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS
GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG
EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SMTLGLNALT LTDVTKNAAY
GVEIESLVLE INAPASS