Gene ECH74115_2790 details

Gene Information

Locus tag: ECH74115_2790
Symbol:
ID: 6968329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2602440
End bp: 2604290
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 56%
IMG OID: 643386643
Product: hypothetical protein
Protein accession: YP_002271122
Protein GI: 209397280
COG category:
COG ID:
TIGRFAM ID:
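
The start, end, gene length, and protein length fields in the table above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2602440
end_bp = 2604290

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# which is not counted as an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1851 0 616
```

Under these assumptions the computed values agree with the listed Gene Length (1851 bp) and Protein Length (616 aa).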


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.578932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG 
CTGACACACA AACTGAAAGA GGGCTGGCAG CCATACGGCG GACCGGTTGC CATTACGCCG
TACACACTGA TGCAGGCGGT GGCTATTGAA GGAGAGCCAC AGGTCGGCCC TTCATCTGAG
CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA
GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC
CGCAGTACAG TGACGCCGGG CGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCTGAC
CACTGTCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAGGGC TGACCTGAGC
AAAGGGCAGT ACGGCTGTGT CGGCCAGGGT TTACATATTG CCAAAAAACT GCTCCCGTAT
ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTTACC
CAGGGCGCGG AGGGGACATT CAGCGAGTCC ACGGGGGCCA GTCAGGATTC GGCACGCTGG
GGGGTGGGCA AGCCGTTATA TCAGGATCTG ATTTCCCGCA CAAAAGCGGC ATTGCAGAAA
AATCCCAAAA ACGTTCTGCT GGCCGTCTGC TGGATGCAGG GTGAGTTTGA CATGAGCGCC
GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC
CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGCGCTG CAGATGTGCC GTGGATTTGT
GGTGACACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC
GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC
GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT
TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC
AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA
GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC
GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG
GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA
GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG
GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA
GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT
CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG
ACCACTGACG GCAGGGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG
GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT
GCCATGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC
AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT
GGCGTTGAGA TAGAAAGTCT GATGCTGGAG ATAAATGCAC CGGCAGCATA A
 
Protein sequence
MSIKHYDVVR AASPSDLAEK LTHKLKEGWQ PYGGPVAITP YTLMQAVAIE GEPQVGPSSE 
PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD
HCLHDVQDMS TLNHPRADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT
QGAEGTFSES TGASQDSARW GVGKPLYQDL ISRTKAALQK NPKNVLLAVC WMQGEFDMSA
ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG
GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF
SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP
AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS
GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG
EFGDYGNDWQ TLELVFTAGS AMVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY
GVEIESLMLE INAPAA
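
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool that produced this record. It builds the codon table from the conventional TCAG ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Drop the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGTCGATTA AACATTATGA TGTTGTCAGG"
print(translate(first_groups))  # MSIKHYDVVR
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to cross-check the listed protein sequence and GC content.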