Gene ECH74115_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0947 
Symbol 
ID6971396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp958579 
End bp960591 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content35% 
IMG OID643384968 
Producthypothetical protein 
Protein accessionYP_002269468 
Protein GI209396431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTGTT CAAAATGCAA TGGTTATGCG ACGATGTTAT TAAATATGGT GCAGGGTTCT 
GACCCGGTTA ATTTACTGGA ATTACATGGT TTCCTTGAAC ATTTCGCATA TTATGTTTCA
TTTGGCAAAT TTAATGCTGG CCACCAGCGC TATAATGCAT TTAAAAAATT CGTCAGTGAG
ATTTCGGAGA TCAGCGCAAA TGATATCAAT ATGACGATTA AAACCGGGCA ATCCAGACAT
GAAAATGTGA TCTCTATAAA TATGAATGAT GCAATTCCTC GTGATGAGAA AGGGATCACC
GTTAGAATAG ATAATATTAA TGGAAAGAAA AACAACTCTA ATTCATCAGA TGTTTTTATT
CCTTATGTAA ATACATTTCC CGACCTCAAA AATAAAATAC TCAGGATGAA AATTGAACTG
ACTGAAGGTA GTGGTTTTTC TAAAAGTCTT TCTGACAGTC AGATTGAGAT GCATATTCTG
AGGACTGTTA ATTCGCTAAA TGTTGGCGAG AAATTGAATG ATGATAATCT TGCAGATCAT
TCTATATTCA CTAACGAATT CTCTGTTATT ATTCCGCCAT CCTATTATGA TGCAACTTCT
GCAGTAAACG CTAATAATAT TGTGAGAGAA AAACTATTTG AAAGTGATAG TAAAGTTAAA
GATATCGTCG ATGATATGTC TAATCATGAC GTGGAGAGCG AAAAAGATAT ATTTGTAATC
GGGGGGATGA TAGAAAAACT GAATTCTCTG GCAGACGAAT CTTTCAATGA TAGTACTGAT
AACATACAAA CAGTAAAAGA TCTGTTAACG CAATTAACTG ATGGCATGGA GCTCTTCGCG
CTGCGGGACA TAGTGGCATT CCCATCAACC ATAATAGCAA AATTAATAAA ATCCCCACTT
AATAGCGATC ATGAACTTGT TATGCGCGCA CTGGATACGT ATCTGTGCTA CTTCAGAAAT
AAAAATCTGA ATAACAATGC GGAAATAATT AACTTTTTCC ATGCCCTCTT TTTAAAGAGA
CCAGAGTTGA TGGTTGCTGA AAATTATAGA TTTATTCAAT TTATTGATTT GCTTTTTGAA
AACGGAAATG TTGAAGAGAA AAACTTAGCA TTTGACCTTT ATCACAATTA TCTTTCATTA
TCTGAAATAA AACAATTCGT TACTGAAGAA ATTAAATTGA ATTTCAATGA GCAGCAAGGA
TTGTTAGATA AAGATAATAA ATGTTATATC CTGTTATCAT CCGATAACTC TGGGCGAGTA
ATGCGCTTAT CGCACCAGGC GTTAATCTCA ATGCTTGAGC CAGAGGTTAA GAAAAAGACT
ATCTGGAATA ATTACTCTAT TTATCCATCT TTGCAGGATA CCCATGAAGT TGTCAGAGAT
GATCCAGAAA CGATTTGTAT GCGTGCATTT CCTTTATTTG CAAAAGGATG GGAATATGCG
CAGAAAAATA AGAAACATCA ATTGATCCTC AATGCTCTTG GATTCAAAGG TTATATTCGT
GATATATTTA TGTCAGCAAT AATGCGAAAA ACAGATTTTG TGCTCGAATG CAATAATCAA
CCGACAGAAC TTAATTCATC ATTTTCCTCT TTGATGAACG ATTCAGATCA ATGGCAACAG
CATACTCTGA AAGATAAACA TTACGCTAAT CTATTAACGA TGCTGGATCT CAATGATGCC
AGTGAAAGTG ATAAGTCTAA AATATTTTTC TGCCTGTCTG CCGTATTTGC AAATATTTCT
CACAGTAATG TCTTTAACGG TATTCCTGAT GCGTCAAAAA CTTTAAAACG TTATGCATTT
GCTTTATTGG CGAAAGCTCA TTCTTTAGAT GAAAGTATGA TAAGCAATCA AACTTTTAAT
ACATACAAAA CGGTATTGCT TGATTTCAAT AATCTGTCTA ATGAAGAGGC CAACCAGTTG
CGCATAAGTT CGTTATATCG TGATATGGTC CGCTATGCAC AATATCGTTT TTCTAAAGTT
TTATCCGAGT GGACGCCAGA CGCATGGCTG TAA
 
Protein sequence
MDCSKCNGYA TMLLNMVQGS DPVNLLELHG FLEHFAYYVS FGKFNAGHQR YNAFKKFVSE 
ISEISANDIN MTIKTGQSRH ENVISINMND AIPRDEKGIT VRIDNINGKK NNSNSSDVFI
PYVNTFPDLK NKILRMKIEL TEGSGFSKSL SDSQIEMHIL RTVNSLNVGE KLNDDNLADH
SIFTNEFSVI IPPSYYDATS AVNANNIVRE KLFESDSKVK DIVDDMSNHD VESEKDIFVI
GGMIEKLNSL ADESFNDSTD NIQTVKDLLT QLTDGMELFA LRDIVAFPST IIAKLIKSPL
NSDHELVMRA LDTYLCYFRN KNLNNNAEII NFFHALFLKR PELMVAENYR FIQFIDLLFE
NGNVEEKNLA FDLYHNYLSL SEIKQFVTEE IKLNFNEQQG LLDKDNKCYI LLSSDNSGRV
MRLSHQALIS MLEPEVKKKT IWNNYSIYPS LQDTHEVVRD DPETICMRAF PLFAKGWEYA
QKNKKHQLIL NALGFKGYIR DIFMSAIMRK TDFVLECNNQ PTELNSSFSS LMNDSDQWQQ
HTLKDKHYAN LLTMLDLNDA SESDKSKIFF CLSAVFANIS HSNVFNGIPD ASKTLKRYAF
ALLAKAHSLD ESMISNQTFN TYKTVLLDFN NLSNEEANQL RISSLYRDMV RYAQYRFSKV
LSEWTPDAWL