Gene ECH74115_5774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5774 
Symbol 
ID6970759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5406158 
End bp5407894 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content32% 
IMG OID643389404 
Producthypothetical protein 
Protein accessionYP_002273797 
Protein GI209398851 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAA TATCAGATTT GAATTATTCT CAACACATTA CATTAGCCGA CAATTTTAAA 
CAAAAAAGTG AAGTTTTAAA TACCTGGCGT GTTGGAATGA ATAATTTTGC CCGTAATGCC
GGGGGGCAGG ATAACACAAG AAATATCCTT AATCCTAAGA CATTTTTGGA GTTTTTGGTA
AAAATATTTA CCCTGGGTTA TGTGGATTTT AGCAAACGCT CCAACGAAGC GGGAAGAAAT
ATGATGGCTC ATATTGAGTC CTCATCTTAT ATCAAAAATA ATGATGGCAG TGAGATAATG
AAGTTTGTTA TGAATAATCC TGAAGGGGAA CGAGCGGATT CACCCAAGGT GATTATAGAA
ATTTCACTTT CCACTATTAC TACTATGGGG ACTCGTCAAG GACATACAGC CATTATATTT
CCACAACCTG ATGGTTCGAC TAACCGTTAT GAAAGAAAGT CCTTTGAAAG AAAAGATGAG
AGTTCATTAC ACCTGATTAC TAACAAGGTT CTGGCGTGTT ACCAACGCGA AGCTAACAAG
GAAATAGCTC GTCTATTAAA TAATCATCAG AAGTTAAATA ATCTACAGAA GTTAAATAAT
CTACAGAAGT TAAATAATAT ACAGAAGTTA AATAATATAC AGGAGTTAAA TAATTCGCAG
GAGTTAAATA ATTCGCAGGA GTTAAATAAC TCGCAGGACT TAAAAAATTC GCAGGTGAGT
TGTAAAGGTT CAGTTGATTT TACGATTACG GATTTATTAG AAAAATCATT GAATAATGCA
TTATTAGCAA TAAGGAACGA ACATCTGCTA TTAATGCCTC ATGTATGTAG TGAATCGATT
TCATACTTAC TGGGCGAAAA TGGTATACTT GAAGAAATAG ATAAGCTCTA CGAATTAAAT
GATCACGGAA TTGATAATGA CAAAGAAGGT AACAATGAAA TTAATGACAT CATGATTAAC
CTGTCTCATA TTCTTATTGA ATCCTTAGAT GATGCAAAGG TTAATCTTAC ACCGGTCATC
CATTCGATGT TGATGACTTT TTTAGAATTG CCATATAATA ATGATGTAAA AATACTGGAG
TGGTGTTTTA ATAAAAGCAT GCAATATTTT GATGATTCTG CAAAGATAGA GCATGCATGC
TCCGTAATAA ATCATATTAA TTTTCGTCGC GATCAGTCTA AAGTAGCTGA GACATTATTT
TTCAATCTCG ATAAAGAACC CTATAAAAAT AGCCCTGAAT TACAGGAGTT GATTTGGAAA
AAGTTGGTTG TATATGTCAA TGATTTTAAC TTAAGCAATC GAGAAAAAAC ATATTTAATA
CAAAGAATAT TTAATAATGT TGAGTCACTA TTTAATAAAG TACCTGTCAG TATTTTAGTT
AATGATATTT TTATGAATGA TTTTTTTATG AAAAACACTG AGATGATTAA TTGGTACTTC
CCTCGGTTAC TTAAGAGCTA TGAGGATGAA AAGATTTATT TTGATAAGTT AGGGTATAAT
TTTAATAATA AAGAGTCTAA TGAAGAGATT ATGAAAAATC AACCAAAAGA TGTTATTGAA
GAAAAACTTA ATAATGAATT AAAACTTAGG TTTAGAATGA TGCAAACTAT CTTGAAATCG
GAGGTTAATG TATCGCCATT TATTGACCAA CAGCGTTTAA ATACACTAAA TCCTCCGGAA
AATTTACGTA TAGCAATAGA AAAATTTGGC TGGAAGAAAA AAACTATCAC TGCATAA
 
Protein sequence
MSKISDLNYS QHITLADNFK QKSEVLNTWR VGMNNFARNA GGQDNTRNIL NPKTFLEFLV 
KIFTLGYVDF SKRSNEAGRN MMAHIESSSY IKNNDGSEIM KFVMNNPEGE RADSPKVIIE
ISLSTITTMG TRQGHTAIIF PQPDGSTNRY ERKSFERKDE SSLHLITNKV LACYQREANK
EIARLLNNHQ KLNNLQKLNN LQKLNNIQKL NNIQELNNSQ ELNNSQELNN SQDLKNSQVS
CKGSVDFTIT DLLEKSLNNA LLAIRNEHLL LMPHVCSESI SYLLGENGIL EEIDKLYELN
DHGIDNDKEG NNEINDIMIN LSHILIESLD DAKVNLTPVI HSMLMTFLEL PYNNDVKILE
WCFNKSMQYF DDSAKIEHAC SVINHINFRR DQSKVAETLF FNLDKEPYKN SPELQELIWK
KLVVYVNDFN LSNREKTYLI QRIFNNVESL FNKVPVSILV NDIFMNDFFM KNTEMINWYF
PRLLKSYEDE KIYFDKLGYN FNNKESNEEI MKNQPKDVIE EKLNNELKLR FRMMQTILKS
EVNVSPFIDQ QRLNTLNPPE NLRIAIEKFG WKKKTITA