Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5774 |
Symbol | |
ID | 6970759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5406158 |
End bp | 5407894 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643389404 |
Product | hypothetical protein |
Protein accession | YP_002273797 |
Protein GI | 209398851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAA TATCAGATTT GAATTATTCT CAACACATTA CATTAGCCGA CAATTTTAAA CAAAAAAGTG AAGTTTTAAA TACCTGGCGT GTTGGAATGA ATAATTTTGC CCGTAATGCC GGGGGGCAGG ATAACACAAG AAATATCCTT AATCCTAAGA CATTTTTGGA GTTTTTGGTA AAAATATTTA CCCTGGGTTA TGTGGATTTT AGCAAACGCT CCAACGAAGC GGGAAGAAAT ATGATGGCTC ATATTGAGTC CTCATCTTAT ATCAAAAATA ATGATGGCAG TGAGATAATG AAGTTTGTTA TGAATAATCC TGAAGGGGAA CGAGCGGATT CACCCAAGGT GATTATAGAA ATTTCACTTT CCACTATTAC TACTATGGGG ACTCGTCAAG GACATACAGC CATTATATTT CCACAACCTG ATGGTTCGAC TAACCGTTAT GAAAGAAAGT CCTTTGAAAG AAAAGATGAG AGTTCATTAC ACCTGATTAC TAACAAGGTT CTGGCGTGTT ACCAACGCGA AGCTAACAAG GAAATAGCTC GTCTATTAAA TAATCATCAG AAGTTAAATA ATCTACAGAA GTTAAATAAT CTACAGAAGT TAAATAATAT ACAGAAGTTA AATAATATAC AGGAGTTAAA TAATTCGCAG GAGTTAAATA ATTCGCAGGA GTTAAATAAC TCGCAGGACT TAAAAAATTC GCAGGTGAGT TGTAAAGGTT CAGTTGATTT TACGATTACG GATTTATTAG AAAAATCATT GAATAATGCA TTATTAGCAA TAAGGAACGA ACATCTGCTA TTAATGCCTC ATGTATGTAG TGAATCGATT TCATACTTAC TGGGCGAAAA TGGTATACTT GAAGAAATAG ATAAGCTCTA CGAATTAAAT GATCACGGAA TTGATAATGA CAAAGAAGGT AACAATGAAA TTAATGACAT CATGATTAAC CTGTCTCATA TTCTTATTGA ATCCTTAGAT GATGCAAAGG TTAATCTTAC ACCGGTCATC CATTCGATGT TGATGACTTT TTTAGAATTG CCATATAATA ATGATGTAAA AATACTGGAG TGGTGTTTTA ATAAAAGCAT GCAATATTTT GATGATTCTG CAAAGATAGA GCATGCATGC TCCGTAATAA ATCATATTAA TTTTCGTCGC GATCAGTCTA AAGTAGCTGA GACATTATTT TTCAATCTCG ATAAAGAACC CTATAAAAAT AGCCCTGAAT TACAGGAGTT GATTTGGAAA AAGTTGGTTG TATATGTCAA TGATTTTAAC TTAAGCAATC GAGAAAAAAC ATATTTAATA CAAAGAATAT TTAATAATGT TGAGTCACTA TTTAATAAAG TACCTGTCAG TATTTTAGTT AATGATATTT TTATGAATGA TTTTTTTATG AAAAACACTG AGATGATTAA TTGGTACTTC CCTCGGTTAC TTAAGAGCTA TGAGGATGAA AAGATTTATT TTGATAAGTT AGGGTATAAT TTTAATAATA AAGAGTCTAA TGAAGAGATT ATGAAAAATC AACCAAAAGA TGTTATTGAA GAAAAACTTA ATAATGAATT AAAACTTAGG TTTAGAATGA TGCAAACTAT CTTGAAATCG GAGGTTAATG TATCGCCATT TATTGACCAA CAGCGTTTAA ATACACTAAA TCCTCCGGAA AATTTACGTA TAGCAATAGA AAAATTTGGC TGGAAGAAAA AAACTATCAC TGCATAA
|
Protein sequence | MSKISDLNYS QHITLADNFK QKSEVLNTWR VGMNNFARNA GGQDNTRNIL NPKTFLEFLV KIFTLGYVDF SKRSNEAGRN MMAHIESSSY IKNNDGSEIM KFVMNNPEGE RADSPKVIIE ISLSTITTMG TRQGHTAIIF PQPDGSTNRY ERKSFERKDE SSLHLITNKV LACYQREANK EIARLLNNHQ KLNNLQKLNN LQKLNNIQKL NNIQELNNSQ ELNNSQELNN SQDLKNSQVS CKGSVDFTIT DLLEKSLNNA LLAIRNEHLL LMPHVCSESI SYLLGENGIL EEIDKLYELN DHGIDNDKEG NNEINDIMIN LSHILIESLD DAKVNLTPVI HSMLMTFLEL PYNNDVKILE WCFNKSMQYF DDSAKIEHAC SVINHINFRR DQSKVAETLF FNLDKEPYKN SPELQELIWK KLVVYVNDFN LSNREKTYLI QRIFNNVESL FNKVPVSILV NDIFMNDFFM KNTEMINWYF PRLLKSYEDE KIYFDKLGYN FNNKESNEEI MKNQPKDVIE EKLNNELKLR FRMMQTILKS EVNVSPFIDQ QRLNTLNPPE NLRIAIEKFG WKKKTITA
|
| |