Gene Information |
Locus tag | ECH74115_B0062 |
Symbol | |
ID | 6966480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 27856 |
End bp | 29217 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643383964 |
Product | hypothetical protein |
Protein accession | YP_002268443 |
Protein GI | 209395615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
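The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption about this record's coordinate convention, though it is consistent with the listed values) and that the final codon of the CDS is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(27856, 29217)
print(length, protein_length(length))  # prints: 1362 453
```

Both results agree with the Gene Length and Protein Length rows of this record.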
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00411387 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCAG CAGCAATCAC CATGACCGCC CCGGAAGCCG CCAGCCCTGT GCAGATGTAC CGCGCGACCT ACTCACCGGA TGACAACAAA CTGCGCCTGT ATGCCGTGTC ACGTCTTGAC CCGGAGACGT ATAAAAAAGT GCATGATGCC GGTTTTCGCT GGGCACCAAA ACAGGCGCTG TTTGTCGCGC CGGCCTGGAC ACCGGGCCGG GAAGACGTGC TCCTCTCACT TGCCGGAGAG ATTGAGGATG AAGACAGCAC GCTCGCTGAA CGTCAGGAAG CACGGGCGGA GCGGTTTACC GGATACAGCG GAAAGCGGGC CAGTGAATCC GCACAGGCAC TTGATGAAGT GGAAAGACTG GCCGCGATGA TCCCGCCCGG TCAGCCCATT CTTGTGGGGC ATCACAGCGA ACGCCGCGCC CGTCGTGATG CGCAGCGTAT TGAAAACGGC ATGAAACGTG CCGTGATGCT CTTTGAACGT GCGGAATACT GGGAAGAACG GGGGCGGTCA GCACTGCTTC ACGCGAAGTA TAAAGAACGT CCGGACGTTC GCTGGCGTCG TATCAAAAAA ATCGAAGCTG ATTTGCGCAA GGCTGAAAAG ACCATCGCGC AGTCGCAGAA ATATCTGACG ATGTGGCGGG CTGAATCGCT GGATCTGAAT ATGGCAAAAC TCATCAGCAG TCATGACCAT ATCAGCGCCT GTTTCCCGCT GGATACGTAT CCGCGCCCGG CAGAAAAAAG CCAGTATGAA GGGAGCCGGT CGTTATGGTC GGCCCTGGAT GATGACATCA TCACCACGGA GCAGGCCCGC GAAATTGCGA TCCGCTGTCA TGAACGGCAG ATTCAGCATC AGCAACGCTG GGTTAACCAC TATCAGAACC GCCTGAACTA TGAGCGTGCC ATGCTGGACG AAAGCGGCGG CGTGGTTACC CGGACACAGG ATTTTGAGCC GGGCGGACAG GTTTTCAGCC GGGGCGAGTG GCTGACCATC ATCCGCGTGA ACAAAAGCAA CGGGGCGGTG AGTTCAGTCA CAACGCCGAA TTACAGTTTT CTCGGGTACA GCGGCACGAT GAAAGTGACG CCCGATCGCA TCACGGACTA CAAAGCACCA TCGGCAGAAG AGGCTGCCGT CGCCAGCCAG GCCGCGAAGC GTCCGCCGGT AGTCAACTAT CCGGGGGAAG GTTTCCGGGA AATGACAAAG GCACAGTGGG CCGCCCTGCC CCGGGACTGT AAGGCCGTGC GCAGTGTGGC AGAAGCAGAA GACCACGGGG CATACCGCTA CCGCCGCACA ATGGACAATA ATTTCCGTCT GGTGAATGTG TATATCACCG ACATGAAAAT TACGGAAATC CCACAGAAAT AA
|
Protein sequence | MTSAAITMTA PEAASPVQMY RATYSPDDNK LRLYAVSRLD PETYKKVHDA GFRWAPKQAL FVAPAWTPGR EDVLLSLAGE IEDEDSTLAE RQEARAERFT GYSGKRASES AQALDEVERL AAMIPPGQPI LVGHHSERRA RRDAQRIENG MKRAVMLFER AEYWEERGRS ALLHAKYKER PDVRWRRIKK IEADLRKAEK TIAQSQKYLT MWRAESLDLN MAKLISSHDH ISACFPLDTY PRPAEKSQYE GSRSLWSALD DDIITTEQAR EIAIRCHERQ IQHQQRWVNH YQNRLNYERA MLDESGGVVT RTQDFEPGGQ VFSRGEWLTI IRVNKSNGAV SSVTTPNYSF LGYSGTMKVT PDRITDYKAP SAEEAAVASQ AAKRPPVVNY PGEGFREMTK AQWAALPRDC KAVRSVAEAE DHGAYRYRRT MDNNFRLVNV YITDMKITEI PQK
|
| |
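The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the amino acid assignments that NCBI translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that is harmless here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 12 bases of the gene sequence listed above.
print(translate("ATGACATCAG CAGCAATCAC CATGACCGCC"))  # prints: MTSAAITMTA
print(translate("CCACAGAAAT AA"))                     # prints: PQK
```

The two outputs match the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.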