Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0947 |
Symbol | |
ID | 6971396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 958579 |
End bp | 960591 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643384968 |
Product | hypothetical protein |
Protein accession | YP_002269468 |
Protein GI | 209396431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTGTT CAAAATGCAA TGGTTATGCG ACGATGTTAT TAAATATGGT GCAGGGTTCT GACCCGGTTA ATTTACTGGA ATTACATGGT TTCCTTGAAC ATTTCGCATA TTATGTTTCA TTTGGCAAAT TTAATGCTGG CCACCAGCGC TATAATGCAT TTAAAAAATT CGTCAGTGAG ATTTCGGAGA TCAGCGCAAA TGATATCAAT ATGACGATTA AAACCGGGCA ATCCAGACAT GAAAATGTGA TCTCTATAAA TATGAATGAT GCAATTCCTC GTGATGAGAA AGGGATCACC GTTAGAATAG ATAATATTAA TGGAAAGAAA AACAACTCTA ATTCATCAGA TGTTTTTATT CCTTATGTAA ATACATTTCC CGACCTCAAA AATAAAATAC TCAGGATGAA AATTGAACTG ACTGAAGGTA GTGGTTTTTC TAAAAGTCTT TCTGACAGTC AGATTGAGAT GCATATTCTG AGGACTGTTA ATTCGCTAAA TGTTGGCGAG AAATTGAATG ATGATAATCT TGCAGATCAT TCTATATTCA CTAACGAATT CTCTGTTATT ATTCCGCCAT CCTATTATGA TGCAACTTCT GCAGTAAACG CTAATAATAT TGTGAGAGAA AAACTATTTG AAAGTGATAG TAAAGTTAAA GATATCGTCG ATGATATGTC TAATCATGAC GTGGAGAGCG AAAAAGATAT ATTTGTAATC GGGGGGATGA TAGAAAAACT GAATTCTCTG GCAGACGAAT CTTTCAATGA TAGTACTGAT AACATACAAA CAGTAAAAGA TCTGTTAACG CAATTAACTG ATGGCATGGA GCTCTTCGCG CTGCGGGACA TAGTGGCATT CCCATCAACC ATAATAGCAA AATTAATAAA ATCCCCACTT AATAGCGATC ATGAACTTGT TATGCGCGCA CTGGATACGT ATCTGTGCTA CTTCAGAAAT AAAAATCTGA ATAACAATGC GGAAATAATT AACTTTTTCC ATGCCCTCTT TTTAAAGAGA CCAGAGTTGA TGGTTGCTGA AAATTATAGA TTTATTCAAT TTATTGATTT GCTTTTTGAA AACGGAAATG TTGAAGAGAA AAACTTAGCA TTTGACCTTT ATCACAATTA TCTTTCATTA TCTGAAATAA AACAATTCGT TACTGAAGAA ATTAAATTGA ATTTCAATGA GCAGCAAGGA TTGTTAGATA AAGATAATAA ATGTTATATC CTGTTATCAT CCGATAACTC TGGGCGAGTA ATGCGCTTAT CGCACCAGGC GTTAATCTCA ATGCTTGAGC CAGAGGTTAA GAAAAAGACT ATCTGGAATA ATTACTCTAT TTATCCATCT TTGCAGGATA CCCATGAAGT TGTCAGAGAT GATCCAGAAA CGATTTGTAT GCGTGCATTT CCTTTATTTG CAAAAGGATG GGAATATGCG CAGAAAAATA AGAAACATCA ATTGATCCTC AATGCTCTTG GATTCAAAGG TTATATTCGT GATATATTTA TGTCAGCAAT AATGCGAAAA ACAGATTTTG TGCTCGAATG CAATAATCAA CCGACAGAAC TTAATTCATC ATTTTCCTCT TTGATGAACG ATTCAGATCA ATGGCAACAG CATACTCTGA AAGATAAACA TTACGCTAAT CTATTAACGA TGCTGGATCT CAATGATGCC AGTGAAAGTG ATAAGTCTAA AATATTTTTC TGCCTGTCTG CCGTATTTGC AAATATTTCT CACAGTAATG TCTTTAACGG TATTCCTGAT GCGTCAAAAA CTTTAAAACG TTATGCATTT GCTTTATTGG CGAAAGCTCA TTCTTTAGAT GAAAGTATGA TAAGCAATCA AACTTTTAAT ACATACAAAA CGGTATTGCT TGATTTCAAT AATCTGTCTA ATGAAGAGGC CAACCAGTTG CGCATAAGTT CGTTATATCG TGATATGGTC CGCTATGCAC AATATCGTTT TTCTAAAGTT TTATCCGAGT GGACGCCAGA CGCATGGCTG TAA
|
Protein sequence | MDCSKCNGYA TMLLNMVQGS DPVNLLELHG FLEHFAYYVS FGKFNAGHQR YNAFKKFVSE ISEISANDIN MTIKTGQSRH ENVISINMND AIPRDEKGIT VRIDNINGKK NNSNSSDVFI PYVNTFPDLK NKILRMKIEL TEGSGFSKSL SDSQIEMHIL RTVNSLNVGE KLNDDNLADH SIFTNEFSVI IPPSYYDATS AVNANNIVRE KLFESDSKVK DIVDDMSNHD VESEKDIFVI GGMIEKLNSL ADESFNDSTD NIQTVKDLLT QLTDGMELFA LRDIVAFPST IIAKLIKSPL NSDHELVMRA LDTYLCYFRN KNLNNNAEII NFFHALFLKR PELMVAENYR FIQFIDLLFE NGNVEEKNLA FDLYHNYLSL SEIKQFVTEE IKLNFNEQQG LLDKDNKCYI LLSSDNSGRV MRLSHQALIS MLEPEVKKKT IWNNYSIYPS LQDTHEVVRD DPETICMRAF PLFAKGWEYA QKNKKHQLIL NALGFKGYIR DIFMSAIMRK TDFVLECNNQ PTELNSSFSS LMNDSDQWQQ HTLKDKHYAN LLTMLDLNDA SESDKSKIFF CLSAVFANIS HSNVFNGIPD ASKTLKRYAF ALLAKAHSLD ESMISNQTFN TYKTVLLDFN NLSNEEANQL RISSLYRDMV RYAQYRFSKV LSEWTPDAWL
|
| |