Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3099 |
Symbol | |
ID | 6967527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2869933 |
End bp | 2872824 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643386929 |
Product | hypothetical protein |
Protein accession | YP_002271397 |
Protein GI | 209398923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGT TAAGACTGGT GATAACCGAT GATACTGCTC TGAGTACAGT GGAAAAATAC GACTTCCCAC CGTTGTATCG TGATTTTCGT AATTTTCGCG CTTATCTGGC AATGTTGTTG GCAAATAATG GGGTGCGCGG TGTAAGCCGG ATACTTCTTG AGTTTACGGA AGACCATTCT GACAATCCCA CTTATCTCTT CGAACGCATC AGCGAGACTG AAAATCTTGT CAAATGGCTA TGGAAAACGA ATCACCCGGA TGCGATTCAA ATTCTGATCC TCGGTGTAAT TGGTAAGAAA AAGCACCTGG AATACTTAAG CAAGGCCAGT CAAAAGCATC CCGCTGCGGC TATTGCGGCT TATGCTACTT TGCTGGCAAT ACATGAAGAT AAAGAGTGGC GTAAAGCGCT TGTCAAACTG ATTACCGCCA CGCCAGAGTT AGTATGCGAC GTCATTCCCT GGGTTAATGC TAAAGCGGCA GGCATCTTAT CTGAATGTCG TCCGCAGTCA GTCGCTGAAG AATGTGAATA TGCCACTGTG GATATGTTGC CGGAGCTGCT TGTTTCGCCG CCCTGGATGA CCAAAGAGAA AAAGAAAAAT ACGCCTGTGT TTGATCTTCC AGTGCTTCCA GTTCCTTCTG TTTCTGATGT CACTCCAGAG ATTACCAAAA AACTGACCCG TACTTACCTT GTCACCCACT TCCAGCAAAT AGCCCAACAG CAGGCGACAA AACAAACATT GTTTACCGAC CTGCCGCCGA TCAAAAAAGC AAGCTGGGAG AAACACTTAA TACCGCTTAC ACCGGAGCAA CAAATACTGT GGCATCTTGG TTTTGAAAAA TGGCGGGAGA GCGGAGAAAA AATATACGAA AAAATTCCGG CGCCACAAAG CGCCGTTGAT GCCCTGCTAC GCTTCGATTT TCCAGCACTC AATGCTGAAT TTGTGCATTA TCATAACAAT GCATACAAAA GCTGGAATCT CATTGCCCTC TGTTACCTGC CTGGTCAACA GGCCATTTCT TTTCTTAACC AGATAGTCAA AGAAGATAAC TACTCTGGTG AGGGAAATAT TCTGGCAATT TTTGGTAGCG CCGCCATACC TGCATTTATG GCCTGCCTGC AACGTGATCC ACGACGTCTG TGCTTCTTCC CCTTTTTCCT CGGTGTCAGT GAACTGGCCT TACCGATGGC GCAACAATTA CAGAAAAAAA TGTCTTATGA AGATGCGCGA AACTGGCTAA CTGATTATCC ACGCCATGCC GCCGCCGGGC TGCTTCCTGT GGCCCTGGGC AAAAAAGGAA AAGATCGTGA TTGTGCCCGC CAGGCGCTTC GTCTGCTGGT GAATTTAAAC CAGCGAGAAA CGATTGAGGA AATCGCGCAA GGATATAATC AACCTGATGT TTTAGCTGCG CTGGCAACAT TGTTCGATAG CGACCCACTG GAAGAATATC CCGCTAAAAT CGCCCCACTG CCCGGCTTTT ATCAGTTCAC CTTGTGGCGC AGACCACGGC TTAAAAGTAA CAACCTGCCT CTGTCAGATG ACGCTATGCG CCACCTCGGC ACTATGCTGA GCTTTCCTCG CGACATTACC GCCTACGCTG GGCTGGATAT CATTAGAGAA ATCTTCACCC GCGAATCACT GGCTGAATTT GGCTGGGATC TGTATCCCGC CTGGACAGAA GCTGGCGCAC CCGCAAAAGA AAACTGGGCA TTTACTTCGC TTGGGATTTT AGGCAATGAC GACACCGCGC GTAAATTAAC CCCGCTTATC CGCGCCTGGC CTGGGGAATC CCAGCATAAA CGGGCGGTGT CCGGGCTGGA TGTATTAGCT GATATTGGCA GCGATGTTGC GCTAATGCTG CTTAATGGCA TCGCGCGAAA AATTAAATTC AAAGCATTAC AGGAACATGC CCGCGAGAAA ATCAACATAG TTGCTGAAAA CCGTGGGCTG ACTATGGCTG AACTGGAAGA CCGCCTGGCT CCAGATTTAG GGCTTGATAG CAGTGGCTCG CTGATACTGG ATTTCGGCCC CCGCAAATTC ACCGTTGGTT TTGACGAAAC CTTAAAACCT GTAGTGTGCG ATGCAAACGG CAAAGTCCTG AAAGATTTAC CTAAGCCAAA CCAGAGCGAT GAAAAAACTC AGGCAACTGA CGCGGTTAAT CTCTTCAAAC AGTTGAAAAA AGATGTACGC GCCATAGCCA GCCAGCAGAT TGATCGTCTG GAACAGGCTA TGTGCCAGCG CAGACGCTGG ACGGCAGAGC AGTTCCGCCT GTTTCTGGTG GAGCATCCGC TGGTACGTCA CTTAACCCGA CGGCTGTTAT GGGGGGTTTA TAACGATGAA AACGCCCTTA TCACCTGTTT TCGCGTGGCA GAAGACAGCA CTTACAGCGA TGCGCAGGAT GAGTTATTCA CGCTGCCAGC AGGAAACATC GGTATTCCGC ATGTGCTGGA AATATCCCCT GAATCCGCCG CTGCATTCAG GCAAATTTAC GCTGATTACG AACTGCTTCC CCCTTTCCAA CAGCTCGAGC GAGGTAGCTA TCACCTTGCT GATAATGAAC GTAATACTCA CGAACTGACA CGCTGGCAAG GACGGCTTTG CCAGGCCGGA CGCATTGTCG GGCTGGAACG CAGAGGCTGG CAACGTCTGG AAGAAAGCGG CAGCGTTTAT GCGATGCGCA AAACAACCCC TCATGGCGAT CTCGAACTTG AAACGGAACC TTTTTCATTA ATTTATGGTG AAACGGGGTA TGGCGACCAG CACCCGGTTG AAAGCGTAAA AATCACCTCG CCAGATGATC GCTACGGTAA ACAATCCTCA CTCACTTTCT CCATGCTCGA CGACATCACC GCCAGCGAGC TGATTAACGA CATTGAATCA CTGTTTGATT AA
|
Protein sequence | MTWLRLVITD DTALSTVEKY DFPPLYRDFR NFRAYLAMLL ANNGVRGVSR ILLEFTEDHS DNPTYLFERI SETENLVKWL WKTNHPDAIQ ILILGVIGKK KHLEYLSKAS QKHPAAAIAA YATLLAIHED KEWRKALVKL ITATPELVCD VIPWVNAKAA GILSECRPQS VAEECEYATV DMLPELLVSP PWMTKEKKKN TPVFDLPVLP VPSVSDVTPE ITKKLTRTYL VTHFQQIAQQ QATKQTLFTD LPPIKKASWE KHLIPLTPEQ QILWHLGFEK WRESGEKIYE KIPAPQSAVD ALLRFDFPAL NAEFVHYHNN AYKSWNLIAL CYLPGQQAIS FLNQIVKEDN YSGEGNILAI FGSAAIPAFM ACLQRDPRRL CFFPFFLGVS ELALPMAQQL QKKMSYEDAR NWLTDYPRHA AAGLLPVALG KKGKDRDCAR QALRLLVNLN QRETIEEIAQ GYNQPDVLAA LATLFDSDPL EEYPAKIAPL PGFYQFTLWR RPRLKSNNLP LSDDAMRHLG TMLSFPRDIT AYAGLDIIRE IFTRESLAEF GWDLYPAWTE AGAPAKENWA FTSLGILGND DTARKLTPLI RAWPGESQHK RAVSGLDVLA DIGSDVALML LNGIARKIKF KALQEHAREK INIVAENRGL TMAELEDRLA PDLGLDSSGS LILDFGPRKF TVGFDETLKP VVCDANGKVL KDLPKPNQSD EKTQATDAVN LFKQLKKDVR AIASQQIDRL EQAMCQRRRW TAEQFRLFLV EHPLVRHLTR RLLWGVYNDE NALITCFRVA EDSTYSDAQD ELFTLPAGNI GIPHVLEISP ESAAAFRQIY ADYELLPPFQ QLERGSYHLA DNERNTHELT RWQGRLCQAG RIVGLERRGW QRLEESGSVY AMRKTTPHGD LELETEPFSL IYGETGYGDQ HPVESVKITS PDDRYGKQSS LTFSMLDDIT ASELINDIES LFD
|
| |