Gene Information |
Locus tag | ECH74115_0320 |
Symbol | |
ID | 6967729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 323438 |
End bp | 325771 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384381 |
Product | putative nucleoside triphosphatase, D5 family |
Protein accession | YP_002268896 |
Protein GI | 209396448 |
COG category | [S] Function unknown |
COG ID | [COG4643] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01613] phage/plasmid primase, P4 family, C-terminal domain |
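
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustrative consistency check, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 323438
END_BP = 325771
GENE_LENGTH_BP = 2334
PROTEIN_LENGTH_AA = 777


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one stop codon ends the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(GENE_LENGTH_BP)

print(computed_length == GENE_LENGTH_BP)      # coordinates agree with gene length
print(computed_protein == PROTEIN_LENGTH_AA)  # gene length agrees with protein length
```

Under these assumptions both comparisons hold for the values listed in this record: 325771 - 323438 + 1 = 2334 bp, and 2334 / 3 - 1 = 777 aa.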
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.382888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCAGGAA TGAAAGTTAG CCAGGCTGAG AAAGCAGCTC GGGGTCACTG GTCAAGAATT TTACCTGCGC TGGGCGTAAA TGTACTGAAA AATCGGCACC AGCCCTGCCC GGTCTGTGCC GGGAAAGACC GCTTTCGATT TGATGACCAG GAAGGGCGGG GAACGTGGTT CTGTAACCAG TGCGGGGCAG GTGATGGCCT GGCGCTTGTA AGTAAAGTAC TGGATGTAGG CATTAGTGAA GCGGCAGACA GAATAAACGG CATTATCGGA AACCTGCTGC CAGTATCTCA GGGAATGCTT GAATCTGGTT CTCCTGAAAA AGAGGACGGG AGAAAAGCTG CAGCAGTGCT GGCTGCCCGT TTGTTTGATA AGTCCCGCCA GACCACTGGC AATGCCTATC TGACGAGTAA AGGGTTTCCT GCACTGCCTT GCCGGGAATT AACCGCTATG CATAAAGTCG GTGGTGTGGC ATTTCGCGCG GGAGATCTTG TCGTTCCATT GTATGCAGAT GGAGAGCTGG TAAATCTGCA GTTAATCAAC GCTAATGGGG GCAAATGCTT CCTTAAAGGC GGTCAGGTTA AGAATGCCTT TTACCTGGTT GAAGGTACTG CCAAAGCAGC CAAACGGCTC TGGATAGCGG AAGGATATGC CACCGCACTT ACTATCAACT ATCTGACTGG CGATGCTGTC ATGGTGGCCT TTTCGTCCGT CAATTTCCTT TCCCTGGCGA GCATTGCCTG CAGTGAGTAC CCAACGCACC AGATAATTAT TGCTGCTGAC CGCGATCTCA ACGGTGCGGG GCAAACAAGG GGCGCAGCTG TTACCGGGGC CTGCAATTGC ACAATGGCGC TCCCGCCTGT GTTTGGTGAC TGGAACGATG CATTCACGCA AAACGGCGAA GAAGCCACCC GGCATGCAAT TCATGAAGTA ATAAAACCAG CTGTTGCCAG CCCCTTCGAC ACAATGAGCG AAGCTGAATT TACCGCGCTG AGCGTCAGCG AAAAAGCGCA GAGGGTAGTG GATCACTATA AAAATTCACT GGCAGTAGAC CCGAACGGGC AGCTCCTTTC ACGCTATGAG GCGGGGGCCT GGAAAGTTAT CTATTACGCC GATTTTGCCC GTGATGTCGC TGCGCTGTTT CAGCGCCTCG ACGCACCTTT TTCATCCGCG AAAATTGCGT CTCTCGTGGA AACCCTCAAA CTGATCGTTC CGCAACAGCA GAATCCGGCG CGGCAACTTA TCGGATTTCG CAACGGTGTG CTCGATACCC GGACAGGATT GTTCAGCCCG CACGATAAGA AGCACTGGTT ACGTACGCTG TGCGAGGTGG ATTACACGCA GCCCGTTGAC GGTGAGTCAC TGGAAACCCA TGCCCCGGCA TTCTGGCGCT GGCTGGATCG TGCCGCAGGT TTTAATCCTG AAAAACGGGA CATTATTCTG GCTGCATTGT TTATGGTGCT GGCTAACCGT TATGACTGGC AGCTGTTTCT GGAGGTCACT GGCCCTGGCG GAAGTGGAAA GAGTATTCTT GCTGAAATAG CAACCATGCT GGCGGGTGAA GATAACGCGA CCTCGGCAAC CATTGAAATG CTTGAGTCGC CAAGAGAACG AGCTGCGTTA ATAGGTTTTT CACTGATTCG ACTTCCCGAC CAGGAAAAGT GGAGCGGTGA CGGGGCCGGA CTAAAAGCCA TCACTGGCGG CGATGCGGTA TCCGTTGATC CCAAATATCA GAACGCCTAT TCAACCCACA TCCCGGCGGT CATCCTGGCT GTGAACAATA ATCCGATGCG CTTCACTGAT 
CGTAGTGGTG GAGTTTCACG CCGAAGGGTG ATCCTGCATT TCCCCGACCA GATAGCCCCG GAGGAACGCG ATACCCAGCT CAAAGAGAAA ATTGCCAGTG AGCTAGCGGT GATTGTTCGC CAGCTTATGC AGCGTTTCAG CGACCCAATG AGTGCCAGGA CATTGCTTCA GTCACAGCAG AACTCCGATG AAGCGCTCAC CATCAAACGT GATGCTGATT CAGCTTTTGA TTTTTGCGGC TACCTTGAGG TCCTACCTGA CACCACGGGC ATGTTTATGG GGAACGCTAA TATTGTTCCA CGTCAGCCTC GAACTTACCT CTACCATGCC TATCTGGTCT ACATGGAGGC TAACGGCTAT AAAAATACGC TCAGTCTGAC CATGTTTGGC AAGGGGCTAC CGTTAATGCT CAAGGAATAT GGGCTGCAGT ATGAGAAACG ACGGACCAAT CAAGGAATGC AGACTAATCT GGCCCTAAGA GAGGAAAGCA ATGCTGACTG GTTGCCAAAA TGCGATGAGT TTGCAGCGAA ATAA
|
Protein sequence | MSGMKVSQAE KAARGHWSRI LPALGVNVLK NRHQPCPVCA GKDRFRFDDQ EGRGTWFCNQ CGAGDGLALV SKVLDVGISE AADRINGIIG NLLPVSQGML ESGSPEKEDG RKAAAVLAAR LFDKSRQTTG NAYLTSKGFP ALPCRELTAM HKVGGVAFRA GDLVVPLYAD GELVNLQLIN ANGGKCFLKG GQVKNAFYLV EGTAKAAKRL WIAEGYATAL TINYLTGDAV MVAFSSVNFL SLASIACSEY PTHQIIIAAD RDLNGAGQTR GAAVTGACNC TMALPPVFGD WNDAFTQNGE EATRHAIHEV IKPAVASPFD TMSEAEFTAL SVSEKAQRVV DHYKNSLAVD PNGQLLSRYE AGAWKVIYYA DFARDVAALF QRLDAPFSSA KIASLVETLK LIVPQQQNPA RQLIGFRNGV LDTRTGLFSP HDKKHWLRTL CEVDYTQPVD GESLETHAPA FWRWLDRAAG FNPEKRDIIL AALFMVLANR YDWQLFLEVT GPGGSGKSIL AEIATMLAGE DNATSATIEM LESPRERAAL IGFSLIRLPD QEKWSGDGAG LKAITGGDAV SVDPKYQNAY STHIPAVILA VNNNPMRFTD RSGGVSRRRV ILHFPDQIAP EERDTQLKEK IASELAVIVR QLMQRFSDPM SARTLLQSQQ NSDEALTIKR DADSAFDFCG YLEVLPDTTG MFMGNANIVP RQPRTYLYHA YLVYMEANGY KNTLSLTMFG KGLPLMLKEY GLQYEKRRTN QGMQTNLALR EESNADWLPK CDEFAAK