Gene ECH74115_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0320 
Symbol 
ID6967729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp323438 
End bp325771 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content52% 
IMG OID643384381 
Productputative nucleoside triphosphatase, D5 family 
Protein accessionYP_002268896 
Protein GI209396448 
COG category[S] Function unknown 
COG ID[COG4643] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.382888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGAA TGAAAGTTAG CCAGGCTGAG AAAGCAGCTC GGGGTCACTG GTCAAGAATT 
TTACCTGCGC TGGGCGTAAA TGTACTGAAA AATCGGCACC AGCCCTGCCC GGTCTGTGCC
GGGAAAGACC GCTTTCGATT TGATGACCAG GAAGGGCGGG GAACGTGGTT CTGTAACCAG
TGCGGGGCAG GTGATGGCCT GGCGCTTGTA AGTAAAGTAC TGGATGTAGG CATTAGTGAA
GCGGCAGACA GAATAAACGG CATTATCGGA AACCTGCTGC CAGTATCTCA GGGAATGCTT
GAATCTGGTT CTCCTGAAAA AGAGGACGGG AGAAAAGCTG CAGCAGTGCT GGCTGCCCGT
TTGTTTGATA AGTCCCGCCA GACCACTGGC AATGCCTATC TGACGAGTAA AGGGTTTCCT
GCACTGCCTT GCCGGGAATT AACCGCTATG CATAAAGTCG GTGGTGTGGC ATTTCGCGCG
GGAGATCTTG TCGTTCCATT GTATGCAGAT GGAGAGCTGG TAAATCTGCA GTTAATCAAC
GCTAATGGGG GCAAATGCTT CCTTAAAGGC GGTCAGGTTA AGAATGCCTT TTACCTGGTT
GAAGGTACTG CCAAAGCAGC CAAACGGCTC TGGATAGCGG AAGGATATGC CACCGCACTT
ACTATCAACT ATCTGACTGG CGATGCTGTC ATGGTGGCCT TTTCGTCCGT CAATTTCCTT
TCCCTGGCGA GCATTGCCTG CAGTGAGTAC CCAACGCACC AGATAATTAT TGCTGCTGAC
CGCGATCTCA ACGGTGCGGG GCAAACAAGG GGCGCAGCTG TTACCGGGGC CTGCAATTGC
ACAATGGCGC TCCCGCCTGT GTTTGGTGAC TGGAACGATG CATTCACGCA AAACGGCGAA
GAAGCCACCC GGCATGCAAT TCATGAAGTA ATAAAACCAG CTGTTGCCAG CCCCTTCGAC
ACAATGAGCG AAGCTGAATT TACCGCGCTG AGCGTCAGCG AAAAAGCGCA GAGGGTAGTG
GATCACTATA AAAATTCACT GGCAGTAGAC CCGAACGGGC AGCTCCTTTC ACGCTATGAG
GCGGGGGCCT GGAAAGTTAT CTATTACGCC GATTTTGCCC GTGATGTCGC TGCGCTGTTT
CAGCGCCTCG ACGCACCTTT TTCATCCGCG AAAATTGCGT CTCTCGTGGA AACCCTCAAA
CTGATCGTTC CGCAACAGCA GAATCCGGCG CGGCAACTTA TCGGATTTCG CAACGGTGTG
CTCGATACCC GGACAGGATT GTTCAGCCCG CACGATAAGA AGCACTGGTT ACGTACGCTG
TGCGAGGTGG ATTACACGCA GCCCGTTGAC GGTGAGTCAC TGGAAACCCA TGCCCCGGCA
TTCTGGCGCT GGCTGGATCG TGCCGCAGGT TTTAATCCTG AAAAACGGGA CATTATTCTG
GCTGCATTGT TTATGGTGCT GGCTAACCGT TATGACTGGC AGCTGTTTCT GGAGGTCACT
GGCCCTGGCG GAAGTGGAAA GAGTATTCTT GCTGAAATAG CAACCATGCT GGCGGGTGAA
GATAACGCGA CCTCGGCAAC CATTGAAATG CTTGAGTCGC CAAGAGAACG AGCTGCGTTA
ATAGGTTTTT CACTGATTCG ACTTCCCGAC CAGGAAAAGT GGAGCGGTGA CGGGGCCGGA
CTAAAAGCCA TCACTGGCGG CGATGCGGTA TCCGTTGATC CCAAATATCA GAACGCCTAT
TCAACCCACA TCCCGGCGGT CATCCTGGCT GTGAACAATA ATCCGATGCG CTTCACTGAT
CGTAGTGGTG GAGTTTCACG CCGAAGGGTG ATCCTGCATT TCCCCGACCA GATAGCCCCG
GAGGAACGCG ATACCCAGCT CAAAGAGAAA ATTGCCAGTG AGCTAGCGGT GATTGTTCGC
CAGCTTATGC AGCGTTTCAG CGACCCAATG AGTGCCAGGA CATTGCTTCA GTCACAGCAG
AACTCCGATG AAGCGCTCAC CATCAAACGT GATGCTGATT CAGCTTTTGA TTTTTGCGGC
TACCTTGAGG TCCTACCTGA CACCACGGGC ATGTTTATGG GGAACGCTAA TATTGTTCCA
CGTCAGCCTC GAACTTACCT CTACCATGCC TATCTGGTCT ACATGGAGGC TAACGGCTAT
AAAAATACGC TCAGTCTGAC CATGTTTGGC AAGGGGCTAC CGTTAATGCT CAAGGAATAT
GGGCTGCAGT ATGAGAAACG ACGGACCAAT CAAGGAATGC AGACTAATCT GGCCCTAAGA
GAGGAAAGCA ATGCTGACTG GTTGCCAAAA TGCGATGAGT TTGCAGCGAA ATAA
 
Protein sequence
MSGMKVSQAE KAARGHWSRI LPALGVNVLK NRHQPCPVCA GKDRFRFDDQ EGRGTWFCNQ 
CGAGDGLALV SKVLDVGISE AADRINGIIG NLLPVSQGML ESGSPEKEDG RKAAAVLAAR
LFDKSRQTTG NAYLTSKGFP ALPCRELTAM HKVGGVAFRA GDLVVPLYAD GELVNLQLIN
ANGGKCFLKG GQVKNAFYLV EGTAKAAKRL WIAEGYATAL TINYLTGDAV MVAFSSVNFL
SLASIACSEY PTHQIIIAAD RDLNGAGQTR GAAVTGACNC TMALPPVFGD WNDAFTQNGE
EATRHAIHEV IKPAVASPFD TMSEAEFTAL SVSEKAQRVV DHYKNSLAVD PNGQLLSRYE
AGAWKVIYYA DFARDVAALF QRLDAPFSSA KIASLVETLK LIVPQQQNPA RQLIGFRNGV
LDTRTGLFSP HDKKHWLRTL CEVDYTQPVD GESLETHAPA FWRWLDRAAG FNPEKRDIIL
AALFMVLANR YDWQLFLEVT GPGGSGKSIL AEIATMLAGE DNATSATIEM LESPRERAAL
IGFSLIRLPD QEKWSGDGAG LKAITGGDAV SVDPKYQNAY STHIPAVILA VNNNPMRFTD
RSGGVSRRRV ILHFPDQIAP EERDTQLKEK IASELAVIVR QLMQRFSDPM SARTLLQSQQ
NSDEALTIKR DADSAFDFCG YLEVLPDTTG MFMGNANIVP RQPRTYLYHA YLVYMEANGY
KNTLSLTMFG KGLPLMLKEY GLQYEKRRTN QGMQTNLALR EESNADWLPK CDEFAAK