Gene ECH74115_3696 details

Gene Information

Locus tag: ECH74115_3696
Symbol:
ID: 6970269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3414056
End bp: 3416071
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 55%
IMG OID: 643387490
Product: hypothetical protein
Protein accession: YP_002271943
Protein GI: 209397566
COG category: [R] General function prediction only
COG ID: [COG1444] Predicted P-loop ATPase fused to an acetyltransferase
TIGRFAM ID:
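
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Consistency check for the record's coordinates and lengths.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3414056, 3416071)
print(length)                     # 2016, matching "Gene Length"
print(protein_length_aa(length))  # 671, matching "Protein Length"
```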


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC 
CGCTTGCTGG TGTTGAGCGG GGAAGAGGGT TGGTGTTTTG ATCATGCGCT TAAGTTACGT
GATGCCTTAC CTGGCGACTG GCTGTGGATT TCGCCGCAGC CAGATGCTGA AAACCACTGT
TCTCCCTCGG CACTACAAAC TTTACTTGGG CGCGAGTTCC GGCATGCGGT ATTCGACGCC
CGCCACGGCT TTGATGCCGC TGCCTTTGCC GCACTTAGCG GAACGTTGAA AGCGGGAAGC
TGGCTGGTTT TGTTACTCCC TGTATGGGAA GAGTGGGAAA ACCAACCTGA TGCCGACTCG
CTGCGCTGGA GTGATTGCCC TGACCCTATT GCGACGCCGC ATTTTGTCCA GCATTTCAAA
CGCGTACTTA CGGCGGATAA CGACGCTATC TTCTGGCGGC AAAACCAGCC GTTCTCGTTG
GCGCATTTTA CTCCCCGTAC TGACTGGCAC CCCGCGACCG GCGCACCACA ACCAGAACAA
CAGCAAATCT TACAGCAGCT ACTGACCATG CCGTCGGGCG TGGCAGCGGT AACTGCTGCG
CGTGGGCGCG GTAAATCGGC GCTGGCAGGG CAACTCATTT CTCGTATTGC GGGTAGTGCG
ATTATCACTG CGCCCGCAAA AGCGGCAACG GATGTACTGG CACAATTTGC GGGCGAGAAG
TTTCGCTTTA TTGCGCCTGA TGCCTTGTTA GCCAGCGATG AGCAAGCCGA CTGGCTGGTG
GTCGATGAAG CCGCAGCCAT ACCTGCGCCG TTGTTGCATC AACTGGTATC GCGTTTTCCT
CGAACGTTGT TAACCACTAC GGTGCAGGGC TACGAAGGTA CCGGACGTGG TTTTTTGCTG
AAATTTTGCG CTCGCTTTCC GCATTTACAC CGTTTTGAAC TGCAACAGCC GATCCGCTGG
GCACAGGGAT GCCCGCTGGA AAAAATGGTT AGTGAGGCAC TGGTTTTTAA CGATGAAAAC
TTCACCCATA CACCACAAGG CAATATCGTC ATTTCCGCAT TTGAACAAAC GTTATGGCGA
AGCGAGCCAG AAACGCCGTT AAAGGTTTAT CAGTTATTGT CTGGTGCGCA CTACCGGACT
TCGCCGCTGG ATTTACGCCG CATGATGGAT GCACCAGGGC AACATTTTTT ACAGGCGGCT
GGCGAAAACG AGATTGCCGG AGCGCTGTGG CTGGTGGATG AGGGGGGATT ATCTCAAGAA
CTCAGTCAGG CGGTATGGGC AGGTTTTCGT CGCCCGCGGG GTAATCTGGT GGCCCAGTCG
CTGGCGGCGC ACGGCAGCAA TCCACTGGCG GCGACATTGC GTGGACGGCG GGTCAGCCGG
ATAGCAGTTC ATCCGGCGCG TCAGCGCGAA GGCGTTGGGC AACAGCTCAT TGCCAGCGCT
TTGCAATATA GGCCTGGCCT CGACTATCTT TCGGTGAGTT TTGGTTACAC CGGGGAGTTA
TGGCGTTTCT GGCAACGCTG CGGTTTTGTG CTGGTGCGAA TGGGTAATCA TCGTGAAGCC
AGCAGCGGTT GCTATACGGC GATGGCGCTG TTACCGATGA GTGATGCGGG TAAACAGCTG
GCTGAACGTG AGCATTACCG TTTACGTCGC GATGCGCAAG CTCTCGCGCA GTGGAATGGC
GAAACACTCC CTGTTGATCC ACTAAACAAT GCCGTCCTTT CTGACGACGA CTGGCTTGAA
CTGGCCGGTT TTGCTTTCGC TCATCGTCCG CTATTAACAT CGTTAGGTTG CTTATTGCGT
CTGCTACAAA CCAGTGAACT GGCATTACCG GCGCTGCGTG GGCGTTTACA GAAAAACGTC
AGCGACGCGC AGTTATGTAC CACACTTAAA CTTTCAGGCC GCAAGATGTT ACTGGTCCGT
CAGCGGGAAG AGGCCGCACA GGCGCTGTTC GCACTTAATG ATGTTCGCAC TGAGCGTCTG
CGCGATCGCA TAACGCAATG GCAATTTTTT CACTGA
 
Protein sequence
MAELTALHTL TAQMKREGIR RLLVLSGEEG WCFDHALKLR DALPGDWLWI SPQPDAENHC 
SPSALQTLLG REFRHAVFDA RHGFDAAAFA ALSGTLKAGS WLVLLLPVWE EWENQPDADS
LRWSDCPDPI ATPHFVQHFK RVLTADNDAI FWRQNQPFSL AHFTPRTDWH PATGAPQPEQ
QQILQQLLTM PSGVAAVTAA RGRGKSALAG QLISRIAGSA IITAPAKAAT DVLAQFAGEK
FRFIAPDALL ASDEQADWLV VDEAAAIPAP LLHQLVSRFP RTLLTTTVQG YEGTGRGFLL
KFCARFPHLH RFELQQPIRW AQGCPLEKMV SEALVFNDEN FTHTPQGNIV ISAFEQTLWR
SEPETPLKVY QLLSGAHYRT SPLDLRRMMD APGQHFLQAA GENEIAGALW LVDEGGLSQE
LSQAVWAGFR RPRGNLVAQS LAAHGSNPLA ATLRGRRVSR IAVHPARQRE GVGQQLIASA
LQYRPGLDYL SVSFGYTGEL WRFWQRCGFV LVRMGNHREA SSGCYTAMAL LPMSDAGKQL
AEREHYRLRR DAQALAQWNG ETLPVDPLNN AVLSDDDWLE LAGFAFAHRP LLTSLGCLLR
LLQTSELALP ALRGRLQKNV SDAQLCTTLK LSGRKMLLVR QREEAAQALF ALNDVRTERL
RDRITQWQFF H
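
The sequences above are printed in space-separated groups of ten. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated follows. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code. It does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG. All function names are hypothetical.

```python
# Sketch: clean a grouped sequence block, compute GC content, translate.
# The codon table is built from the conventional TCAG ordering; the
# assignments are those of the standard code, which table 11 shares.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence in this record.
first_groups = clean_sequence("ATGGCTGAAC TGACTGCGCT TCACACATTA")
print(translate(first_groups))  # MAELTALHTL
```

The translated prefix matches the first ten residues of the protein sequence listed in the record.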