Gene ECH74115_1822 details


Gene Information

Locus tag: ECH74115_1822
Symbol:
ID: 6968323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1738865
End bp: 1741180
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 49%
IMG OID: 643385760
Product: exonuclease family protein
Protein accession: YP_002270250
Protein GI: 209400511
COG category:
COG ID:
TIGRFAM ID:
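
The length fields in this record follow from the coordinates, and a short check makes the arithmetic explicit. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive, which the record does not state but which is consistent with the listed gene length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1738865
end_bp = 1741180

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# The final codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under that assumption the computed values agree with the Gene Length (2316 bp) and Protein Length (771 aa) fields.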


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0267479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000385953
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACTG ATAAAGAAGA AATTGCACTG TATTACGAAG CCAAAAATGA CAAAGTCAGA 
AAACGCCTTG GGATTAAAGG CGGTTTTTAC TGGCGCACAG CAAAAAAATT ATCGGTTGCA
ATATCACGGG GTGTTGTCGC AATGGACGAT GCTGGATTTG ACGAAGAGGA TTTCAAAAAA
CCTGTTCGCG TGAATTTGCC CATTGTTAAT GACCTGCCGC CTGAAGGTGT GTTCGATACT
GAATTCTGCA ACCGCTATGA AAAAGGCGGG GAAGATGGCA TCACAATGAT ATTTATAGCG
CCTTCCCCCT CAGTTCAGGA CAAACCAGCC AGCTCTGACA ATACCAACGT CAATGGCGAA
GACATGGCTG AGATTGAGGA TAATATGCTC CTGCCGATTT CCGGTCAGGA ACTGCCCATT
CGCTGGCTTG CGCAACATGG CAGCGAAAAA CCGGTAACGC ACGTTTCACG GGAAGAACTT
CAGGCATTAC ATATCGCACG AGCTGAAGAA CTGCCTGCTG TTACTGCCCT GGCTATTTCC
CACAACACAA AGCTGCTCGA CCCGCTGGAG ATTCGCGACC TTCACAAACT GGTACGCGAC
ACAGACAAAG TTTTCCCTAA TCCCGTTAAT TCCAGTCTGG GGTTAATGAC TGCTTTTTTC
GAAGCATACC TGGACGCTGA CTATACCGAT CGAGGTCTGC TGACAAAAGA GTGGATGAAA
GGAAATCGTG TTTTACGCAT CAGCCGCACG CCATCCGGCG CTAATGCTGG CGGAGGAATT
CTTACCGATC GCGGTGAAAG TTTTGTCCAC GATGATGCGT CAGTAGAACG TGACGTTGCC
GCTGGCGTTC TGGCCCGTTC AATGGACATC GATATTTACA ATCCACATCC GGCACACGCC
AAACGCATTG AAGAAATCGT TTCAGAGAAT AAGCCGCCCT TTTCTGTTTT TCGTGACAAA
TTCATCGCCA TGCCTGGTCA CCTGGATTAT TCCCGCGCGA TAGTGGTTGC GTCCGTGAAA
GAAGCACCAA TTGGTATCGA GGCTACTCCC CACCGTGTTA CCGAATATCT GAACAAAGTA
CTGACCGAAA CCGACCATGC CAACCCTGAT CCAGAAATCG TGGATATTGC CTGCGGTCGC
TCCTCTGCTC CAATGCCGCA GCGTGTAACA AAAGAAGGAA AACAGGATGA TGAAGAAAAA
CCGCAGCCAT CTGGCGCAAT GGCAGATGAA CAGGCAACGA CTGAAGCAGT GGAACCGGAT
ACAACTGAAC ATAATCAGGA CACGCAGTCG ATGGATGCTC AGCCACAGAT AAATTCTGTT
GATGCGAAAT ATCAGAAACT GCGGGCAGAA CTCCATGAAG CCCGGAAAAA CATTCCGCCC
CAAAATCCTG TCGATGCAGA CAAATTACTG GCTGCCTCTC GCGGAGAATT TGTTGAAGGG
ATTAGCGACC CGAATGATCC GAAATGGATT AAGGGGATCC AGACCCGCGA TTCTGTGTAC
CAGAATCAGC CAGAAACGGA ACAGAACGAC CAGAAAGCGG AACAGAACAG CCCAAATACG
CAACAAAACG AGCCAGAAAC GAAACAACCT GAACCAGTAG TGCAACAGGA ACCGGAAAAG
ATCTGCACCG CCTGCGGTCA GAGGAGTGGC GGCAACTGCC CTGATTGTGG CGCGGTGATG
GGCGACGCAA CATACCAGGA AACATTCGAT GACAAGAACC TGGTTGAAGT TCAGGAAGAC
GATTCGGAGA AAATGGAAGG CGCTGAACAT CCACACAAGG AGAATGCTGG CAGCGCTCAG
GACCACGCCA GCGATAGTGA AACTGGCGAG ACGGCAGATC CCTTAATTAC GGTGAACGGT
CATCGCATTA TCACATCCAC CAGCAGGACG TGTGACCATC TAATGATCGA CCTTGAAACC
ATGGGAAAAA ATCCTGATGC CCCGATTATC TCAATAGGTG CAATATTTTT CGATCCGCAA
ACCGGAGATA TGGGACCGGA ATTTAGTAAG ACTATCGATC TGGAAACTGC TGGCGGAGTC
ATTGATCGGG ACACCATTAA ATGGTGGCTT AAGCAATCAC GCGAAGCGCA ATCTGCCATT
ATGACCGATG AAATCCCGTT AGATGATGCA CTGTTACAAT TGCGGGAATT TATCGACGAA
AACTCCGGTG AATTTTTTGT TCAGGTCTGG GGAAATGGAG CCAACTTCGA CAACACGATT
TTGCGCCGTT CATACGAACG GCAGGGATCC CCTGCCCGTG GCGTTACTAC AACGATCGCG
ATGTACGCAC AATCGTTGAG CTGGGGAAAG CCATAG
 
Protein sequence
MSTDKEEIAL YYEAKNDKVR KRLGIKGGFY WRTAKKLSVA ISRGVVAMDD AGFDEEDFKK 
PVRVNLPIVN DLPPEGVFDT EFCNRYEKGG EDGITMIFIA PSPSVQDKPA SSDNTNVNGE
DMAEIEDNML LPISGQELPI RWLAQHGSEK PVTHVSREEL QALHIARAEE LPAVTALAIS
HNTKLLDPLE IRDLHKLVRD TDKVFPNPVN SSLGLMTAFF EAYLDADYTD RGLLTKEWMK
GNRVLRISRT PSGANAGGGI LTDRGESFVH DDASVERDVA AGVLARSMDI DIYNPHPAHA
KRIEEIVSEN KPPFSVFRDK FIAMPGHLDY SRAIVVASVK EAPIGIEATP HRVTEYLNKV
LTETDHANPD PEIVDIACGR SSAPMPQRVT KEGKQDDEEK PQPSGAMADE QATTEAVEPD
TTEHNQDTQS MDAQPQINSV DAKYQKLRAE LHEARKNIPP QNPVDADKLL AASRGEFVEG
ISDPNDPKWI KGIQTRDSVY QNQPETEQND QKAEQNSPNT QQNEPETKQP EPVVQQEPEK
ICTACGQRSG GNCPDCGAVM GDATYQETFD DKNLVEVQED DSEKMEGAEH PHKENAGSAQ
DHASDSETGE TADPLITVNG HRIITSTSRT CDHLMIDLET MGKNPDAPII SIGAIFFDPQ
TGDMGPEFSK TIDLETAGGV IDRDTIKWWL KQSREAQSAI MTDEIPLDDA LLQLREFIDE
NSGEFFVQVW GNGANFDNTI LRRSYERQGS PARGVTTTIA MYAQSLSWGK P
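
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the excerpt uses only the first and last lines of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Excerpt only: the first and last lines of the gene sequence above.
excerpt = """
ATGAGCACTG ATAAAGAAGA AATTGCACTG TATTACGAAG CCAAAAATGA CAAAGTCAGA
ATGTACGCAC AATCGTTGAG CTGGGGAAAG CCATAG
"""

seq = clean_sequence(excerpt)

# A complete CDS is expected to begin with a start codon and end with a stop codon.
starts_with_atg = seq.startswith("ATG")
ends_with_stop = seq[-3:] in {"TAA", "TAG", "TGA"}

print(len(seq), starts_with_atg, ends_with_stop, round(gc_percent(excerpt), 1))
```

Passing the full gene sequence to `gc_percent` instead of the excerpt would give a figure to compare against the GC content listed under Gene Information.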