Gene ECH74115_3177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3177 
Symbol 
ID6967414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2931219 
End bp2933663 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content49% 
IMG OID643386998 
Productexonuclease family protein 
Protein accessionYP_002271465 
Protein GI209395771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000796892 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000114787 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACTG ATAAAGAAGA AATTGCACTG TATTACGAAG CCAAAAATGA CAAAGTCAGA 
AAACGCCTTG GGATTAAAGG CGGTTTTTAC TGGCGCACAG CAAAAAAATT ATCGGTTGCA
ATATCACGGG GTGTTGTCGC AATGGACGAT GCTGGATTTG ACGAAGAGGA TTTCAAAAAA
CCTGTTCGCG TGAATTTGCC CATTGTTAAT GACCTGCCGC CTGAAGGTGT GTTCGATACT
GAATTCTGCA ACCGCTATGA AAAAGGCGGG GAAGATGGCA TCACAATGAT ATTTATAGCG
CCTTCCCCCT CAGTTCAGGA CAAACCAGCC AGCTCTGACA ATACCAACGT CAATGGCGAA
GACATGGCTG AGATTGAGGA TAATATGCTC CTGCCGATTT CCGGTCAGGA ACTGCCCATT
CGCTGGCTTG CGCAACATGG CAGCGAAAAA CCGGTAACGC ACGTTTCACG GGAAGAACTT
CAGGCATTAC ATATCGCACG AGCTGAAGAA CTGCCTGCTG TTACTGCCCT GGCTATTTCC
CACAACACAA AGCTGCTCGA CCCGCTGGAG ATTCGCGACC TTCACAAACT GGTACGCGAC
ACAGACAAAG TTTTCCCTAA TCCCGTTAAT TCCAGTCTGG GGTTAATGAC TGCTTTTTTC
GAAGCATACC TGGACGCTGA CTATACCGAT CGAGGTCTGC TGACAAAAGA GTGGATGAAA
GGAAATCGTG TTTTACGCAT CAGCCGCACG CCATCCGGCG CTAATGCTGG CGGAGGAATT
CTTACCGATC GCGGTGAAAG TTTTGTCCAC GATGATGCGT CAGTAGAACG TGACGTTGCC
GCTGGCGTTC TGGCCCGTTC AATGGACATC GATATTTACA ATCCACATCC GGCACACGCC
AAACGCATTG AAGAAATCGT TTCAGAGAAT AAGCCGCCCT TTTCTGTTTT TCGTGACAAA
TTCATCGCCA TGCCTGGTCA CCTGGATTAT TCCCGCGCGA TAGTGGTTGC GTCCGTGAAA
GAAGCACCAA TTGGTATCGA GGCTACTCCC CACCGTGTTA CCGAATATCT GAACAAAGTA
CTGACCGAAA CCGACCATGC CAACCCTGAT CCAGAAATCG TGGATATTGC CTGCGGTCGC
TCCTCTGCTC CAATGCCGCA GCGTGTAACA AAAGAAGGAA AACAGGATGA TGAAGAAAAA
CCGCAGCCAT CTGGCGCAAT GGCAGATGAA CAGGCAACGA CTGAAGCAGT GGAACCGGAT
ACAACTGAAC ATAATCAGGA CACGCAGTCG ATGGATGCTC AGCCACAGAT AAATTCTGTT
GATGCGAAAT ATCAGAAACT GCGGGCAGAA CTCCATGAAG CCCGGAAAAA CATTCCGCCC
CAAAATCCTG TCGATGCAGA CAAATTACTG GCTGCCTCTC GCGGAGAATT TGTTGAAGGG
ATTAGCGACC CGAATGATCC GAAATGGATT AAGGGGATCC AGACCCGCGA TTCTGTGTAC
CAGAATCAGC CAGAAACGGA ACAGAACGAC CAGAAAGCGG AACAGAACAG CCCAAATACG
CAACAAAACG AGCCAGAAAC GAAACAACCT GAACCAGTAG TGCAACAGGA ACCGGAAAAG
ATCTGCACCG CCTGCGGTCA GAGGAGTGGC GGCAACTGCC CTGATTGTGG CGCGGTGATG
GGCGACGCAA CATACCAGGA AACATTCGAT GACAAGAACC TGGTTGAAGT TCAGGAAGAC
GATTCGGAGA AAATGGAAGG CGCTGAACAT CCACACAAGG AGAATGCTGG CAGCGCTCAG
GACCACGCCA GCGATAGTGA AACTGGCGAG ACGGCAGATC CCTTAATTAC GGTGAACGGT
CATCGCATTA TCACATCCAC CAGCAGGACG TGTGACCATC TAATGATCGA CCTTGAAACC
ATGGGAAAAA ATCCTGATGC CCCGATTATC TCAATAGGTG CAATATTTTT CGATCCGCAA
ACCGGAGATA TGGGACCGGA ATTTAGTAAG ACTATCGATC TGGAAACTGC TGGCGGAGTC
ATTGATCGGG ACACCATTAA ATGGTGGCTT AAGCAATCAC GCGAAGCGCA ATCTGCCATT
ATGACCGATG AAATCCCGTT AGATGATGCA CTGTTACAAT TGCGGGAATT TATCGACGAA
AACTCCGGTG AATTTTTTGT TCAGGTCTGG GGAAATGGAG CCAACTTCGA CAACACGATT
TTGCGCCGTT CATACGAACG GCAGGGGATC CCCTGCCCGT GGCGTTACTA CAACGATCGC
GATGTACGCA CAATCGTTGA GCTGGGGAAA GCCATAGACT TCGATGCCAG AACGGCTATT
CCATTCGAAG GTGAGCGCCA CAATGCGCTG GATGACGCTC GTTACCAGGC AAAATACGTT
TCAGCTATCT GGCAAAAACT GATCCCGAAT CAGGCTGATT TTTAA
 
Protein sequence
MSTDKEEIAL YYEAKNDKVR KRLGIKGGFY WRTAKKLSVA ISRGVVAMDD AGFDEEDFKK 
PVRVNLPIVN DLPPEGVFDT EFCNRYEKGG EDGITMIFIA PSPSVQDKPA SSDNTNVNGE
DMAEIEDNML LPISGQELPI RWLAQHGSEK PVTHVSREEL QALHIARAEE LPAVTALAIS
HNTKLLDPLE IRDLHKLVRD TDKVFPNPVN SSLGLMTAFF EAYLDADYTD RGLLTKEWMK
GNRVLRISRT PSGANAGGGI LTDRGESFVH DDASVERDVA AGVLARSMDI DIYNPHPAHA
KRIEEIVSEN KPPFSVFRDK FIAMPGHLDY SRAIVVASVK EAPIGIEATP HRVTEYLNKV
LTETDHANPD PEIVDIACGR SSAPMPQRVT KEGKQDDEEK PQPSGAMADE QATTEAVEPD
TTEHNQDTQS MDAQPQINSV DAKYQKLRAE LHEARKNIPP QNPVDADKLL AASRGEFVEG
ISDPNDPKWI KGIQTRDSVY QNQPETEQND QKAEQNSPNT QQNEPETKQP EPVVQQEPEK
ICTACGQRSG GNCPDCGAVM GDATYQETFD DKNLVEVQED DSEKMEGAEH PHKENAGSAQ
DHASDSETGE TADPLITVNG HRIITSTSRT CDHLMIDLET MGKNPDAPII SIGAIFFDPQ
TGDMGPEFSK TIDLETAGGV IDRDTIKWWL KQSREAQSAI MTDEIPLDDA LLQLREFIDE
NSGEFFVQVW GNGANFDNTI LRRSYERQGI PCPWRYYNDR DVRTIVELGK AIDFDARTAI
PFEGERHNAL DDARYQAKYV SAIWQKLIPN QADF