Gene ECH74115_5730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5730 
SymbolcpdB 
ID6969231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5365285 
End bp5367228 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content50% 
IMG OID643389363 
Productbifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein 
Protein accessionYP_002273756 
Protein GI209396574 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.122281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG 
ACGGTCGATC TGCGTATCAT GGAAACCACT GATCTGCATA GCAACATGAT GGATTTCGAT
TATTACAAAG ACACCGCCAC GGAAAAATTC GGACTGGTAC GTACGGCAAG CCTGATTAAC
AATGCCCGCA ATGAAGTGAA AAACAGCGTA CTGGTCGATA ACGGCGATTT GATTCAGGGG
AGTCCGCTGG CCGATTACAT GTCGGCGAAA GGATTAAAAG CAGGTGATAT TCACCCGGTC
TATAAGGCAT TAAATACGCT GGACTATACC GTCGGAACGC TTGGCAACCA CGAGTTTAAC
TACGGTCTGG ATTACCTGAA AAATGCGCTG GCAGGAGCGA AATTCCCTTA TGTAAATGCC
AACGTCATTG ACGCCAGAAC CAAACAGCCA ATGTTTACAC CGTATTTAAT TAAAGACACC
GAAGTGGTCG ATAAAGACGG AAAAAAACAG ACGCTGAAGA TTGGCTATAT TGGCGTCGTG
CCGCCGCAAA TCATGGGCTG GGATAAAGCT AATTTATCCG GAAAAGTGAC GGTGAATGAT
ATTACCGAAA CCGTGCGCAA ATACGAGCCT GAAATGCGCG AGAAAGGTGC CGATGTCGTT
GTCGTTCTGG CGCATTCCGG GCTGTCTGCC GATCCGTATA AAGTAATGGC GGAAAACTCA
GTTTATTACC TCAGTGAAAT TCCGGGCGTT AACGCCATTA TGTTTGGCCA TGCTCACGCC
GTTTTCCCGG GTAAAGATTT TGCTGATATC GAAGGGGCTG ATATCACCAA AGGCACGCTG
AATGGTGTTC CGGCGGTAAT GCCGGGCATG TGGGGCGATC ATCTTGGGGT GGTCGACTTA
CAACTCAGTA ATGACAGCGG TAAATGGCAG GTGACGCAGG CGAAAGCGGA AGCACGGCCG
ATTTACGACA TCGCCAATAA AAAATCCCTC GCGGCGGAAG ACAGCAAGCT GGTAGAAACA
CTCAAAGCCG ATCACGATGC CACACGCCAG TTCGTCAGCA AGCCAATCGG TAAATCTGCC
GACAATATGT ATAGCTATCT GGCGCTGGTG CAGGACGATC CGACCGTGCA AGTAGTGAAC
AACGCGCAAA AAGCGTATGT CGAGCATTAC ATTCAGGGCG ATCCGGATCT GGCAAAACTG
CCGGTGCTTT CAGCTGCCGC ACCGTTTAAA GTCGGTGGTC GCAAAAATGA CCCGGCAAGC
TATGTGGAGG TGGAAAAAGG CCAGTTGACC TTCCGTAATG CCGCCGATCT TTATCTCTAT
CCCAATACGC TGATTGTGGT GAAAGCCAGC GGTAAAGAGG TGAAAGAGTG GCTGGAGTGC
TCCGCCGGAC AGTTTAACCA GATTGATCCC AACAGCACGA AACCACAGTC ACTCATCAAC
TGGGATGGTT TCCGCACTTA TAACTTTGAT GTTATTGATG GTGTGAATTA TCAGATTGAT
GTTACCCAGC CCGCCCGTTA TGACGGCGAG TGCCAGATGA TTAATGCCAA TGCGGAAAGG
ATTAAGAACC TGACCTTTAA TGGCAAGCCG ATTGATCCGA ACGCTATGTT CCTCGTTGCC
ACCAATAACT ATCGCGCTTA CGGCGGCAAA TTTGCCGGGA CGGGCGACAG CCATATCGCT
TTTGCTTCAC CGGATGAGAA CCGCTCGGTG CTGGCAGCGT GGATTGCTGA TGAGTCGAAA
CGTGCGGGGG AAATTCACCC GGCGGCAGAT AACAACTGGC GTTTAGCACC GATAGCTGCC
GATAAGAAAC TGGATATCCG TTTCGAAACT TCCCCGTCAG ATAAAGCCGC AGCGTTTATT
AAAGAGAAAG GGCAGTATCC GATGAATAAA GTCGCGACCG ATGATATCGG GTTTGCGATT
TATCAGGTGG ATTTGAGTAA GTAA
 
Protein sequence
MIKFSATLLA TLIAASVNAA TVDLRIMETT DLHSNMMDFD YYKDTATEKF GLVRTASLIN 
NARNEVKNSV LVDNGDLIQG SPLADYMSAK GLKAGDIHPV YKALNTLDYT VGTLGNHEFN
YGLDYLKNAL AGAKFPYVNA NVIDARTKQP MFTPYLIKDT EVVDKDGKKQ TLKIGYIGVV
PPQIMGWDKA NLSGKVTVND ITETVRKYEP EMREKGADVV VVLAHSGLSA DPYKVMAENS
VYYLSEIPGV NAIMFGHAHA VFPGKDFADI EGADITKGTL NGVPAVMPGM WGDHLGVVDL
QLSNDSGKWQ VTQAKAEARP IYDIANKKSL AAEDSKLVET LKADHDATRQ FVSKPIGKSA
DNMYSYLALV QDDPTVQVVN NAQKAYVEHY IQGDPDLAKL PVLSAAAPFK VGGRKNDPAS
YVEVEKGQLT FRNAADLYLY PNTLIVVKAS GKEVKEWLEC SAGQFNQIDP NSTKPQSLIN
WDGFRTYNFD VIDGVNYQID VTQPARYDGE CQMINANAER IKNLTFNGKP IDPNAMFLVA
TNNYRAYGGK FAGTGDSHIA FASPDENRSV LAAWIADESK RAGEIHPAAD NNWRLAPIAA
DKKLDIRFET SPSDKAAAFI KEKGQYPMNK VATDDIGFAI YQVDLSK