Gene EcHS_A4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4467 
SymbolcpdB 
ID5594053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4469775 
End bp4471718 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content49% 
IMG OID640923565 
Productbifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein 
Protein accessionYP_001461006 
Protein GI157163688 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG 
ACGGTCGATC TGCGTATCAT GGAAACCACT GATCTGCATA GCAACATGAT GGATTTCGAT
TATTACAAAG ACACCGCCAC GGAAAAGTTC GGACTGGTAC GCACCGCAAG CCTGATTAAC
GATGCCCGCA ATGAAGTCAA AAATAGCGTG CTGGTGGATA ACGGCGACTT GATCCAGGGA
AGTCCGCTGG CCGATTACAT GTCGGCGAAA GGATTAAAAG CCGGAGATAT TCACCCGGTC
TATAAGGCAT TAAATACGCT GGACTATACC GTCGGCACGC TTGGCAACCA CGAGTTTAAC
TACGGTCTGG ATTACCTGAA AAATGCGTTG GCAGGAGCGA AATTCCCTTA TGTAAATGCC
AACGTCATTG ACGCAAAAAC CAAACAGCCC ATGTTTACAC CGTATTTAAT TAAAGACACC
GAAGTGGTCG ATAAAGACGG AAAAAAACAG ACGTTAAAGA TTGGCTATAT TGGCGTCGTG
CCGCCGCAGA TTATGGGGTG GGATAAAGCC AATTTGTCCG GCAAAGTCAC CGTTAACGAT
ATAACAGAAA CTGTGCGCAA ATACGTGCCT GAAATGCGTG AGAAAGGTGC CGATCTCGTT
GTCGTTCTGG CGCATTCCGG GCTGTCTGCC GATCCGTATA AAGTGATGGC GGAAAACTCA
GTTTATTATC TCAGTGAAAT TCCGGGTGTT GACGCCATTA TGTTTGGCCA TGCTCACGCC
GTTTTCCCAA GTAAAGATTT TGCTGATATC GAAGGAGCTG ATATCGCGAA AGGCACGCTG
AATGGTGTTC CGGCGGTAAT GCCAGGCATG TGGGGCGATC ATCTTGGTGT GGTCGACTTA
CAACTCAGTA ATGACAGCGG TAAATGGCAG GTGACGCAGG CGAAAGCGGA AGCACGACCG
ATTTACGACA TCGCTAATAA AAAATCCCTC GCGGCGGAAG ACAGCAAGCT GGTAGAAACA
CTCAAAGCCG ATCACGATGC CACACGCCAG TTCGTCAGCA AGCCAATCGG TAAATCTGCC
GACAATATGT ATAGCTATCT GGCACTGGTG CAGGACGATC CGACCGTGCA GGTGGTGAAC
AACGCGCAAA AAGCGTATGT CGAGCATTAC ATTCAGAGCG ATCCGGATCT GGCAAAACTG
CCAGTGCTTT CAGCTGCCGC ACCGTTTAAA GTTGGTGGTC GCAAAAATGA CCCGGCAAGC
TATGTGGAGG TGGAAAAAGG CCAGTTGACC TTCCGTAATG CCGCCGATCT TTATCTCTAC
CCCAATACGC TGATTGTGGT GAAAGCCAGC GGTAAAGAGG TGAAAGAGTG GCTGGAATGC
TCTGCCGGAC AGTTTAACCA GATTGATCCT AACAGCACGA AACCACAGTC ACTCATTAAC
TGGGATGGTT TCCGCACTTA TAACTTTGAT GTGATTGATG GTGTGAATTA TCAGATTGAT
GTTACCCAAC CCGCCCGTTA TGACGGCGAG TGCCAGATGA TTAATGCCAA TGCGGAAAGG
ATTAAGAACC TGACCTTTAA CGGCAAGCCG ATTGATCCGA ACGCCATGTT CCTCGTTGCC
ACCAATAACT ATCGCGCTTA CGGCGGCAAA TTTGCCGGTA CGGGCGACAG CCATATCGCT
TTTGCTTCAC CGGATGAGAA CCGCTCGGTG CTGGCAGCGT GGATTGCTGA TGAGTCGAAA
CGTGCGGGGG AAATTCACCC GGCGGCAGAT AACAACTGGC GTTTAGCACC GATAGCCGGC
GATAAGAAAC TGGATATCCG TTTCGAAACT TCCCCGTCAG ATAAAGCCGC AGCGTTTATT
AAAGAGAAAG GGCAATATCC GATGAATAAA GTCGCGACCG ATGATATCGG GTTTGCGATT
TATCAGGTGG ATTTGAGTAA GTAA
 
Protein sequence
MIKFSATLLA TLIAASVNAA TVDLRIMETT DLHSNMMDFD YYKDTATEKF GLVRTASLIN 
DARNEVKNSV LVDNGDLIQG SPLADYMSAK GLKAGDIHPV YKALNTLDYT VGTLGNHEFN
YGLDYLKNAL AGAKFPYVNA NVIDAKTKQP MFTPYLIKDT EVVDKDGKKQ TLKIGYIGVV
PPQIMGWDKA NLSGKVTVND ITETVRKYVP EMREKGADLV VVLAHSGLSA DPYKVMAENS
VYYLSEIPGV DAIMFGHAHA VFPSKDFADI EGADIAKGTL NGVPAVMPGM WGDHLGVVDL
QLSNDSGKWQ VTQAKAEARP IYDIANKKSL AAEDSKLVET LKADHDATRQ FVSKPIGKSA
DNMYSYLALV QDDPTVQVVN NAQKAYVEHY IQSDPDLAKL PVLSAAAPFK VGGRKNDPAS
YVEVEKGQLT FRNAADLYLY PNTLIVVKAS GKEVKEWLEC SAGQFNQIDP NSTKPQSLIN
WDGFRTYNFD VIDGVNYQID VTQPARYDGE CQMINANAER IKNLTFNGKP IDPNAMFLVA
TNNYRAYGGK FAGTGDSHIA FASPDENRSV LAAWIADESK RAGEIHPAAD NNWRLAPIAG
DKKLDIRFET SPSDKAAAFI KEKGQYPMNK VATDDIGFAI YQVDLSK