Gene EcolC_3793 details


Gene Information

Locus tag: EcolC_3793
Symbol: cpdB
ID: 6067232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4152158
End bp: 4154101
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 49%
IMG OID: 641603206
Product: bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein
Protein accession: YP_001726725
Protein GI: 170021771
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID: [TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase
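
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon that does not appear in the protein; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4152158
end_bp = 4154101
gene_length_bp = 1944
protein_length_aa = 647

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is the stop.
codons, remainder = divmod(span, 3)
translated_residues = codons - 1

print(span, remainder, translated_residues)  # 1944 0 647
```

Under these assumptions the reported gene length (1944 bp) and protein length (647 aa) agree with the coordinates.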


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.377475
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG 
ACGGTCGATC TACGTATCAT GGAAACCACT GATCTGCATA GCAACATGAT GGATTTCGAT
TATTACAAAG ACACCGCCAC GGAAAAATTC GGACTGGTAC GTACGGCAAG CCTGATTAAC
GATGCCCGCA ATGAAGTGAA AAACAGCGTA CTGGTCGATA ACGGCGATTT GATTCAGGGG
AGTCCGCTGG CCGATTACAT ATCGGCGAAA GGATTAAAAG CAGGTGATGT TCATCCGGTT
TATAAGGCGC TGAATACGCT GGATTATACG GTCGGTACAC TCGGCAATCA TGAATTTAAC
TACGGTCTGG ATTACCTGAA AAATGCGTTG GCGGGAGCGA AATTCCCTTA TGTAAATGCC
AACGTCATTG ACGCCAGAAC CAAACAGCCA ATGTTTACAC CGTATTTAAT TAAAGATACC
GAAGTGGTCG ATAAAGACGG AAAAAAACAG ACGCTGAAGA TTGGCTATAT TGGCGTCGTG
CCGCCGCAAA TCATGGGCTG GGATAAAGCT AATTTATCCG GAAAAGTGAC GGTGAATGAT
ATTACCGAAA CCGTGCGCAA ATACGTGCCT GAAATGCGCG AGAAAGGTGC CGATGTCGTT
GTCGTTCTGG CGCATTCCGG GCTATCTGCC GATCCGTATA AAGTGATGGC GGAAAACTCA
GTTTATTACC TCAGTGAAAT TCCTGGCGTT AACGCCATTA TGTTTGGCCA TGCTCACGCC
GTTTTCCCAG GTAAAGATTT TGCTGATATC GAAGGGGCTG ATATCGCGAA AGGCACGCTG
AATGGTGTTC CGGCGGTAAT GCCGGGCATG TGGGGCGATC ATCTTGGGGT GGTCGACTTA
CAACTCAGTA ATAACAGCGG TAAATGGCAG GTGACGCAGG CGAAAGCGGA AGCACGGCCG
ATTTACGACA TCGCCAATAA AAAATCCCTC GCGGCGGAAG ACAGCAAGCT GGTAGAAACA
CTCAAAGCCG ATCACGATGC CACACGCCAG TTCGTCAGCA AGCCAATCGG TAAATCCGCC
GACAATATGT ATAGCTATCT GGCGCTGGTG CAGGACGATC CGACCGTGCA GGTGGTGAAC
AACGCGCAAA AAGCGTATGT CGAGCATTAC ATTCAGGGCG ATCCGGATCT GGCAAAACTG
CCGGTGCTTT CAGCTGCTGC ACCGTTTAAA GTTGGTGGTC GCAAAAATGA TCCGGCAAGC
TATGTGGAGG TGGAAAAAGG CCAGTTGACC TTCCGTAATG CCGCCGATCT TTATCTCTAC
CCCAATACGC TGATTGTGGT GAAAGCCAGC GGTAAAGAGG TGAAAGAGTG GCTGGAGTGC
TCCGCCGGAC AGTTTAACCA GATTGATCCT GACAACACGA AACCACAGTC ACTCATCAAC
TGGGATGGTT TCCGCACCTA TAACTTTGAT GTGATTGATG GTGTGAATTA TCAGATTGAT
GTTACCCAGC CTGCCCGTTA TGACGGCGAG TGCCAGATGG TTAATGCCAA TGCGGAAAGG
ATTAAGAACC TGACCTTTAA TGGCAAGCCG ATTGATCCGA ACGCCATGTT CCTGGTTGCC
ACCAATAACT ATCGCGCTTA CGGCGGCAAA TTTGCCGGTA CGGGCGACAG CCATATCGCT
TTTGCTTCAC CGGATGAGAA CCGCTCGGTG CTGGCAGCGT GGATTGCTGA TGAGTCGAAA
CGTGCGGGGG AAATTCACCC GGCGGCAGAT AACAACTGGC GTTTAGCACC GATAGCTGGC
GATAAGAAAC TGGATATCCG TTTCGAAACC TCTCCATCAG ATAAAGCCGC GGCGTTTATT
AAAGAGAAAG GGCAATATCC GATGAATAAA GTCGCGACCG ATGATATCGG GTTTGCGATT
TATCAGGTGG ATTTGAGTAA GTAA
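
A GC content figure such as the one in the Gene Information fields is the percentage of bases that are G or C. The sketch below shows one way to compute it from a sequence laid out in blocks of ten, as above. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used as a small demonstration; how the database itself rounds the value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, used only as a demonstration input.
first_row = "ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Applying `gc_percent` to the full cleaned gene sequence would give the value to compare against the reported GC content.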
 
Protein sequence
MIKFSATLLA TLIAASVNAA TVDLRIMETT DLHSNMMDFD YYKDTATEKF GLVRTASLIN 
DARNEVKNSV LVDNGDLIQG SPLADYISAK GLKAGDVHPV YKALNTLDYT VGTLGNHEFN
YGLDYLKNAL AGAKFPYVNA NVIDARTKQP MFTPYLIKDT EVVDKDGKKQ TLKIGYIGVV
PPQIMGWDKA NLSGKVTVND ITETVRKYVP EMREKGADVV VVLAHSGLSA DPYKVMAENS
VYYLSEIPGV NAIMFGHAHA VFPGKDFADI EGADIAKGTL NGVPAVMPGM WGDHLGVVDL
QLSNNSGKWQ VTQAKAEARP IYDIANKKSL AAEDSKLVET LKADHDATRQ FVSKPIGKSA
DNMYSYLALV QDDPTVQVVN NAQKAYVEHY IQGDPDLAKL PVLSAAAPFK VGGRKNDPAS
YVEVEKGQLT FRNAADLYLY PNTLIVVKAS GKEVKEWLEC SAGQFNQIDP DNTKPQSLIN
WDGFRTYNFD VIDGVNYQID VTQPARYDGE CQMVNANAER IKNLTFNGKP IDPNAMFLVA
TNNYRAYGGK FAGTGDSHIA FASPDENRSV LAAWIADESK RAGEIHPAAD NNWRLAPIAG
DKKLDIRFET SPSDKAAAFI KEKGQYPMNK VATDDIGFAI YQVDLSK
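
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a minimal translator, assuming the amino-acid assignments of table 11 match the standard genetic code (the two tables differ in which start codons they allow, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (standard assignments,
# assumed here to be shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
gene_start = "ATGATTAAGT TTAGCGCAAC GCTCCTGGCC"
print(translate(gene_start))  # MIKFSATLLA
```

The output matches the first ten residues of the protein sequence. Passing the whole gene sequence to `translate` would be the way to compare against the full 647-residue protein.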