Gene EcSMS35_4692 details

Gene Information

Locus tag: EcSMS35_4692
Symbol: cpdB
ID: 6145344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4790565
End bp: 4792508
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 49%
IMG OID: 641619508
Product: bifunctional 2',3'-cyclic nucleotide 2'-phosphodiesterase/3'-nucleotidase periplasmic precursor protein
Protein accession: YP_001746616
Protein GI: 170683745
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID: [TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase
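
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, and the helper names `span_length` and `coding_codons` are hypothetical, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4790565
end_bp = 4792508
gene_length_bp = 1944
protein_length_aa = 647


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under those assumptions the four fields agree with each other: 4792508 - 4790565 + 1 = 1944 bp, and 1944 / 3 - 1 = 647 aa.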


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG 
ACGGTCGATC TGCGTATCAT GGAAACCACT GATCTGCATA GCAACATGAT GGATTTCGAT
TATTACAAAG ACACCGCCAC GGAAAAATTC GGACTGGTAC GTACGGCAAG CCTGATTAAT
GATGCCCGCA ATGAAGTGAA AAACAGCGTA CTGGTCGATA ACGGCGATTT GATTCAGGGG
AGTCCGCTGG CAGATTACAT GTCGGCGAAA GGATTAAAAG CAGGTGATAT TCACCCGGTC
TATAAGGCAT TAAATACGCT GGACTATACC GTCGGCACGC TTGGCAACCA TGAGTTTAAC
TACGGTCTGG ATTACCTGAA AAATGCGCTG GCGGGAGCGA AATTCCCTTA TGTAAATGCC
AACGTCATTG ACGCCAGAAC CAAACAGCCA ATGTTTACAC CGTATTTAAT TAAAGACACC
GAAGTGGTCG ATAAAGACGG AAAAAAACAG ACGCTGAAGA TTGGCTATAT TGGCGTCGTA
CCGCCGCAAA TCATGGGCTG GGATAAAGCT AATTTATCCG GGAAAGTGAC GGTGAATGAT
ATTACCGAAA CCGTGCGCAA ATACGTGCCT GAAATGCGCG AGAAAGGTGC CGATGTCGTT
GTCGTTCTGG CGCATTCCGG GCTGTCTGCC GATCCGTATA AAGTGATGGC GGAAAACTCA
GTTTATTACC TCAGTGAAAT TCCGGGCGTT AACGCCATTA TGTTTGGCCA TGCTCACGCC
GTTTTCCCGG GTAAAGATTT TGCTGATATC GAAGGAGCTG ATATCGCGAA AGGCACGCTG
AATGGTGTTC CGGCGGTAAT GCCAGGCATG TGGGGCGATC ATCTTGGTGT GGTCGACTTA
CAACTCAGTA ATGACAGCGG TAAATGGCAG GTAACACAGG CGAAAGCGGA AGCACGACCG
ATTTACGACA TCGCTAATAA AAAATCCCTC GCGGCGGAAG ACAGCAAGCT GGTAGAAACA
CTCAAAGCCG ATCACGATGC CACACGCCAG TTCGTCAGCA AGCCAATCGG TAAATCTGCC
GACAATATGT ATAGCTATCT GGCACTGGTG CAGGACGATC CGACCGTGCA GGTGGTGAAC
AACGCGCAAA AAGCGTATGT CGAGCATTAC ATTCAGGGCG ATCCGGATCT GGCAAAACTG
CCAGTGCTTT CAGCTGCCGC ACCGTTTAAA GTTGGTGGTC GCAAAAATGA CCCGGCAAGC
TATGTGGAGG TGGAAAAAGG TCAGCTGACT TTCCGTAATG CCGCCGATCT TTATCTCTAC
CCCAATACGC TGATTGTGGT GAAAGCCAGC GGTAAAGAGG TGAAAGAGTG GCTGGAATGC
TCTGCCGGAC AGTTTAACCA GATTGATCCT AACAGCACGA AACCACAGTC ACTCATTAAC
TGGGATGGTT TCCGCACTTA TAACTTTGAT GTGATTGATG GTGTGAATTA TCAGATTGAT
GTTACCCAGC CCGCCCGTTA TGACGGCGAG TGCCAGATGA TTAATGCCAA TGCGGAAAGG
ATTAAGAACC TGACCTTTAA CGGCAAGCCG ATTGATCCGA ACGCCATGTT CCTCGTTGCC
ACCAATAACT ATCGCGCTTA CGGCGGCAAA TTTGCCGGTA CGGGCGACAG CCATATCGCT
TTTGCTTCAC CGGATGAGAA CCGCTCGGTG CTGGCAGCGT GGATTGCTGA TGAGTCGAAA
CGTGCGGGGG AAATTCACCC GGCGGCAGAT AACAACTGGC GTTTAGCACC GATAGCCGGC
GATAAGAAAC TGGATATCCG TTTCGAAACC TCTCCGTCAG ATAAAGCCGC AGCGTTTATT
AAAGAAAAAG GGCAGTATCC GATGAATAAA GTCGCGACCG ATGATATCGG GTTTGCAATT
TATCAGGTGG ATCTGAGTAA GTAA
 
Protein sequence
MIKFSATLLA TLIAASVNAA TVDLRIMETT DLHSNMMDFD YYKDTATEKF GLVRTASLIN 
DARNEVKNSV LVDNGDLIQG SPLADYMSAK GLKAGDIHPV YKALNTLDYT VGTLGNHEFN
YGLDYLKNAL AGAKFPYVNA NVIDARTKQP MFTPYLIKDT EVVDKDGKKQ TLKIGYIGVV
PPQIMGWDKA NLSGKVTVND ITETVRKYVP EMREKGADVV VVLAHSGLSA DPYKVMAENS
VYYLSEIPGV NAIMFGHAHA VFPGKDFADI EGADIAKGTL NGVPAVMPGM WGDHLGVVDL
QLSNDSGKWQ VTQAKAEARP IYDIANKKSL AAEDSKLVET LKADHDATRQ FVSKPIGKSA
DNMYSYLALV QDDPTVQVVN NAQKAYVEHY IQGDPDLAKL PVLSAAAPFK VGGRKNDPAS
YVEVEKGQLT FRNAADLYLY PNTLIVVKAS GKEVKEWLEC SAGQFNQIDP NSTKPQSLIN
WDGFRTYNFD VIDGVNYQID VTQPARYDGE CQMINANAER IKNLTFNGKP IDPNAMFLVA
TNNYRAYGGK FAGTGDSHIA FASPDENRSV LAAWIADESK RAGEIHPAAD NNWRLAPIAG
DKKLDIRFET SPSDKAAAFI KEKGQYPMNK VATDDIGFAI YQVDLSK
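
The gene and protein sequences in this record can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code for elongation. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG"
last_row = "TATCAGGTGG ATCTGAGTAA GTAA"

print(translate(first_row))  # MIKFSATLLATLIAASVNAA
print(translate(last_row))   # YQVDLSK
```

The two outputs match the first 20 and the last 7 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over the whole protein.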