Gene B21_04048 details

Gene Information

Locus tag: B21_04048
Symbol: cpdB
ID: 8116676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4346073
End bp: 4348016
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 49%
IMG OID: 644850198
Product: hypothetical protein
Protein accession: YP_003001771
Protein GI: 251787467
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID: [TIGR01390] 2',3'-cyclic-nucleotide 2'-phosphodiesterase
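
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4346073
end_bp = 4348016
gene_length_bp = 1944
protein_length_aa = 647

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span == gene_length_bp)
print(expected_protein_length, expected_protein_length == protein_length_aa)
```

Under these assumptions the span is 1944 bp, which is 648 codons, giving 647 amino acids plus one stop codon.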


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.897161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG 
ACGGTCGATC TACGTATCAT GGAAACCACT GATCTGCATA GCAACATGAT GGATTTCGAT
TATTACAAAG ACACCGCCAC GGAAAAATTC GGACTGGTAC GTACGGCAAG CCTGATTAAC
GATGCCCGCA ATGAAGTGAA AAACAGCGTA CTGGTCGATA ACGGCGATTT GATTCAGGGG
AGTCCGCTGG CCGATTACAT ATCGGCGAAA GGATTAAAAG CAGGTGATGT TCATCCGGTT
TATAAGGCGC TGAATACGCT GGATTATACG GTCGGTACAC TCGGCAATCA TGAATTTAAC
TACGGTCTGG ATTACCTGAA AAATGCGTTG GCGGGAGCGA AATTCCCTTA TGTAAATGCC
AACGTCATTG ACGCCAGAAC CAAACAGCCA ATGTTTACAC CGTATTTAAT TAAAGATACC
GAAGTGGTCG ATAAAGACGG AAAAAAACAG ACGCTGAAGA TTGGCTATAT TGGCGTCGTG
CCGCCGCAAA TCATGGGCTG GGATAAAGCT AATTTATCCG GAAAAGTGAC GGTGAATGAT
ATTACCGAAA CCGTGCGCAA ATACGTGCCT GAAATGCGCG AGAAAGGTGC CGATGTCGTT
GTCGTTCTGG CGCATTCCGG GCTATCTGCC GATCCGTATA AAGTGATGGC GGAAAACTCA
GTTTATTACC TCAGTGAAAT TCCTGGCGTT AACGCCATTA TGTTTGGCCA TGCTCACGCC
GTTTTCCCAG GTAAAGATTT TGCTGATATC GAAGGGGCTG ATATCGCGAA AGGCACGCTG
AATGGTGTTC CGGCGGTAAT GCCGGGCATG TGGGGCGATC ATCTTGGGGT GGTCGACTTA
CAACTCAGTA ATAACAGCGG TAAATGGCAG GTGACGCAGG CGAAAGCGGA AGCACGGCCG
ATTTACGACA TCGCCAATAA AAAATCCCTC GCGGCGGAAG ACAGCAAGCT GGTAGAAACA
CTCAAAGCCG ATCACGATGC CACACGCCAG TTCGTCAGCA AGCCAATCGG TAAATCCGCC
GACAATATGT ATAGCTATCT GGCGCTGGTG CAGGACGATC CGACCGTGCA GGTGGTGAAC
AACGCGCAAA AAGCGTATGT CGAGCATTAC ATTCAGGGCG ATCCGGATCT GGCAAAACTG
CCGGTGCTTT CAGCTGCTGC ACCGTTTAAA GTTGGTGGTC GCAAAAATGA TCCGGCAAGC
TATGTGGAGG TGGAAAAAGG CCAGTTGACC TTCCGTAATG CCGCCGATCT TTATCTCTAC
CCCAATACGC TGATTGTGGT GAAAGCCAGC GGTAAAGAGG TGAAAGAGTG GCTGGAGTGC
TCCGCCGGAC AGTTTAACCA GATTGATCCT GACAACACGA AACCACAGTC ACTCATCAAC
TGGGATGGTT TCCGCACCTA TAACTTTGAT GTGATTGATG GTGTGAATTA TCAGATTGAT
GTTACCCAGC CTGCCCGTTA TGACGGCGAG TGCCAGATGG TTAATGCCAA TGCGGAAAGG
ATTAAGAACC TGACCTTTAA TGGCAAGCCG ATTGATCCGA ACGCCATGTT CCTGGTTGCC
ACCAATAACT ATCGCGCTTA CGGCGGCAAA TTTGCCGGTA CGGGCGACAG CCATATCGCT
TTTGCTTCAC CGGATGAGAA CCGCTCGGTG CTGGCAGCGT GGATTGCTGA TGAGTCGAAA
CGTGCGGGGG AAATTCACCC GGCGGCAGAT AACAACTGGC GTTTAGCACC GATAGCTGGC
GATAAGAAAC TGGATATCCG TTTCGAAACC TCTCCATCAG ATAAAGCCGC GGCGTTTATT
AAAGAGAAAG GGCAATATCC GATGAATAAA GTCGCGACCG ATGATATCGG GTTTGCGATT
TATCAGGTGG ATTTGAGTAA GTAA
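
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do this for a sequence printed in space-separated blocks of ten bases, as it is here; the function name `gc_content` is hypothetical, and only the first line of the sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Return the G+C fraction of a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGATTAAGT TTAGCGCAAC GCTCCTGGCC ACGCTGATTG CCGCCAGTGT GAATGCAGCG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence to the same function would give the value to compare against the reported GC content.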
 
Protein sequence
MIKFSATLLA TLIAASVNAA TVDLRIMETT DLHSNMMDFD YYKDTATEKF GLVRTASLIN 
DARNEVKNSV LVDNGDLIQG SPLADYISAK GLKAGDVHPV YKALNTLDYT VGTLGNHEFN
YGLDYLKNAL AGAKFPYVNA NVIDARTKQP MFTPYLIKDT EVVDKDGKKQ TLKIGYIGVV
PPQIMGWDKA NLSGKVTVND ITETVRKYVP EMREKGADVV VVLAHSGLSA DPYKVMAENS
VYYLSEIPGV NAIMFGHAHA VFPGKDFADI EGADIAKGTL NGVPAVMPGM WGDHLGVVDL
QLSNNSGKWQ VTQAKAEARP IYDIANKKSL AAEDSKLVET LKADHDATRQ FVSKPIGKSA
DNMYSYLALV QDDPTVQVVN NAQKAYVEHY IQGDPDLAKL PVLSAAAPFK VGGRKNDPAS
YVEVEKGQLT FRNAADLYLY PNTLIVVKAS GKEVKEWLEC SAGQFNQIDP DNTKPQSLIN
WDGFRTYNFD VIDGVNYQID VTQPARYDGE CQMVNANAER IKNLTFNGKP IDPNAMFLVA
TNNYRAYGGK FAGTGDSHIA FASPDENRSV LAAWIADESK RAGEIHPAAD NNWRLAPIAG
DKKLDIRFET SPSDKAAAFI KEKGQYPMNK VATDDIGFAI YQVDLSK
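
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the compact amino-acid string for that table in T, C, A, G base order and translates until the first stop codon. It assumes the sequence starts in frame and does not handle the alternative start codons that table 11 allows; the function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATTAAGT TTAGCGCAAC GCTCCTGGCC"))  # MIKFSATLLA
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above.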