Gene B21_03966 details

Gene Information

Locus tag: B21_03966
Symbol: cadC
ID: 8113077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4265572
End bp: 4267110
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 42%
IMG OID: 644850118
Product: hypothetical protein
Protein accession: YP_003001691
Protein GI: 251787387
COG category: [K] Transcription
COG ID: [COG3710] DNA-binding winged-HTH domains
TIGRFAM ID:
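
The start and end coordinates, gene length and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 4265572
END_BP = 4267110

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1539 512
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.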


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.992437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT 
AGCCGCAATG GGCGTCAACT TACCCTTGAG CCGAGATTAA TCGATCTTCT GGTTTTCTTT
GCTCAACACA GTGGCGAAGT ACTTAGCAGG GATGAACTTA TCGATAATGT CTGGAAGAGA
AGTATTGTCA CCAATCACGT TGTGACGCAG AGTATCTCAG AACTACGTAA GTCATTAAAA
GATAATGATG AAGATAGTCC TGTCTATATC GCTACTGTAC CAAAGCGCGG CTATAAATTA
ATGGTGCCGG TTATCTGGTA CAGCGAAGAA GAGGGAGAGG AAATAATGCT ATCTTCGCCT
CCCCCTATAC CAGAGGCGGT TCCTGCCACA GATTCTCCCT CCCACAGTCT TAACATTCAA
AACACCGCAA CGCCACCTGA ACAATCCCCA GTTAAAAGCA AACGATTCAC TACCTTTTGG
GTATGGTTTT TTTTCCTGTT GTCGTTAGGT ATCTGTGTAG CACTGGTCGC GTTTTCAAGT
CTTGATACAC GTCTTCCGAT GAGCAAATCG CGTATTTTGC TCAATCCACG CGATATTGAC
ATTAATATGG TAAATAAAAG TTGTAACAGC TGGAGTTCCC CGTATCAGCT CTCTTACGCG
ATAGGCGTGG GTGATTTGGT GGCGACATCA CTTAACACCT TCTCCACCTT TATGGTGCAT
GACAAAATCA ACTACAACAT TGATGAACCG AGCAGTTCCG GTAAAACATT ATCTATTGCG
TTTGTTAATC AGCGCCAATA CCGTGCTCAA CAATGCTTTA TGTCGATAAA ATTGGTAGAC
AATGCAGATG GTTCAACCAT GCTGGATAAA CGTTATGTCA TCACTAACGG TAATCAGCTG
GCGATTCAAA ATGATTTACT GGAGAGTTTA TCAAAAGCGT TAAACCAACC GTGGCCACAA
CGAATGCAGG AGACGCTCCA GCAAATTTTG CCTCATCGTG GTGCGTTATT AACTAATTTT
TATCAGGCAC ATGATTATTT ACTGCATGGC GATGATAAAT CATTGAACCG TGCCAGTGAA
TTATTAGGTG AGATTGTTCA ATCATCCCCA GAATTTACCT ACGCGAGAGC AGAAAAAGCA
TTAGTTGATA TCGTGCGCCA TTCTCAACAT CCTTTAGATG AAAAACAATT AGCAGCACTG
AACACAGAAA TAGATAACAT TGTTACACTA CCGGAATTGA ATAACCTGTC CATTATATAT
CAAATAAAAG CGGTCAGTGC TCTGGTAAAA GGTAAAACAG ATGAGTCTTA CCAGGCGATA
AATACTGGCA TTGATCTTGA AATGTCCTGG CTAAATTATG TATTGCTTGG CAAGGTTTAT
GAAATGAAGG GGATGAACCG GGAAGCGGCT GATGCATATC TCACCGCCTT TAATTTACGC
CCAGGGGCAA ACACCCTTTA CTGGATTGAA AATGGTATAT TCCAGACTTC TGTTCCTTAT
GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA
 
Protein sequence
MQQPVVRVGE WLVTPSINQI SRNGRQLTLE PRLIDLLVFF AQHSGEVLSR DELIDNVWKR 
SIVTNHVVTQ SISELRKSLK DNDEDSPVYI ATVPKRGYKL MVPVIWYSEE EGEEIMLSSP
PPIPEAVPAT DSPSHSLNIQ NTATPPEQSP VKSKRFTTFW VWFFFLLSLG ICVALVAFSS
LDTRLPMSKS RILLNPRDID INMVNKSCNS WSSPYQLSYA IGVGDLVATS LNTFSTFMVH
DKINYNIDEP SSSGKTLSIA FVNQRQYRAQ QCFMSIKLVD NADGSTMLDK RYVITNGNQL
AIQNDLLESL SKALNQPWPQ RMQETLQQIL PHRGALLTNF YQAHDYLLHG DDKSLNRASE
LLGEIVQSSP EFTYARAEKA LVDIVRHSQH PLDEKQLAAL NTEIDNIVTL PELNNLSIIY
QIKAVSALVK GKTDESYQAI NTGIDLEMSW LNYVLLGKVY EMKGMNREAA DAYLTAFNLR
PGANTLYWIE NGIFQTSVPY VVPYLDKFLA SE
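
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is an illustration only: the helper names are hypothetical, the sequence is assumed to be in frame from its first base, and the codon table is the standard one, whose codon-to-amino-acid assignments are the same in translation table 11 (the tables differ in permitted start codons). It strips the 10-base grouping used in the listing before translating.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

# Map each codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing above.
first_line = "ATGCAACAAC CTGTAGTTCG CGTTGGCGAA TGGCTTGTTA CTCCGTCCAT AAACCAAATT"
last_line = "GTTGTACCTT ATCTCGACAA ATTTCTCGCT TCAGAATAA"

print(translate(first_line))  # MQQPVVRVGEWLVTPSINQI
print(translate(last_line))   # VVPYLDKFLASE
```

The two printed fragments match the start and the end of the protein sequence listed above. Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field.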