Gene B21_01459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01459 
Symbolunknown 
ID8114493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1518213 
End bp1520270 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content50% 
IMG OID644847698 
Producthypothetical protein 
Protein accessionYP_002999271 
Protein GI251784967 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAGTTTATT ACCTGGCGCT GGTACGGGAT GCCAGCGTAG AAATGGCGCA AAAAGAACAG 
ACCCGACAAT TGATTATTGC CGTTGACCAT CTCGACCGAC CGGTGATTGT CCTCGATCCG
GAACGCCATA TTGTGCAGTG CAATCGCGCA TTTACCGAAA TGTTTGGTTA CTGCATTAGC
GAAGCCAGCG GTATGCAGCC CGATACACTC CTGAACATTC CTGAATTCCC TGCCGATAAC
CGCATTCGTT TACAACAGTT GCTATGGAAA ACCGCCCGCG ATCAGGACGA ATTTCTGCTG
TTGACGCGCA CCGGTGAAAA AATCTGGATT AAAGCCTCTA TCAGCCCGGT TTATGACGTG
CTCGCGCATC TGCAGAACCT GGTAATGACT TTCTCGGATA TCACCGAAGA ACGGCAAATC
CGCCAGCTTG AAGGCAATAT TCTCGCCGCC ATGTGCAGCA GCCCGCCATT TCATGAAATG
GGGGAAATCA TTTGTCGTAA CATCGAATCT GTACTCAACG AATCGCATGT TTCGCTGTTC
GCACTGCGCA ACGGGATGCC GATACACTGG GCGTCATCTT CCCACGGTGC AGAAATTCAA
AATGCGCAAA GCTGGTCAGC GACCATTCGT CAGCGTGATG GCGCGCCTGC GGGGATCCTG
CAAATTAAAA CCTCGTCAGG AGCAGAAACC AGCGCCTTTA TCGAACGCGT GGCAGATATC
AGCCAGCATA TGGCCGCGCT GGCGCTGGAA CAGGAAAAAA GCCGTCAGCA TATTGAACAA
CTCATCCAAT TTGATCCGAT GACCGGTCTG CCAAATCGCA ATAACCTGCA CAATTACCTC
GATGACCTGG TCGACAAAGC CGTCTCTCCC GTGGTGTATC TCATCGGTGT TGACCATATT
CAGGATGTGA TTGATAGCCT TGGCTATGCG TGGGCCGATC AGGCATTGCT GGAAGTGGTC
AATCGCTTTC GTGAAAAACT CAAACCGGAT CAGTATCTCT GTCGTATCGA AGGTACGCAG
TTTGTCCTCG TGAGCCTCGA AAACGACGTC AGTAACATTA CCCAAATCGC CGATGAGCTA
CGGAATGTGG TCAGCAAGCC GATAATGATT GACGATAAAC CCTTCCCGCT TACCTTGAGT
ATTGGCATCA GCTACGACGT GGGTAAAAAC CGCGATTACT TGCTCTCCAC TGCTCACAAT
GCAATGGATT ATATTCGCAA GAATGGCGGT AACGGCTGGC AGTTCTTCAG CCCGGCGATG
AACGAAATGG TAAAAGAGCG TTTGTTTTTA GGCGCAGCGC TGAAAGAAGC GATTAGCAAT
AACCAACTGA AACTGGTTTA CCAGCCGCAA ATCTTCGCAG AAACGGGTGA ACTGTACGGC
ATCGAAGCCC TTGCTCGCTG GCACGATCCC CTGCATGGTC ATGTGCCCCC TTCACGGTTT
ATTCCTCTCG CAGAAGAGAT TGGTGAAATC GAAAATATTG GGCGCTGGGT CATCGCGGAA
GCTTGCCGTC AGTTAGCAGA ATGGCGTAGC CAGAATATTC ATATCCCGGC GCTTTCCGTT
AACTTGTCGG CACTGCACTT TCGCAGTAAT CAGCTGCCTA ATCAGGTGTC TGATGCAATG
CAAGCCTGGG GTATTGACGG CCACCAGCTG ACGGTGGAAA TCACGGAAAG CATGATGATG
GAACACGATA CCGAAATCTT TAAGCGCATT CAGATCCTGC GCGATATGGG CGTGGGCTTA
TCGGTAGATG ATTTTGGCAC GGGCTTTTCC GGATTATCCC GCTTAGTCAG TCTTCCGGTA
ACGGAAATCA AAATTGACAA AAGTTTTGTC GATCGTTGTC TGACCGAAAA ACGCATCCTT
GCCTTACTTG AAGCCATTAC CAGCATTGGG CAAAGCCTCA ATTTAACCGT CGTGGCGGAA
GGCGTCGAAA CCAAAGAGCA ATTTGAGATG CTACGCAAGA TCCACTGTCG CGTTATTCAG
GGATATTTCT TTTCCCGCCC CCTACCCGCC GAAGAAATTC CAGGCTGGAT GAGCAGCGTG
TTACCGCTGA AAATCTGA
 
Protein sequence
KVYYLALVRD ASVEMAQKEQ TRQLIIAVDH LDRPVIVLDP ERHIVQCNRA FTEMFGYCIS 
EASGMQPDTL LNIPEFPADN RIRLQQLLWK TARDQDEFLL LTRTGEKIWI KASISPVYDV
LAHLQNLVMT FSDITEERQI RQLEGNILAA MCSSPPFHEM GEIICRNIES VLNESHVSLF
ALRNGMPIHW ASSSHGAEIQ NAQSWSATIR QRDGAPAGIL QIKTSSGAET SAFIERVADI
SQHMAALALE QEKSRQHIEQ LIQFDPMTGL PNRNNLHNYL DDLVDKAVSP VVYLIGVDHI
QDVIDSLGYA WADQALLEVV NRFREKLKPD QYLCRIEGTQ FVLVSLENDV SNITQIADEL
RNVVSKPIMI DDKPFPLTLS IGISYDVGKN RDYLLSTAHN AMDYIRKNGG NGWQFFSPAM
NEMVKERLFL GAALKEAISN NQLKLVYQPQ IFAETGELYG IEALARWHDP LHGHVPPSRF
IPLAEEIGEI ENIGRWVIAE ACRQLAEWRS QNIHIPALSV NLSALHFRSN QLPNQVSDAM
QAWGIDGHQL TVEITESMMM EHDTEIFKRI QILRDMGVGL SVDDFGTGFS GLSRLVSLPV
TEIKIDKSFV DRCLTEKRIL ALLEAITSIG QSLNLTVVAE GVETKEQFEM LRKIHCRVIQ
GYFFSRPLPA EEIPGWMSSV LPLKI