Gene B21_03330 details

Gene Information

Locus tag: B21_03330
Symbol: yhjK
ID: 8113425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3544029
End bp: 3545978
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 53%
IMG OID: 644849505
Product: hypothetical protein
Protein accession: YP_003001078
Protein GI: 251786774
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
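
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3544029
end_bp = 3545978

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1950 bp
print(protein_length, "aa")  # 649 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.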


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGCAG CCGTTGTCCT GGTGTTCGTT TTTATTTTTT GCACCGTTTT GCTGTTCCAT 
CTGGTCCAGC AGAATCGCTA TAACACGGCT ACGCAACTGG AAAGCATTGC TCGCTCTGTC
CGCGAACCCT TATCTTCAGC TATTTTGAAA GGCGATATTC CCGAAGCGGA AGCTATTCTT
GCCAGCATTA AACCGGCAGG CGTGGTCAGC CGTGCCGATG TAGTGCTGCC TAACCAGTTC
CAGGCGCTGC GTAAAAGTTT TATTCCAGAG CGCCCGGTGC CGGTAATGGT TACTCGCCTG
TTTGAGCTAC CGGTTCAAAT CTCGCTGGGC GTTTACTCGC TCGAACGTCC GGCAAACCCG
CAGCCAATTG CCTATCTGGT ACTACAGGCG GATTCCTTCC GTATGTATAA GTTCGTGATG
AGCACCCTCT CAACGTTAGT GACCATTTAC TTACTTTTGT CGCTTATCCT GACCGTCGCC
ATCAGCTGGT GCATTAACCG CCTGATTTTG CATCCGTTAC GCAATATTGC TCGCGAACTT
AACGCCATCC CAGCCAAGGA GCTTGTTGGT CACCAACTGG CATTACCGCG TCTGCATCAG
GACGATGAAA TCGGTATGTT GGTGCGCAGT TACAACCTCA ACCAGCAATT GCTGCAGCGC
CATTATGAAG AACAGAACGA AAATGCGATG CGCTTCCCGG TGTCGGATTT GCCGAACAAA
GCCTTGCTGA TGGAGATGCT GGAGCAGGTT GTCGCGCGTA AACAAACCAC CGCGCTGATG
ATCATCACCT GTGAAACCCT GCGTGATACT GCGGGCGTGC TGAAAGAGGC GCAACGAGAA
ATTCTGCTGC TGACGCTGGT GGAAAAACTC AAATCGGTAC TGTCGCCACG TATGATCCTC
GCGCAGATTA GCGGTTATGA CTTTGCTGTC ATTGCCAACG GTGTACAGGA ACCGTGGCAC
GCAATCACCT TAGGTCAGCA AGTGCTCACT ATCATGAGCG AGCGCCTGCC GATTGAACGT
ATTCAACTCC GTCCGCACTG TAGCATTGGC GTGGCGATGT TCTACGGCGA TCTCACCGCC
GAACAGCTTT ACAGTCGCGC TATTTCTGCG GCATTTACCG CTCGCCATAA AGGCAAGAAT
CAGATTCAGT TCTTTGATCC GCAGCAGATG GAAGCCGCCC AGAAGCGGTT GACGGAAGAG
AGCGATATCC TTAATGCACT GGAAAATCAT CAGTTTGCTA TTTGGTTACA GCCACAGGTC
GAGATGACCA GCGGTAAACT GGTCAGTGCG GAAGTGTTAC TGCGTATCCA GCAACCGGAT
GGCAGTTGGG ACCTGCCGGA TGGCTTAATC GATCGCATTG AGTGCTGTGG GCTGATGGTT
ACCGTCGGTC ACTGGGTGCT GGAAGAGTCC TGTCGATTGC TTGCAGCCTG GCAAGAGCGC
GGCATTATGC TGCCCTTGTC GGTAAACCTC TCTGCGCTGC AACTGATGCA CCCGAATATG
GTGGCGGATA TGCTGGAACT GTTAACCCGC TATCGCATTC AGCCGGGAAC ACTGATTCTG
GAAGTGACAG AAAGCCGACG TATTGACGAC CCTCATGCTG CGGTGGCAAT CCTCCGTCCG
CTGCGCAATG CCGGAGTTCG GGTGGCGCTG GATGATTTCG GCATGGGCTA CGCAGGGCTG
CGTCAGCTGC AGCATATGAA ATCGTTGCCA ATCGACGTAC TGAAAATCGA CAAAATGTTT
GTTGAAGGCT TGCCGGGAGA TAGCAGCATG ATTGCTGCAA TTATCATGCT GGCGCAGAGC
CTGAACTTAC AAATGATTGC CGAAGGCGTG GAGACTGAAG CACAACGCGA CTGGCTGGCA
AAAGCGGGCG TTGGTATTGC CCAGGGCTTC CTTTTTGCTC GCCCACTCCC TATTGAAATC
TTCGAAGAGA GTTACCTGGA AGAAAAGTAG
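
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a small example; the full listing would be pasted in the same way.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as a short example.
first_line = "ATGGTGGCAG CCGTTGTCCT GGTGTTCGTT TTTATTTTTT GCACCGTTTT GCTGTTCCAT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases in this one line
print(seq.startswith("ATG"))       # True: the CDS begins with a start codon
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Applied to the complete 1950 bp sequence, the same functions give a whole-gene value that can be compared with the listed GC content.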
 
Protein sequence
MVAAVVLVFV FIFCTVLLFH LVQQNRYNTA TQLESIARSV REPLSSAILK GDIPEAEAIL 
ASIKPAGVVS RADVVLPNQF QALRKSFIPE RPVPVMVTRL FELPVQISLG VYSLERPANP
QPIAYLVLQA DSFRMYKFVM STLSTLVTIY LLLSLILTVA ISWCINRLIL HPLRNIAREL
NAIPAKELVG HQLALPRLHQ DDEIGMLVRS YNLNQQLLQR HYEEQNENAM RFPVSDLPNK
ALLMEMLEQV VARKQTTALM IITCETLRDT AGVLKEAQRE ILLLTLVEKL KSVLSPRMIL
AQISGYDFAV IANGVQEPWH AITLGQQVLT IMSERLPIER IQLRPHCSIG VAMFYGDLTA
EQLYSRAISA AFTARHKGKN QIQFFDPQQM EAAQKRLTEE SDILNALENH QFAIWLQPQV
EMTSGKLVSA EVLLRIQQPD GSWDLPDGLI DRIECCGLMV TVGHWVLEES CRLLAAWQER
GIMLPLSVNL SALQLMHPNM VADMLELLTR YRIQPGTLIL EVTESRRIDD PHAAVAILRP
LRNAGVRVAL DDFGMGYAGL RQLQHMKSLP IDVLKIDKMF VEGLPGDSSM IAAIIMLAQS
LNLQMIAEGV ETEAQRDWLA KAGVGIAQGF LFARPLPIEI FEESYLEEK