Gene B21_02457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02457 
SymbolyfiN 
ID8113939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2608105 
End bp2609331 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content49% 
IMG OID644848657 
Producthypothetical protein 
Protein accessionYP_003000230 
Protein GI251785926 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATA ACGATAATTC TCTTAATAAG CGCCCCACGT TTAAAAGAGC ATTACGCAAC 
ATCAGTATCA CCAGCATATT TATCACTATG ATGCTGATCT GGTTGCTGCT TTCCGTGACC
TCGGTGCTGA CCCTGAAACA GTACGCGCAA AAAAACCTGG CACTGACAGC AGCAACAATG
ACTTACAGTC TGGAAGCAGC TGTCGTTTTT GCCGATGGCC CTGCAGCAAC TGAAACACTG
GCAGCGCTGG GCCAGCAAGG GCAATTTTCA ACTGCAGAAG TACGTGATAA GCAGCAAAAT
ATTCTGGCGT CCTGGCATTA CACCCGTAAG GATCCAGGCG ATACTTTCAG CAATTTCATA
AGCCACTGGC TCTTCCCCGC CCCCATCATT CAGCCGATTC GTCACAATGG TGAAACCATT
GGCGAAGTAC GCTTAACCGC TCGCGACAGT TCAATCAGCC ATTTTATCTG GTTTTCGCTC
GCCGTACTGA CCGGTTGTAT TCTGCTGGCA TCAGGCATCG CAATTACCCT CACCCGCCAT
TTGCACAATG GCCTGGTGGA AGCACTGAAA AATATCACCG ATGTCGTACA TGATGTGCGT
TCCAACCGCA ATTTTTCCCG ACGAGTTTCG GAAGAACGTA TCGCTGAGTT TCACCGCTTC
GCTCTCGACT TCAACAGTCT GCTGGATGAA ATGGAAGAGT GGCAGCTTCG TTTACAGGCT
AAAAATGCGC AGCTTCTACG TACCGCGCTA CATGACCCAT TAACCGGGCT GGCTAACCGC
GCAGCGTTTC GTAGCGGCAT CAACACGTTG ATGAACAATT CCGATGCCCG AAAAACGTCG
GCGTTACTAT TTCTTGATGG CGATAATTTC AAATACATCA ATGATACCTG GGGTCATGCG
ACGGGCGATA GAGTCTTGAT TGAAATCGCA AAACGGTTAG CTGAAGTTGG CGGGCTGCGA
CATAAAGCAT ACCGCCTGGG CGGCGATGAA TTCGCTATGG TGCTCTATGA TGTACAGTCA
GAATCTGAAG TGCAGCAGAT ATGCTCAGCA CTGACACAAA TCTTTAATCT CCCGTTTGAT
CTTCATAATG GTCATCAGAC CACCATGACA TTAAGCATTG GTTACGCGAT GACCATTGAG
CACGCCTCTG CGGAAAAATT ACAAGAGCTT GCCGATCACA ATATGTATCA GGCCAAACAC
CAGCGTGCCG AAAAGCTGGT GAGATAA
 
Protein sequence
MMDNDNSLNK RPTFKRALRN ISITSIFITM MLIWLLLSVT SVLTLKQYAQ KNLALTAATM 
TYSLEAAVVF ADGPAATETL AALGQQGQFS TAEVRDKQQN ILASWHYTRK DPGDTFSNFI
SHWLFPAPII QPIRHNGETI GEVRLTARDS SISHFIWFSL AVLTGCILLA SGIAITLTRH
LHNGLVEALK NITDVVHDVR SNRNFSRRVS EERIAEFHRF ALDFNSLLDE MEEWQLRLQA
KNAQLLRTAL HDPLTGLANR AAFRSGINTL MNNSDARKTS ALLFLDGDNF KYINDTWGHA
TGDRVLIEIA KRLAEVGGLR HKAYRLGGDE FAMVLYDVQS ESEVQQICSA LTQIFNLPFD
LHNGHQTTMT LSIGYAMTIE HASAEKLQEL ADHNMYQAKH QRAEKLVR