Gene B21_03244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03244 
SymbolyhhX 
ID8112547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3442128 
End bp3443165 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content51% 
IMG OID644849421 
Producthypothetical protein 
Protein accessionYP_003000994 
Protein GI251786690 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.414276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCATCA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG 
TATGTACTTA ACCGCAAGGA TAGCTGGCAT GTCGCGCATA TTTTTCGTCG CCATGCGAAG
CCGGAAGAAC AGGCTCCCAT TTATTCCCAT ATCCATTTCA CCAGCGATCT CGACGAAGTA
CTAAACGATC CCGATGTTAA GCTGGTTGTT GTCTGCACCC ACGCGGACAG CCATTTCGAG
TACGCGAAAC GCGCGCTGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACCCCG
ACGCTTGCCC AGGCGAAGGA GCTGTTTGCG TTGGCGAAAA GCAAAGGGCT GACCGTCACG
CCGTATCAGA ATCGTCGCTT TGACTCCTGC TTCCTGACAG CGAAAAAAGC GATTGAAAGT
GGCAAGTTGG GAGAGATTGT TGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTGGCA
GAAACCAAAC CTGGGCTGCC GCAGGATGGC GCGTTTTATG GCCTTGGTGT GCATACGATG
GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG
CGTAATAAAG CCAATCCTGA CGACACCTTT GAAGCGCAAC TGTTTTATGG CGACCTGAAA
GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT
AAGAAAGGTT CGTTTATTAA ATACGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT
ATTATGCCGG GCGAACCGGG ATTCGCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC
AATGACGAGG GCGTGACGGT CAGAGAAGAG ATGAAGCCGG AGATGGGCGA TTACGGGCGC
GTTTATGATG CGTTGTATCA AACCATCACC CACGGTGCGC CAAATTACGT CAAGGAATCT
GAAGTTCTTA CCAATCTGGA AATCCTTGAA CGCGGATTTG AGCAAGCCTC TCCCTCCACA
GTGACTCTCG CGAAGTAA
 
Protein sequence
MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLDEV 
LNDPDVKLVV VCTHADSHFE YAKRALEAGK NVLVEKPFTP TLAQAKELFA LAKSKGLTVT
PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM
DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG
KKGSFIKYGI DQQETSLKAN IMPGEPGFAA DDSVGVLEYV NDEGVTVREE MKPEMGDYGR
VYDALYQTIT HGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK