Gene B21_02369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02369 
SymbolispG 
ID8116346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2504375 
End bp2505493 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content53% 
IMG OID644848571 
Producthypothetical protein 
Protein accessionYP_003000144 
Protein GI251785840 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACC AGGCTCCAAT TCAACGTAGA AAATCAACAC GTATTTACGT TGGGAATGTG 
CCGATTGGCG ATGGTGCTCC CATCGCCGTA CAGTCCATGA CCAATACGCG TACGACAGAC
GTCGAAGCAA CGGTCAATCA AATCAAGGCG CTGGAACGCG TTGGCGCTGA TATCGTCCGT
GTATCCGTAC CGACGATGGA CGCGGCAGAA GCGTTCAAAC TCATCAAACA GCAGGTTAAC
GTGCCGCTGG TGGCTGACAT CCACTTCGAC TATCGCATTG CGCTGAAAGT AGCGGAATAC
GGCGTCGATT GTCTGCGTAT TAACCCTGGC AATATCGGTA ATGAAGAGCG TATTCGCATG
GTGGTTGACT GTGCGCGCGA TAAAAACATT CCGATCCGTA TTGGCGTTAA CGCCGGATCG
CTGGAAAAAG ATCTGCAAGA AAAGTATGGC GAACCGACGC CGCAGGCGTT GCTGGAATCT
GCCATGCGTC ATGTTGATCA TCTCGATCGC CTGAACTTCG ATCAGTTCAA AGTCAGCGTG
AAAGCGTCTG ACGTCTTCCT CGCTGTTGAG TCTTATCGTT TGCTGGCAAA ACAGATCGAT
CAGCCGTTGC ATCTGGGGAT CACCGAAGCC GGTGGTGCGC GCAGCGGGGC AGTAAAATCC
GCCATTGGTT TAGGTCTGCT GCTGTCTGAA GGCATCGGCG ACACGCTGCG CGTATCGCTG
GCGGCCGATC CGGTCGAAGA GATCAAAGTC GGTTTCGATA TTTTGAAATC GCTGCGTATC
CGTTCGCGAG GGATCAACTT CATCGCCTGC CCGACCTGTT CGCGTCAGGA ATTTGATGTT
ATCGGTACGG TTAACGCGCT GGAGCAACGC CTGGAAGATA TCATCACTCC GATGGACGTT
TCGATTATCG GCTGCGTGGT GAATGGCCCA GGTGAGGCGC TGGTTTCTAC ACTCGGCGTC
ACCGGCGGCA ACAAGAAAAG CGGCCTCTAT GAAGATGGCG TGCGCAAAGA CCGTCTGGAC
AACAACGATA TGATCGACCA GCTGGAAGCA CGCATTCGTG CGAAAGCCAG TCAGCTGGAC
GAAGCGCGTC GAATTGACGT TCAGCAGGTT GAAAAATAA
 
Protein sequence
MHNQAPIQRR KSTRIYVGNV PIGDGAPIAV QSMTNTRTTD VEATVNQIKA LERVGADIVR 
VSVPTMDAAE AFKLIKQQVN VPLVADIHFD YRIALKVAEY GVDCLRINPG NIGNEERIRM
VVDCARDKNI PIRIGVNAGS LEKDLQEKYG EPTPQALLES AMRHVDHLDR LNFDQFKVSV
KASDVFLAVE SYRLLAKQID QPLHLGITEA GGARSGAVKS AIGLGLLLSE GIGDTLRVSL
AADPVEEIKV GFDILKSLRI RSRGINFIAC PTCSRQEFDV IGTVNALEQR LEDIITPMDV
SIIGCVVNGP GEALVSTLGV TGGNKKSGLY EDGVRKDRLD NNDMIDQLEA RIRAKASQLD
EARRIDVQQV EK