Gene B21_00472 details

Gene Information

Locus tag: B21_00472
Symbol: allD
ID: 8115371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 514258
End bp: 515307
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 52%
IMG OID: 644846754
Product: hypothetical protein
Protein accession: YP_002998327
Protein GI: 251784023
COG category: [C] Energy production and conversion
COG ID: [COG2055] Malate/L-lactate dehydrogenases
TIGRFAM ID: [TIGR03175] ureidoglycolate dehydrogenase
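
The coordinates and lengths in this record are mutually consistent, which a little arithmetic can confirm. The sketch below is illustrative only: it assumes the Start/End coordinates are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(514258, 515307)
print(length, protein_length(length))  # 1050 349
```

Under those assumptions the result matches the Gene Length (1050 bp) and Protein Length (349 aa) fields of the record.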


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG 
TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTACGCCGA TGCCAGAGGG
ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCGG AACGCATTTC AAAAGGCGGC
ACCAACCGCG AACCGGAGTT TCGTCTTGAG GAAACCGGGC CGTGCTCGGC AATTTTACAT
GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC
GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC
TCTTATTTTG TGCAGCAGGC AGCCCGCGCC GGATTAATTG GTATTTCGAT GTGCCAGTCC
GATCCAATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC
TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG
GCATGGGGAA AAGTGCTCGA CGCCCGCTCG CGTAATATGT CTATCCCGGA TACCTGGGCG
GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCC
GCCGGGCCAA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC
GGCTTACCGT TCGGGCTACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT
TTGGGGCAAT TACATATCGT TATTAACCCG AACTTTTTCT TCTCCAGCAA ATTATTTCGT
CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT
CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCAAAGCCGC CGTCGAAGGC
ATCGAAATTG TTGATGATAT TTACCAGTAT TTAATTTCCG ACGCGCTTTA TAACACGTCA
TACGAAACGA AAAATCCCTT TGCGCAATAA
 
Protein sequence
MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG 
TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI
SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ
AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL
GLPFGLQVSS MYDDLHAGRN LGQLHIVINP NFFFSSKLFR QHLSQTMREL NAITPAPGFN
QVYYPGQDQD IKQRKAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ
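
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced the record: it builds the codon-to-residue mapping of NCBI translation table 11 from its compact string form, strips the whitespace used to group the sequence into blocks of ten, and translates up to the first stop codon. It assumes the sequence is given 5' to 3' on the coding strand and contains only A, C, G and T. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# residue assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAATCA GTCGGGAAAC ACTCCACCAG"))  # MKISRETLHQ
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` offers a way to check the reported GC content.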