Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00472 |
Symbol | allD |
ID | 8115371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 514258 |
End bp | 515307 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644846754 |
Product | hypothetical protein |
Protein accession | YP_002998327 |
Protein GI | 251784023 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | [TIGR03175] ureidoglycolate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA GTCGGGAAAC ACTCCACCAG CTAATTGAGA ATAAACTCTG CCAGGCTGGG TTAAAACGTG AGCACGCTGC AACCGTGGCT GAAGTATTGG TTTACGCCGA TGCCAGAGGG ATCCACTCTC ATGGCGCGGT GCGCGTGGAA TACTACGCGG AACGCATTTC AAAAGGCGGC ACCAACCGCG AACCGGAGTT TCGTCTTGAG GAAACCGGGC CGTGCTCGGC AATTTTACAT GCCGACAATG CCGCCGGACA GGTCGCGGCG AAAATGGGTA TGGAACATGC CATCAAAACC GCCCAGCAAA ATGGCGTTGC GGTGGTCGGT ATCAGCCGGA TGGGTCACAG CGGCGCAATC TCTTATTTTG TGCAGCAGGC AGCCCGCGCC GGATTAATTG GTATTTCGAT GTGCCAGTCC GATCCAATGG TGGTGCCGTT TGGCGGCGCG GAAATTTACT ACGGTACTAA CCCACTGGCC TTTGCCGCGC CGGGAGAAGG CGACGAGATC CTTACCTTTG ATATGGCGAC TACCGTACAG GCATGGGGAA AAGTGCTCGA CGCCCGCTCG CGTAATATGT CTATCCCGGA TACCTGGGCG GTCGATAAAA ACGGTGCACC AACAACCGAT CCGTTCGCGG TACATGCTCT GCTCCCCGCC GCCGGGCCAA AAGGGTATGG CCTGATGATG ATGATTGACG TCCTCTCAGG CGTCTTACTC GGCTTACCGT TCGGGCTACA GGTTAGTTCG ATGTATGACG ATTTACACGC CGGGCGTAAT TTGGGGCAAT TACATATCGT TATTAACCCG AACTTTTTCT TCTCCAGCAA ATTATTTCGT CAACATCTTA GCCAGACCAT GCGCGAATTA AATGCCATTA CCCCCGCGCC CGGTTTTAAT CAGGTTTATT ATCCCGGACA GGATCAGGAT ATTAAACAAC GCAAAGCCGC CGTCGAAGGC ATCGAAATTG TTGATGATAT TTACCAGTAT TTAATTTCCG ACGCGCTTTA TAACACGTCA TACGAAACGA AAAATCCCTT TGCGCAATAA
|
Protein sequence | MKISRETLHQ LIENKLCQAG LKREHAATVA EVLVYADARG IHSHGAVRVE YYAERISKGG TNREPEFRLE ETGPCSAILH ADNAAGQVAA KMGMEHAIKT AQQNGVAVVG ISRMGHSGAI SYFVQQAARA GLIGISMCQS DPMVVPFGGA EIYYGTNPLA FAAPGEGDEI LTFDMATTVQ AWGKVLDARS RNMSIPDTWA VDKNGAPTTD PFAVHALLPA AGPKGYGLMM MIDVLSGVLL GLPFGLQVSS MYDDLHAGRN LGQLHIVINP NFFFSSKLFR QHLSQTMREL NAITPAPGFN QVYYPGQDQD IKQRKAAVEG IEIVDDIYQY LISDALYNTS YETKNPFAQ
|
| |