Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04201 |
Symbol | yjjN |
ID | 8115005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4512454 |
End bp | 4513476 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644850340 |
Product | hypothetical protein |
Protein accession | YP_003001913 |
Protein GI | 251787609 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.774265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACGA TGAATGTTTT AATTTGCCAG CAGCCGAAAG AATTAGTCTG GAAACAACGC GAGATACCTA TTCCGGGTGA CAATGAAGCA TTAATAAAAA TTAAGTCTGT CGGGATTTGC GGTACCGATA TTCATGCCTG GGGTGGAAAT CAACCATTTT TTAGTTATCC ACGTGTTTTA GGCCATGAAA TATGTGGGGA GATTGTTGGG CTGGGTAAAA ATATTGCTGA TCTTAAAAAT GGTCAGCAAG TTGCTGTGAT CCCTTATGTT GCCTGTCAGC AATGCCCGGC GTGTAAAAGC GGGCGTACCA ATTGCTGTGA AAAAATTTCA GTCATTGGCG TGCATCAGGA TGGCGGTTTT AGTGAGTATT TGAGCGTGCC GGTGGCGAAC ATTTTGCCCG CAGACGGTAT TGACCCGCAG GCGGCAGCAT TGATTGAACC TTTCGCTATT AGCGCTCATG CGGTGCGTCG CGCAGCCATT GCTCCCGGCG AGCAGGTGCT GGTGGTCGGG GCGGGGCCAA TCGGTCTGGG CGCGGCGGCA ATCGCTAAAG CCGATGGCGC ACAGGTGGTG GTGGCGGATA CCAGTCCGGC GCGCCGTGAA CATGTGGCAA CGCGTCTGGA ATTACCTTTA CTGGACCCGT CAGCCGAAGA TTTTGACGCG CAGCTACGGG CGCAGTTTGG TGGTTCGCTG GCGCAGAAAG TGATCGACGC GACAGGTAAT CAACATGCGA TGAATAACAC CGTGAATTTG ATTCGTCACG GCGGCACGGT GGTATTTGTC GGCCTGTTTA AAGGTGAGTT GCAGTTCTCC GATCCGGAAT TCCATAAAAA AGAAACGACG ATGATGGGCA GCCGCAACGC CACGCCGGAA GATTTTGCTA AAGTCGGTCG ACTGATGGCG GAAGGAAAAA TCACTGCTGA CATGATGTTA ACCCATCGCT ATCCGTTCGC CACGCTGGCA GAAACCTACG AGCGCGATGT GATTAACAAT CGTGAGTTAA TTAAAGGCGT AATTACTTTC TGA
|
Protein sequence | MSTMNVLICQ QPKELVWKQR EIPIPGDNEA LIKIKSVGIC GTDIHAWGGN QPFFSYPRVL GHEICGEIVG LGKNIADLKN GQQVAVIPYV ACQQCPACKS GRTNCCEKIS VIGVHQDGGF SEYLSVPVAN ILPADGIDPQ AAALIEPFAI SAHAVRRAAI APGEQVLVVG AGPIGLGAAA IAKADGAQVV VADTSPARRE HVATRLELPL LDPSAEDFDA QLRAQFGGSL AQKVIDATGN QHAMNNTVNL IRHGGTVVFV GLFKGELQFS DPEFHKKETT MMGSRNATPE DFAKVGRLMA EGKITADMML THRYPFATLA ETYERDVINN RELIKGVITF
|
| |