Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04196 |
Symbol | ybl228 |
ID | 8115090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4505996 |
End bp | 4507285 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644850336 |
Product | hypothetical protein |
Protein accession | YP_003001909 |
Protein GI | 251787605 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR02303] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit [TIGR02305] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCA CTATCTTCGC CGTAGCGTTG AACCATCGCA GCCAGCTTGA TGCATGGCAG GACGCGTTCC AGCAATCCCC CTACAAAGCC CCGCCTAAAA CTGCGGTCTG GTTTATTAAA CCGCGCAATA CGGTGATTGG TTGCGGTGAA CCGATTCCCT TTCCACAGGG TGAAAAGGTA CTGAGCGGTG CGACAGTTGC GTTGATTGTG GGAAAAACGG CGACGAAAGT ACGTGAAGAA GATGCGGCAG AGTACATCGC CGGATATGCG CTGGCTAACG ACGTCAGCCT GCCAGAAGAG AGCTTTTACC GCCCGGCAAT CAAAGCAAAA TGCCGTGATG GATTCTGCCC CATTGGCGAA ACCGTGGCTC TCAGCAATGT CGATAATCTG ACTATCTATA CCGAGATCAA CGGGCGTCCT GCCGATCACT GGAACACCGC CGATTTACAA CGTAACGCCG CACAGTTGCT GAGTGCCCTG AGCGAATTTG CCATGCTGAA TCCAGGCGAT GCCATTCTGC TCGGCACGCC ACAGGCGCGC GTGGAAATAC AGCCAGGCGA TCGCGTGCGT GTTCTCGCAG AAGGTTTCCC GCCGCTGGAA AATCCGGTAG TGGACGAACG TGAAGTGACC ACGCGCAAGA GCTTCCCAAC GCAGCCACAC CCGCACGGCA CGCTGTTTGC CCTCGGCCTG AACTACGCCG ACCACGCCAG CGAACTGGAA TTTAAGCCAC CGGAAGAGCC GCTGGTGTTC CTGAAAGCGC CGAATACCCT CACTGGCGAT AACCAGACCT CCGTTCGCCC AAACAATATT GCATACATGC ACTACGAAGC GGAGCTGGTG GTGGTGATTG GCAAGCAGGC GCGTAACGTC AGCGAAGCCG ATGCCATGGA TTATGTCGCG GGCTACACCG TGTGTAACGA CTACGCCATT CGCGACTATC TGGAAAACTA CTACCGCCCT AACCTGCGGG TAAAAAGCCG CGACGGACTG ACGCCGATGC TTTCAACCAT CGTGCCGAAA GAGGCGATCC CGGACCCGCA TAATCTGACC CTTCGCACCT TCGTCAACGG CGAGTTACGC CAGCAAGGCA CCACCGCCGA TCTGATCTTC AGCGTGCCCT TCCTGATCGC CTACTTAAGC GAATTTATGA CCCTGAATCC GGGCGACATG ATCGCCACCG GCACACCAAA AGGCTTATCT GACGTAGTGC CTGGCGATGA AGTAGTGGTG GAAGTAGAAG GCGTGGGCCG CCTGGTGAAC CGAATTGTGA GTGAGGAAAC AGCGAAATGA
|
Protein sequence | MKGTIFAVAL NHRSQLDAWQ DAFQQSPYKA PPKTAVWFIK PRNTVIGCGE PIPFPQGEKV LSGATVALIV GKTATKVREE DAAEYIAGYA LANDVSLPEE SFYRPAIKAK CRDGFCPIGE TVALSNVDNL TIYTEINGRP ADHWNTADLQ RNAAQLLSAL SEFAMLNPGD AILLGTPQAR VEIQPGDRVR VLAEGFPPLE NPVVDEREVT TRKSFPTQPH PHGTLFALGL NYADHASELE FKPPEEPLVF LKAPNTLTGD NQTSVRPNNI AYMHYEAELV VVIGKQARNV SEADAMDYVA GYTVCNDYAI RDYLENYYRP NLRVKSRDGL TPMLSTIVPK EAIPDPHNLT LRTFVNGELR QQGTTADLIF SVPFLIAYLS EFMTLNPGDM IATGTPKGLS DVVPGDEVVV EVEGVGRLVN RIVSEETAK
|
| |