Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03694 |
Symbol | yihE |
ID | 8114935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3947230 |
End bp | 3948216 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644849855 |
Product | hypothetical protein |
Protein accession | YP_003001428 |
Protein GI | 251787124 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.617644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACA GCGCTTTTAC TTTCCAGACA CTACACCCGG ATACCATCAT GGACGCTCTG TTTGAGCATG GGATCCGGGT GGATTCCGGT CTTACCCCGC TTAACAGCTA TGAAAACCGT GTCTATCAAT TTCAGGACGA AGATCGTCGA CGTTTTGTCG TCAAATTTTA TCGCCCTGAA CGTTGGACAG CCGATCAAAT CCTCGAAGAA CATCAATTTG CGTTGCAGCT GGTAAATGAT GAAGTTCCGG TCGCAGCACC TGTGGCCTTT AACGGTCAGA CTTTATTGAA TCATCAGGGA TTTTATTTCG CTGTTTTTCC AAGCGTCGGT GGTCGCCAGT TCGAAGCTGA TAATATCGAT CAGATGGAAG CGGTTGGGCG TTATTTAGGG CGTATGCACC AGACGGGGCG CAAACAGCTT TTTATCCATC GCCCGACCAT CGGTTTGAAC GAATATCTCA TTGAGCCACG CAAGCTGTTT GAGGACGCTA CACTGATACC TTCCGGGTTG AAAGCGGCAT TCCTGAAAGC GACAGATGAG CTGATTGCCG CCGTTACAGC ACACTGGCGG GAAGATTTCA CCGTTCTGCG GCTACATGGA GACTGCCACG CCGGGAATAT TCTCTGGCGC GATGGTCCAA TGTTTGTTGA TCTGGATGAT GCACGTAATG GTCCAGCCGT TCAGGATTTG TGGATGTTGC TCAATGGCGA TAAAGCCGAG CAGCGGATGC AACTGGAAAC TATTATTGAA GCTTATGAAG AATTTAGCGA GTTCGACACC GCTGAAATCG GACTGATTGA ACCTTTACGC GCCATGCGTT TGGTTTATTA TCTTGCCTGG CTAATGCGGC GTTGGGCTGA TCCCGCGTTC CCGAAAAATT TCCCGTGGTT AACCGGGGAA GATTACTGGC TACGACAGAC GGCGACTTTT ATAGAACAGG CAAAAGTTCT ACAAGAACCC CCTTTGCAAT TAACACCTAT GTATTAA
|
Protein sequence | MNNSAFTFQT LHPDTIMDAL FEHGIRVDSG LTPLNSYENR VYQFQDEDRR RFVVKFYRPE RWTADQILEE HQFALQLVND EVPVAAPVAF NGQTLLNHQG FYFAVFPSVG GRQFEADNID QMEAVGRYLG RMHQTGRKQL FIHRPTIGLN EYLIEPRKLF EDATLIPSGL KAAFLKATDE LIAAVTAHWR EDFTVLRLHG DCHAGNILWR DGPMFVDLDD ARNGPAVQDL WMLLNGDKAE QRMQLETIIE AYEEFSEFDT AEIGLIEPLR AMRLVYYLAW LMRRWADPAF PKNFPWLTGE DYWLRQTATF IEQAKVLQEP PLQLTPMY
|
| |