Gene Information |
Locus tag | B21_03696 |
Symbol | yihF |
ID | 8114555 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3949014 |
End bp | 3950444 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644849857 |
Product | hypothetical protein |
Protein accession | YP_003001430 |
Protein GI | 251787126 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
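The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends with a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
length = gene_length_bp(3949014, 3950444)
print(length, protein_length_aa(length))  # 1431 476
```

Both results match the Gene Length and Protein Length fields reported in the record.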
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.142182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATACGTA AGTCAGCTAC AGGTGTTATT GTTGCGTTAG CCGTAATCTG GGGTGGTGGC ACATGGTACA CAGGTACGCA AATTCAGCCT GGTGTCGAAA AGTTTATTAA AGATTTTAAC GATGCTAAAA AGAAAGGTGA ACATGCCTAC GATATGACGT TAAGTTATAA AAATTTTGAC AAAGGTTTTT TTAATTCTCG TTTTCAAATG CAAATGACAT TCGATAACGG TGCACCCGAT CTCAATATCA AGCCAGGCCA GAAAGTTGTA TTTGATGTGG ATGTTGAGCA CGGTCCGTTG CCCATCACAA TGTTAATGCA TGGTAATGTC ATCCCAGCAC TGGCAGCGGC AAAAGTGAAC TTAGTGAATA ATGAACTGAC ACAACCGCTA TTTATCGCCG CGAAAAATAA ATCGCCCGTG GAAGCGACAT TGCGATTCGC GTTTGGTGGC TCATTCTCTA CGACATTAGA TGTTGCCCCT GCAGAGTATG GAAAGTTTTC TTTTGGTGAG GGCCAGTTTA CTTTTAATGG TGATGGTAGT TCATTGTCTA ACCTGGATAT TGAAGGCAAA GTCGAAGATA TTGTTCTGCA ATTATCACCA ATGAACAAAG TAACAGCAAA AAGTTTTACC ATTGATTCTC TGGCGCGATT AGAAGAAAAG AAATTTCCGG TTGGTGAAAG CGAGTCGAAA TTTAATCAGA TTAACATTAT CAATCACGGG GAAGACGTTG CCCAAATCGA TGCTTTCGTT GCAAAAACCA GGCTGGATCG CGTTAAAGAC AAAGATTATA TCAATGTCAA TCTGACCTAC GAACTTGATA AGTTAACAAA AGGGAATCAG CAACTCGGTA GTGGTGAGTG GTCATTGATT GCTGAATCTA TTGATCCCTC AGCAGTGCGC CAATTTATCA TCCAGTATAA CATTGCGATG CAGAAGCAGC TTGCTGCACA TCCTGAGTTA GCAAACGATG AAGTTGCTCT GCAAGAAGTG AATGCTGCAT TGTTCAAAGA GTATTTACCG TTATTACAAC AAAGTGAGCC GGCCATTAAA CAACCGGTAA GATGGAAGAA CGCACTCGGC GAACTAAATG CCAATCTGGA TATCAGTATT GCCGACCCAG CCAAATCTTC ATCATCCACA AACAAAGATA TCAAATCGCT CAATTTTGAT GTGAAGTTAC CGCTTAATGT CGCCACAGAA ACCGCAAAAC AGCTTAATTT ATCTGAAGGA ATGGATGCGG AAAAAGCGCA AAAGCAGGCT GATAAACAAA TCAGCGGGAT GATGACATTA GGTCAGATGT TTCAGTTAAT CACGATTGAC AACAATACCG CCTCGCTGCA GCTGCGTTAT ACACCGGGTA AAGTTGTTTT TAACGGACAG GAGATGAGCG AAGAAGAATT TATGTCTCGT GCCGGACGTT TTGTTCATTA A
Protein sequence | MIRKSATGVI VALAVIWGGG TWYTGTQIQP GVEKFIKDFN DAKKKGEHAY DMTLSYKNFD KGFFNSRFQM QMTFDNGAPD LNIKPGQKVV FDVDVEHGPL PITMLMHGNV IPALAAAKVN LVNNELTQPL FIAAKNKSPV EATLRFAFGG SFSTTLDVAP AEYGKFSFGE GQFTFNGDGS SLSNLDIEGK VEDIVLQLSP MNKVTAKSFT IDSLARLEEK KFPVGESESK FNQINIINHG EDVAQIDAFV AKTRLDRVKD KDYINVNLTY ELDKLTKGNQ QLGSGEWSLI AESIDPSAVR QFIIQYNIAM QKQLAAHPEL ANDEVALQEV NAALFKEYLP LLQQSEPAIK QPVRWKNALG ELNANLDISI ADPAKSSSST NKDIKSLNFD VKLPLNVATE TAKQLNLSEG MDAEKAQKQA DKQISGMMTL GQMFQLITID NNTASLQLRY TPGKVVFNGQ EMSEEEFMSR AGRFVH
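The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ only in which codons may act as start codons). Whitespace is stripped first because the sequences are displayed in blocks of ten characters.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence above.
print(translate("ATGATACGTA AGTCAGCTAC"))  # MIRKSA
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence gives an end-to-end check; `gc_fraction` can be used the same way against the reported GC content.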