Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02008 |
Symbol | yehP |
ID | 8114673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2102142 |
End bp | 2103278 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644848220 |
Product | hypothetical protein |
Protein accession | YP_002999793 |
Protein GI | 251785489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAC TGAACGATCT TCTGACCACC CGTGAGCTAC AACGCTGGCG ATTAATTCTT GGCGAAGCGG CAGAAACGAC GCTTTGTGGG CTGGATGACA ACGCCCGGCA GATAGACCAC GCGCTGGAGT GGCTGTATGG GCGCGATCCT GAACGGCTCC AGCGTGGTGA ACGCTCCGGT GGATTAGGTG GCTCAAATCT CACCACCCCT GAGTGGATCA ACAGTATTCA CACGCTGTTT CCGCAACAGG TGATTGAGCG GCTGGAAAGC GATGCCGTAC TGCGCTACGG CATTGAAGAT GTGGTGACAA ATCTCGACGT GCTGGAACGT ATGCAGCCTT CCGAAAGCCT GCTACGCGCC GTTTTGCACA CCAAACATCT GATGAATCCC GAAGTACTGG CTGCCGCCCG CCAGATAGTG CGCCAGGTTG TTGAAGAAAT TATGGCGCGA CTGGCAAAGG AAGTTCGCCA GGCTTTTTCT GGTGTCCGCG ATCGCCGTCG CCGTTCATTT ATTCCACTGG CGCGAAACTT TGATTTCAAA AGTACTCTGC GCGCCAACCT GCAACACTGG CACCCGCAAC ACGGCAAGTT GTATATCGAA TCCCCCCGCT TTAACAGCCG CATTAAACGC CAAAGCGAAC AATGGCAACT GGTCTTACTG GTTGATCAAA GCGGATCGAT GGTCGATTCG GTGATCCACT CTGCGGTGAT AGCGGCCTGT TTGTGGCAGT TACCCGGCAT TCGTACCCAT CTGGTGGCGT TTGACACAAG CGTCGTTGAT CTCACGGCAG ACGTTGCCGA TCCGGTAGAG TTATTAATGA AAGTACAGTT GGGCGGCGGG ACCAATATCG CCAGTGCCGT GGAGTATGGT CGGCAACTTA TTGAACAACC AGCGAAAAGC GTCATTATCC TCGTGAGCGA TTTTTACGAA GGGGGTTCAT CATCATTACT GACGCATCAG GTGAAAAAGT GTGTCCAGAG CGGCATCAAA GTGCTGGGAC TGGCAGCGCT CGATAGCACC GCAACACCTT GCTATGACCG CGATACGGCC CAGGCGCTGG TTAATGTCGG CGCACAAATA GCCGCCATGA CGCCGGGCGA GCTGGCATCA TGGCTTGCGG AGAATCTTCA GTCATGA
|
Protein sequence | MSELNDLLTT RELQRWRLIL GEAAETTLCG LDDNARQIDH ALEWLYGRDP ERLQRGERSG GLGGSNLTTP EWINSIHTLF PQQVIERLES DAVLRYGIED VVTNLDVLER MQPSESLLRA VLHTKHLMNP EVLAAARQIV RQVVEEIMAR LAKEVRQAFS GVRDRRRRSF IPLARNFDFK STLRANLQHW HPQHGKLYIE SPRFNSRIKR QSEQWQLVLL VDQSGSMVDS VIHSAVIAAC LWQLPGIRTH LVAFDTSVVD LTADVADPVE LLMKVQLGGG TNIASAVEYG RQLIEQPAKS VIILVSDFYE GGSSSLLTHQ VKKCVQSGIK VLGLAALDST ATPCYDRDTA QALVNVGAQI AAMTPGELAS WLAENLQS
|
| |