Gene Information |
Locus tag | B21_01183 |
Symbol | dhaH |
ID | 8112837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1235252 |
End bp | 1236670 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644847436 |
Product | hypothetical protein |
Protein accession | YP_002999009 |
Protein GI | 251784705 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.500726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAACC TGGTCATAGT TTCACATAGC AGCCGACTGG GAGAAGGTGT CGGTGAATTA GCCCGTCAGA TGTTAATGAG TGATAGTTGT AAAATCGCCA TTGCCGCGGG AATTGACGAT CCACAAAATC CCATTGGTAC CGATGCCGTC AAAGTGATGG AGGCCATCGA ATCTGTTGCT GATGCCGACC ATGTGCTGGT CATGATGGAT ATGGGTAGCG CATTATTGAG TGCTGAAACT GCGCTGGAAT TGCTGGCTCC CGAGATCGCC GCAAAAGTAC GTTTGTGTGC TGCGCCGTTG GTCGAAGGTA CACTGGCAGC AACGGTCAGC GCGGCCTCGG GGGCGGATAT CGACAAAGTT ATCTTTGACG CCATGCATGC GCTGGAAGCC AAACGTGAAC AACTGGGTTT ACCGTCCTCC GACACTGAAA TCTCTGACAC ATGTCCTGCG TACGATGAAG AAGCCCGTTC TCTGGCGGTG GTCATAAAAA ACCGTAACGG CCTGCATGTA CGTCCGGCCT CCCGGCTGGT TTATACCTTA TCGACATTTA ATGCCGATAT GTTGCTGGAA AAAAACGGCA AATGCGTCAC ACCAGAGAGT ATTAACCAGA TTGCGTTACT ACAAGTTCGC TATAACGATA CGCTGCGCCT GATTGCGAAA GGGCCAGAAG CTGAAGAGGC ACTGATCGCT TTCCGTCAGC TGGCTGAAGA TAACTTTGGT GAAACGGAGG AAGTCGCTCC ACCTACTCTG CGTCCCGTTC CGCCTGTTTC GGGTAAAGCC TTTTATTATC AACCAGTTTT ATGTACGGTA CAGGCAAAAT CAACCCTGAC CGTGGAAGAA GAACAAGATC GATTACGCCA GGCTATTGAC TTCACGTTAT TAGATCTGAT GACGTTAACA GCGAAAGCAG AAGCCAGCGG GCTTGACGAT ATTGCCGCAA TCTTTTCTGG TCACCATACA CTGTTAGATG ATCCGGAACT GCTGGCGGCG GCAAGCGAAC TCCTTCAGCA TGAACATTGC ACGGCAGAAT ATGCCTGGCA GCAAGTTCTT AAAGAACTTA GCCAGCAATA CCAGCAACTG GATGATGAAT ATCTACAAGC TCGCTATATT GATGTGGACG ATCTTCTGCA TCGCACCCTG GTCCACCTGA CCCAAACGAA AGAAGAACTC CCGCAGTTTA ACTCGCCAAC TATTCTACTG GCGGAGAACA TTTATCCTTC CACAGTACTG CAACTGGATC CGGCGGTTGT AAAAGGTATC TGCCTTAGCG CCGGAAGTCC GGTATCCCAC AGCGCCCTAA TCGCCCGTGA ACTGGGGATT GGCTGGATTT GCCAGCAGGG TGAGAAACTG TATGCGATAC AACCAGAAGA AACGCTAACG CTGGACGTTA AAACGCAACG TTTCAACCGT CAGGGTTAA
|
Protein sequence | MVNLVIVSHS SRLGEGVGEL ARQMLMSDSC KIAIAAGIDD PQNPIGTDAV KVMEAIESVA DADHVLVMMD MGSALLSAET ALELLAPEIA AKVRLCAAPL VEGTLAATVS AASGADIDKV IFDAMHALEA KREQLGLPSS DTEISDTCPA YDEEARSLAV VIKNRNGLHV RPASRLVYTL STFNADMLLE KNGKCVTPES INQIALLQVR YNDTLRLIAK GPEAEEALIA FRQLAEDNFG ETEEVAPPTL RPVPPVSGKA FYYQPVLCTV QAKSTLTVEE EQDRLRQAID FTLLDLMTLT AKAEASGLDD IAAIFSGHHT LLDDPELLAA ASELLQHEHC TAEYAWQQVL KELSQQYQQL DDEYLQARYI DVDDLLHRTL VHLTQTKEEL PQFNSPTILL AENIYPSTVL QLDPAVVKGI CLSAGSPVSH SALIARELGI GWICQQGEKL YAIQPEETLT LDVKTQRFNR QG
|
| |
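The Start bp, End bp, Gene Length, and Protein Length fields in this record are related by simple arithmetic, and GC content can be recomputed from the gene sequence. The sketch below shows one way to cross-check them. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; both assumptions agree with the values listed here but are not stated by the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; spaces and other non-ACGT characters are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
print(gene_length(1235252, 1236670))  # 1419
print(protein_length(1419))           # 472
```

Passing the full Gene sequence field to `gc_percent` (the 10-base grouping spaces are ignored) gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.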