Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_03866 |
Symbol | thiH |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 4098786 |
End bp | 4099919 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | thiamine biosynthesis protein ThiH |
Protein accession | ACT45658 |
Protein GI | 253979988 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCT TCAGCGATCG CTGGCGACAA CTGGACTGGG ATGACATCCG CCTGCGTATC AACGGCAAAA CGGCTGTTGA CGTAGAGCGG GCGCTAAATG CCTCGCAATT CACCCGCGAC GATATGATGG CGCTGTTATC GCCAGCCGCC AGTGGCTATC TGGAACAACT GGCCCAACGG GCGCAGCGTC TGACCCGTCA GCGATTTGGC AACACAGTTA GTTTCTACGT CCCGCTTTAT CTTTCCAATC TTTGCGCTAA CGACTGCACG TACTGCGGAT TTTCCATGAG TAATCGCATC AAGCGCAAAA CGCTGGATGA AGCGGATATT GCCAGGGAAA GCGCCGCTAT ACGGGAGATG GGCTTTGAAC ATCTGCTATT AGTCACTGGT GAACATCAGG CGAAAGTGGG GATGGATTAC TTTCGTCGTC ATCTCCCCGC CCTGCGTGAA CAGTTCTCTT CACTACAAAT GGAAGTGCAA CCGCTGGCGG AGACGGAATA CGCCGAGTTA AAGCAGCTTG GTCTGGATGG CGTGATGGTT TATCAGGAGA CATATCACGA GGCGACTTAT GCCCGCCATC ATCTGAAAGG CAAAAAACAG GACTTCTTCT GGCGGCTGGA AACGCCGGAT CGGCTGGGGC GTGCGGGGAT TGATAAGATA GGCCTCGGCG CGCTAATTGG CCTTTCCGAC AACTGGCGCG TTGACTGCTA TATGGTTGCC GAACATTTGC TATGGCTGCA ACAGCATTAC TGGCAAAGCC GTTACTCTGT CTCCTTTCCG CGCCTGCGCC CGTGTACTGG CGGCATTGAG CCTGCGTCGA TTATGGATGA ACGCCAGTTA GTGCAAACCA TCTGCGCCTT CCGACTGCTT GCACCGGAGA TTGAACTGTC ACTCTCCACG CGGGAATCAC CGTGGTTTCG CGATCGCGTT ATTCCGCTGG CGATCAATAA CGTCAGCGCC TTCTCGAAAA CGCAGCCAGG TGGCTATGCC GATAATCACC CCGAGTTGGA ACAGTTCTCA CCGCACGACG ATCGCAGACC GGAAGCGGTT GCTGCCGCGT TAACCGCTCA GGGTTTGCAG CCGGTATGGA AAGACTGGGA CAGCTATCTG GGACGGCCCT CGCAAAGACT ATGA
|
Protein sequence | MKTFSDRWRQ LDWDDIRLRI NGKTAVDVER ALNASQFTRD DMMALLSPAA SGYLEQLAQR AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEADI ARESAAIREM GFEHLLLVTG EHQAKVGMDY FRRHLPALRE QFSSLQMEVQ PLAETEYAEL KQLGLDGVMV YQETYHEATY ARHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA EHLLWLQQHY WQSRYSVSFP RLRPCTGGIE PASIMDERQL VQTICAFRLL APEIELSLST RESPWFRDRV IPLAINNVSA FSKTQPGGYA DNHPELEQFS PHDDRRPEAV AAALTAQGLQ PVWKDWDSYL GRPSQRL
|
| |