Gene Information |
Locus tag | Dd1591_3875 |
Symbol | thiH |
ID | 8118832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 4378780 |
End bp | 4379925 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644854246 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003006154 |
Protein GI | 251791433 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
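The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions agree with the listed values but are not stated in the record itself. The function names are hypothetical.

```python
# Sanity check of the record's length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS includes exactly one terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp / End bp fields above.
length = gene_length_bp(4378780, 4379925)
print(length, protein_length_aa(length))  # prints: 1146 381
```

Both results match the Gene Length (1146 bp) and Protein Length (381 aa) fields.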
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCTC ACTCTCAGCC GTCATTTGAA CAACGCTGGC AACAGCTGGA ATGGCACGAC CTGACGCTAC GCATCAACAG CAAAACCGAC GCCGACGTAG AACGGGCGTT ATGCGCGGAT CGACTCACGC GTGAAGATAT GATGGCGCTG CTGTCGCCCG CCGCCAGCCG CTGGCTGGAA CCACTGGCGC AACGGGCGCA ACAGCTGACG CGCCAGCGCT TCGGCAACAC CGTGAGTTTC TACGTACCGC TGTATCTCTC CAACCTGTGT GCCAACGACT GTACCTATTG CGGTTTTTCC ATGAGCAACC ACCTCAAACG CAAAACGCTG GACGAACCGG AGATCCTGCG CGAGTGCGAG GCGATCAAAT CGCTGGGATT CGATCACCTG CTGTTGGTGA CCGGCGAACA CCAACGCAAA GTGGGGATGG ACTATTTTCG CCGGATGCTG CCGTTGATTC GCCCGCAGTT CAGTTCAGTG ATGATGGAAG TGCAGCCGCT GTCGCAAGCG GAGTACGCCG AGTTGAAAAC CCTCGGGTTG GATGGGGTTA TGGTGTATCA GGAAACCTAT CACGCGGCGA CCTACGCCCG CCATCACCTG CGCGGCCACA AGCAGGATTT CGCCTGGCGG CTGGCAACGC CGGACAGGCT GGGGCGCGCC GGTATCGACA AGATCGGGCT GGGGGCGTTG ATCGGGCTGT CCGACAGTTG GCGTACCGAC TGCTATATGG TGGCCGAGCA CCTGCTGTAT TTGCAGCAAA CGTACTGGCA GAGCCGCTAT TCGGTGTCGT TCCCGCGCCT GCGCCCTTGT GCCGGCGGCA TCGAACCGGC TTCGCTGATG GATGAGGCGC AACTGATGCA GGTGATCTGC GCCTTCCGGC TACTGGCGCC CGATGTGGAG CTGTCGCTCT CCACCCGCGA GTCGCCGTTC TTCCGCGACC ACATGATCCC GGTAGCCATC AACAACGTCA GCGCGTTTTC CAAAACCCAA CCGGGCGGCT ACGCCGACGA TCATCCTGAA CTGGAACAAT TCACCCCGCA CGATAACCGC CGTCCGCAAG CGGTGGCCGA TGCCCTGACC CAGGCCGGTC TGCAACCGGT GTGGAAAGAC TGGGACGGTT ATCTGGGAAG GCCAGACGTC AACTAA
|
Protein sequence | MSAHSQPSFE QRWQQLEWHD LTLRINSKTD ADVERALCAD RLTREDMMAL LSPAASRWLE PLAQRAQQLT RQRFGNTVSF YVPLYLSNLC ANDCTYCGFS MSNHLKRKTL DEPEILRECE AIKSLGFDHL LLVTGEHQRK VGMDYFRRML PLIRPQFSSV MMEVQPLSQA EYAELKTLGL DGVMVYQETY HAATYARHHL RGHKQDFAWR LATPDRLGRA GIDKIGLGAL IGLSDSWRTD CYMVAEHLLY LQQTYWQSRY SVSFPRLRPC AGGIEPASLM DEAQLMQVIC AFRLLAPDVE LSLSTRESPF FRDHMIPVAI NNVSAFSKTQ PGGYADDHPE LEQFTPHDNR RPQAVADALT QAGLQPVWKD WDGYLGRPDV N
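The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Minimal translation sketch, assuming the standard codon assignments
# (shared by translation table 11). Helper names are illustrative only.

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-letter groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGCGCTC ACTCTCAGCC GTCATTTGAA"))  # prints: MSAHSQPSFE
```

The printed residues match the first group of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.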