Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd703_3720 |
Symbol | thiH |
ID | 8087090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya dadantii Ech703 |
Kingdom | Bacteria |
Replicon accession | NC_012880 |
Strand | + |
Start bp | 4310150 |
End bp | 4311283 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644837798 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002989297 |
Protein GI | 242241116 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.526508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACT CGTTTGCACA ACGCTGGCGA CAGCTGGACT GGGACGACCT GACGCTGCGC ATCAACGGTA AAACCCGCGC CGACGTCGAA CAGGCGCTGA ACGCCGAACA CCCCGACATG GAGGATTTCA TGGCGCTGCT GTCTCCGGCC GCCGCCTCTT ATCTGGAACC GCTGGCCCAA CGCGCCCAAC AGCTGACCCG GCAGCGCTTC GGCAATACCG TCGGCTTTTA CGTGCCGCTC TATCTCTCCA ATCTCTGCGC CAACGACTGC ACCTACTGCG GCTTCTCCAT GAGCAACCAC CTCAAGCGCA AAACCTTGAA CGACGAGGAA ATTCGGCGGG AATGCGGTGC CATCAAGGAA TTGGGGTTCG ACCACCTGCT GCTAGTGACG GGCGAGCACC AACGTAAGGT GGGGATGGAC TATTTCCGGC GCGTATTTCC ACTGATTCGC CCGCACTTCA GCGCGCTGAT GATGGAAGTG CAGCCGCTGG CGCAGGAAGA GTACGCGGAG CTGAAAACGC TGGGGCTGGA CGGGGTGATG GTGTATCAGG AAACCTACCA TCAGGCGACT TACGCCCGCC ACCACCTGCA CGGGAAAAAA CAGGATTTCG CCTGGCGGCT GAACACCCCG GATCGGCTGG GCCGCGCCGG GATCGACAAG ATCGGGCTGG GCGCGTTGAT CGGCCTGTCC GACAGTTGGC GCACCGACTG CTATATGGTG GCGGAACACT TGCGCCATCT GCAACGCCAC TATTGGCAAA GCCGCTACTC GTTGTCGTTT CCGCGCCTGC GCCCGTGCGC CGGCGGCATC GAACCCGCTT CGCTGATGGA TGAAGCGCAA CTGATGCAGA CGATCTGCGC CTTCCGGCTG CTGGCGCCCG ATGTAGAACT GTCGCTGTCG ACCCGGGAAT CGCCCCATTT TCGCGACCAC ATGATCCCGA TCGCCATCAA CAACGTCAGC GCTTTCTCCA AAACCCAGCC GGGCGGCTAC GCCGACGGCC ATGCGGAATT GGAGCAGTTC TCCCCGCACG ACGACCGCCG GCCGGAAGCA GTGGCCGATG TCTTGACGCG CGCCGGCCTG CAGCCGGTCT GGAAAGACTG GGAGGGATAT CTGGGCCGGA CGATCGCAAA CTGA
|
Protein sequence | MDNSFAQRWR QLDWDDLTLR INGKTRADVE QALNAEHPDM EDFMALLSPA AASYLEPLAQ RAQQLTRQRF GNTVGFYVPL YLSNLCANDC TYCGFSMSNH LKRKTLNDEE IRRECGAIKE LGFDHLLLVT GEHQRKVGMD YFRRVFPLIR PHFSALMMEV QPLAQEEYAE LKTLGLDGVM VYQETYHQAT YARHHLHGKK QDFAWRLNTP DRLGRAGIDK IGLGALIGLS DSWRTDCYMV AEHLRHLQRH YWQSRYSLSF PRLRPCAGGI EPASLMDEAQ LMQTICAFRL LAPDVELSLS TRESPHFRDH MIPIAINNVS AFSKTQPGGY ADGHAELEQF SPHDDRRPEA VADVLTRAGL QPVWKDWEGY LGRTIAN
|
| |