Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1746 |
Symbol | thiH |
ID | 8416045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2053474 |
End bp | 2054715 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645024712 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003182100 |
Protein GI | 257791494 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0157605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.01356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAG CAGGCACCCG CTCCGCAACG CCCTCGGCGG TTCCGCACTT CACGCGCCTC GAGGCGGTGG ACCCCGCCGA CGTGGCGCGC GATCGCGGCA TCGACCCTAT GGCCTACCTT CCGGATATGG ACGTCACCGA CTCGCCCGTG CTCGACGAAC TGTTGGCACG CGCGGCGGCG TTCGACTTCG ACGCGGCGAC CGAGGCCGAC GTGCGCGCCG CGCTCGCGGC CGACCGCCTT TCCCCCGAGG GCTTCGGCGC GCTGCTGTCG CCTGCCGCCG AGCCGTTGCT GGAGGAGCTG GCCGCCGCCG CCCGCCGCGC GCGGCGGCGT TGGTTCGGCA GCACCGCGTA CCTGTTCACG CCGCTGTACC TTGCGAACTA CTGCGACAAC CACTGCGTGT ACTGCGGCTT CAACCGCGAC AACGACATAT GTCGCGCCCG TCTCGACCGC GCGGGCATCG CCGCCGAGCT CGACGCCATC GCGGCGACGG GGCTCGAGGA GATCCTGCTG CTCACGGGCG AGGATCGCGA GCGCACCGAC CCCGCCTACA TCGGCGAGGC GTGCAAGCTG GCCGCCGAGC GCTTCCGCAT GGTGGGCGTG GAAGTGTACC CCATGAACGA GGACGAATAC GCCTACCTGC ACGGATGCGG CGTCGATTAC GTCACGGTGT TCCAGGAGAC GTACGACCCC GCGCTTTACG GCAAGCTGCA CCTGGCGGGG CGCAAGCGGG TGTTCCCGTA CCGTGCGAAC GCTCAGGAGC GCGCGATGAG AGGCGGCATG CGGGGCGCGG CGTTCGGCGC CCTCTTGGGT CTCGGCGACT TCCGGCGCGA CGCGTACGCA TGCGGGCTGC ACGCATGGCT CGTGCAGCGC GCTTACCCGC ACGCCGAGCT GTCGCTGTCG TGCCCCCGTC TGCGTCCCAT CGCCGGGAAC GGGTCGCTGG GGCCGCGCGG CGTGGGCGAG CGACAGCTTC TGCAGGTGAT GTGCGCTTAC CGTCTTCTGC TGCCGCAGGC GGGCATCACC ATCTCGTCGC GCGAGCGCGC GGGCTTCCGC GACCGGGCCA TGGGCATCGC CGCCACGAAG ATATCGGCCG GCGTGTCCAC GGGCGTCGGC GAGCATGCGG ACGGATCGCC TGCGGGCGAC GACCAGTTCG AGATCGCCGA CGGCCGCGAC GTGGCGCAGG TGCGCGCCGC GCTGCGCGGC GTCGGTCTCG AGCCGGTGAT GAACGACTAT GTTCGCCTGT GA
|
Protein sequence | MTQAGTRSAT PSAVPHFTRL EAVDPADVAR DRGIDPMAYL PDMDVTDSPV LDELLARAAA FDFDAATEAD VRAALAADRL SPEGFGALLS PAAEPLLEEL AAAARRARRR WFGSTAYLFT PLYLANYCDN HCVYCGFNRD NDICRARLDR AGIAAELDAI AATGLEEILL LTGEDRERTD PAYIGEACKL AAERFRMVGV EVYPMNEDEY AYLHGCGVDY VTVFQETYDP ALYGKLHLAG RKRVFPYRAN AQERAMRGGM RGAAFGALLG LGDFRRDAYA CGLHAWLVQR AYPHAELSLS CPRLRPIAGN GSLGPRGVGE RQLLQVMCAY RLLLPQAGIT ISSRERAGFR DRAMGIAATK ISAGVSTGVG EHADGSPAGD DQFEIADGRD VAQVRAALRG VGLEPVMNDY VRL
|
| |