Gene Elen_1746 details

Gene Information

Locus tag: Elen_1746
Symbol: thiH
ID: 8416045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2053474
End bp: 2054715
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 71%
IMG OID: 645024712
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_003182100
Protein GI: 257791494
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
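
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function name check_gene_record is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> dict:
    """Return simple consistency checks for a CDS record.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    span_bp = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_length_bp // 3      # number of codons, stop codon included
    return {
        "span_matches_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
checks = check_gene_record(2053474, 2054715, 1242, 413)
for name, ok in checks.items():
    print(f"{name}: {ok}")
```

For this record the span is 2054715 - 2053474 + 1 = 1242 bp, which is 414 codons: 413 amino acids plus one stop codon.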


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0157605
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.01356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAG CAGGCACCCG CTCCGCAACG CCCTCGGCGG TTCCGCACTT CACGCGCCTC 
GAGGCGGTGG ACCCCGCCGA CGTGGCGCGC GATCGCGGCA TCGACCCTAT GGCCTACCTT
CCGGATATGG ACGTCACCGA CTCGCCCGTG CTCGACGAAC TGTTGGCACG CGCGGCGGCG
TTCGACTTCG ACGCGGCGAC CGAGGCCGAC GTGCGCGCCG CGCTCGCGGC CGACCGCCTT
TCCCCCGAGG GCTTCGGCGC GCTGCTGTCG CCTGCCGCCG AGCCGTTGCT GGAGGAGCTG
GCCGCCGCCG CCCGCCGCGC GCGGCGGCGT TGGTTCGGCA GCACCGCGTA CCTGTTCACG
CCGCTGTACC TTGCGAACTA CTGCGACAAC CACTGCGTGT ACTGCGGCTT CAACCGCGAC
AACGACATAT GTCGCGCCCG TCTCGACCGC GCGGGCATCG CCGCCGAGCT CGACGCCATC
GCGGCGACGG GGCTCGAGGA GATCCTGCTG CTCACGGGCG AGGATCGCGA GCGCACCGAC
CCCGCCTACA TCGGCGAGGC GTGCAAGCTG GCCGCCGAGC GCTTCCGCAT GGTGGGCGTG
GAAGTGTACC CCATGAACGA GGACGAATAC GCCTACCTGC ACGGATGCGG CGTCGATTAC
GTCACGGTGT TCCAGGAGAC GTACGACCCC GCGCTTTACG GCAAGCTGCA CCTGGCGGGG
CGCAAGCGGG TGTTCCCGTA CCGTGCGAAC GCTCAGGAGC GCGCGATGAG AGGCGGCATG
CGGGGCGCGG CGTTCGGCGC CCTCTTGGGT CTCGGCGACT TCCGGCGCGA CGCGTACGCA
TGCGGGCTGC ACGCATGGCT CGTGCAGCGC GCTTACCCGC ACGCCGAGCT GTCGCTGTCG
TGCCCCCGTC TGCGTCCCAT CGCCGGGAAC GGGTCGCTGG GGCCGCGCGG CGTGGGCGAG
CGACAGCTTC TGCAGGTGAT GTGCGCTTAC CGTCTTCTGC TGCCGCAGGC GGGCATCACC
ATCTCGTCGC GCGAGCGCGC GGGCTTCCGC GACCGGGCCA TGGGCATCGC CGCCACGAAG
ATATCGGCCG GCGTGTCCAC GGGCGTCGGC GAGCATGCGG ACGGATCGCC TGCGGGCGAC
GACCAGTTCG AGATCGCCGA CGGCCGCGAC GTGGCGCAGG TGCGCGCCGC GCTGCGCGGC
GTCGGTCTCG AGCCGGTGAT GAACGACTAT GTTCGCCTGT GA
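
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing the length or GC content reported in the Gene Information fields. This is a minimal sketch; the helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATGACGCAAG CAGGCACCCG CTCCGCAACG CCCTCGGCGG TTCCGCACTT CACGCGCCTC"
seq = clean_sequence(first_line)
print(len(seq), "bp,", round(100 * gc_fraction(seq)), "% GC")
```

Passing the whole listing to clean_sequence instead of one line gives the values to compare with the Gene Length and GC content fields.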
 
Protein sequence
MTQAGTRSAT PSAVPHFTRL EAVDPADVAR DRGIDPMAYL PDMDVTDSPV LDELLARAAA 
FDFDAATEAD VRAALAADRL SPEGFGALLS PAAEPLLEEL AAAARRARRR WFGSTAYLFT
PLYLANYCDN HCVYCGFNRD NDICRARLDR AGIAAELDAI AATGLEEILL LTGEDRERTD
PAYIGEACKL AAERFRMVGV EVYPMNEDEY AYLHGCGVDY VTVFQETYDP ALYGKLHLAG
RKRVFPYRAN AQERAMRGGM RGAAFGALLG LGDFRRDAYA CGLHAWLVQR AYPHAELSLS
CPRLRPIAGN GSLGPRGVGE RQLLQVMCAY RLLLPQAGIT ISSRERAGFR DRAMGIAATK
ISAGVSTGVG EHADGSPAGD DQFEIADGRD VAQVRAALRG VGLEPVMNDY VRL
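
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table by hand rather than using a library; it relies on table 11 sharing its codon-to-amino-acid assignments with the standard code, and it does not handle the alternative start codons of table 11 (not needed here, since the gene starts with ATG). The name translate is a hypothetical helper.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) with their amino acids; '*' is a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence up to the first stop codon."""
    seq = "".join(seq.split()).upper()   # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):  # ignore a trailing partial codon
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; matches the start of the protein.
print(translate("ATGACGCAAG CAGGCACCCG"))  # MTQAGT
```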