Gene Adeg_1241 details


Gene Information

Locus tag: Adeg_1241
Symbol: thiH
ID: 8491233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ammonifex degensii KC4
Kingdom: Bacteria
Replicon accession: NC_013385
Strand:
Start bp: 1252404
End bp: 1253882
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 59%
IMG OID: 646359247
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_003239202
Protein GI: 260893105
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
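
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the terminal stop codon


gene_len = span_length(1252404, 1253882)
print(gene_len)                           # 1479
print(expected_protein_length(gene_len))  # 492
```

Both results agree with the Gene Length and Protein Length fields in the record.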


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.721049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCTGA GCTTAGCCGA AAGGACTTGG CGCGACAAGC GCCTGGAGAT GATTAAGAAG 
TACGAGGAAG AAGAGAAGCG TCAGGACTTT ATCAAGGAGG AGGAGATCTG GCGTATCCTG
GAAGAGAAGG CTCATCCTGA GCCGGCTGAG GTGCGGGAGG TATTGGCCAA GGCGCGGGAA
CTCAAGGGGC TTTCGCCCGA AGACACAGCG GTGCTGATCA ACACCAAAGA CCCGGAGCTT
TTGCAGGAGC TCTTTGAAAC CGCCTTCTGG ATCAAGAACC AGGTTTACGG CAACCGCATC
GTGCTCTTCG CCCCCCTTTA CGTCTCCAGC CCCTGCGTCA ACAACTGTGT TTACTGCGGC
TTCCGCCATT CCAACGAAGC AGTTTATAAG CGAACCCTCA CCATGGAGGA GCTGGCTCAG
GAAGTTAGAG TCATTACCCG AGTGGGACAC AAGCGGGTGC TGGCGGTCTT CGGCGAGCAT
CCCGCCAGCG ACGTGGACTA CATCTGCCGC TCGCTCGAGA CCATCTACGC CACTAAGCAC
GGCCGCGACG AGATACGGCG GGTCAATGTC AACGCTGCTC CCATGACGGT GGAAGAGTAC
CGGCAGATCA AGGAAGTAGG CATAGGCACC TATCAGGTAT TCCAGGAGAC TTACCACCAC
GAGACCTACC GGCGTCTACA TCCCCCAGAC ACTTTGAAGC ACTCCTACAA GTGGCGCCTC
TTCGCACTAC ACCGGGCTCA GGAGGCCGGG ATAGACGACG TGGCCATCGG GGTGCTCTTC
GGACTTTTCG ACTGGCGTTT CGAGGTGCTG GGCCTCCTTT ACCACGCCAT GGACCTGGAG
AGGGAGTTCG GCGTGGGGCC TCACACCATC TCCTTCCCCC GGCTGGAGCC GGCGCTCAAC
ACTCCCTTCA CCACTAACTC TCCTTACTTA GTTAGCGACG AAGAGCTCAA GAAGATCATC
GCCATCCTGC GCTGCGCCGT GCCCTACACC GGGCTCATCC TGACCGCCCG CGAGAATCCG
GAGCTTAGGC GCGAGCTCAT AAGACTGGGT GTTTCCCAGA CGGATGCCGG CTCCCGGATC
GCCGTGGGAG GCTACTCGGA AATGGAGAAG GAACACATCT TGGAGCGGCA GCAGTTCAAG
ATCAACGACA CCCGGAGCCT CGACGAGTTC ATCTACGACC TCTGCCAGGA CGGCTACATC
CCCTCCTTCT GTACCGCCGG CTACCGGGCC GGCCGGACCG GGTGCCACTT CATGGAGTTC
GCCAAGAAGG GCTTGGTAAA GAACTTCTGT GTGCCCAACG CCATCCTTAC CTTCAAAGAA
TACCTGCTGG ACTACGCCTC TCCCCGCACC CGCGAGCTAG GGGAGAAAGT CATCGAGCGG
TACCTGCAGG AGGTCGAGGA GCGCCTGCCG CGCCTGGCAG AGAAGGTGCG GGAATACCTG
GCTAGGATGG AGGCGGGAGA GCGTGACCTC TACGTTTAA
 
Protein sequence
MALSLAERTW RDKRLEMIKK YEEEEKRQDF IKEEEIWRIL EEKAHPEPAE VREVLAKARE 
LKGLSPEDTA VLINTKDPEL LQELFETAFW IKNQVYGNRI VLFAPLYVSS PCVNNCVYCG
FRHSNEAVYK RTLTMEELAQ EVRVITRVGH KRVLAVFGEH PASDVDYICR SLETIYATKH
GRDEIRRVNV NAAPMTVEEY RQIKEVGIGT YQVFQETYHH ETYRRLHPPD TLKHSYKWRL
FALHRAQEAG IDDVAIGVLF GLFDWRFEVL GLLYHAMDLE REFGVGPHTI SFPRLEPALN
TPFTTNSPYL VSDEELKKII AILRCAVPYT GLILTARENP ELRRELIRLG VSQTDAGSRI
AVGGYSEMEK EHILERQQFK INDTRSLDEF IYDLCQDGYI PSFCTAGYRA GRTGCHFMEF
AKKGLVKNFC VPNAILTFKE YLLDYASPRT RELGEKVIER YLQEVEERLP RLAEKVREYL
ARMEAGERDL YV
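
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the source database: it assumes the sequence is pasted in the 10-base blocks shown above, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (alternative start codons are not handled here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGCTCTGA GCTTAGCCGA AAGGACTTGG"))  # MALSLAERTW
print(translate("TACGTTTAA"))                         # YV
```

The two fragments reproduce the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.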