Gene Daud_0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0163 
SymbolthiH 
ID6027285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp186717 
End bp188192 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content61% 
IMG OID641593015 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001716359 
Protein GI169830377 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACCA CTACGGAAAA AGCCTGGCTG GACGAACGCC TGGCATACAT CAAGGCCTAT 
GAGGCTTCCG AAAAAAGGCC GAGCTTCGTG AGCGACGCGG AAATTGAGGC CGTCTTGAAA
CGCAAGGCCG ATCCCGAGCG GCTGGAAGTG GAAGAGGTTC TAAGCAAGGC CAAGGAGCTT
CACGGACTAA CCCCGGACGA CGCAGCCGTA TTGCTGAACA ACCGGGACCC GGAACTCTGG
GCGGAGATTT TTGCGACCGC GCACTGGATC AAGCAGGAGG TCTACGGTAA CCGGATCGTT
CTGTTCGCCC CCCTCTACAT TTCCAGTCCG TGCGTAAACA ACTGCGCTTA CTGCGGCTTC
CGGCACAGCA ATGATCAGGT CGCCAAAAAG ACCCTTTCGC CGGCCGAACT GGAGGCCGAA
GTAAAGGCGC TGATCACCAA GGGGCACAAA CGGCTGATCG TGGTTTATGG TGAGCACCCG
GCCAGCGACG TCGATTTTAT GTGCCGGACC ATCGAAACGA TCTACGCCGT CAAAGAAGGC
CGGGGCGAGA TCCGCAGGGT CAACGTCAAC GCCGCGCCCC TAACCGTGGA AGAATACCGG
CGGCTGAAAG AAGTCGGCAT CGGTACCTAT CAGGTGTTTC AGGAGACGTA CCACCTGACC
ACCTACCGGA AGATGCACCC GGCGAACACC CTCAAGGGTT CGTTCCGCTG GCGGTTGTTC
GCCCTGCACC GGGCGCAGGA AGCAGGAATC GACGACGTGG CCGTCGGCGC GCTCTTCGGG
CTGTATGACT GGCGCTTCGA GGTACTGGGC CTCCTTTACC ACGCCCTGGA CCTGGAGCGG
GAGTTCGGCG TGGGCCCGCA CACCATCTCG GTCCCCCGGC TGGAGCCGGC CTTGAACACC
CCGCTGACCA CCAGCTCACC CTACCGGGTG GCCGATGAGG ACTTCAAGAA GGCGGTCACC
GTACTGCGGT GCGCCGTGCC TTACACCGGT ATCATCCTCA CCTGCCGCGA AAAGCCCGCT
TTGCGGCGGG AGGTCATCGC GCTGGGCGTC TCGCAGGTCG ACGCCGGGTC CCGGACCGCC
GTGGGCGGCT ACGCGGAGAT GGAACGGGAA CACATCCCCG ACCGCGAGCA GTTCCAACTG
GCGGACACCC GCTCCCTGGA CGAGTATATT CTGGAACTGT GCCGGGACGG GTACATCCCT
TCCTTCTGCA CGGCGGGATA CCGTACCGGT CGCACCGGCT GCCACTTTAT GTCCTTTGCC
AAGCAGGGTT TGATTAAGAA TTTCTGCCTG CCGAACGCCG TGCTCACTTT CAAGGAATAC
CTTCTCGACT ACGCTTCGCC CGAAACCAGG GATGCCGGGG AAAAAACCAT CGCCCGGCAC
GTCGAAGACT TTGCCCGCCG GATGCCGCAA CGCGCCGAAA AACTGAAAGA GATGCTTTCC
CGGATGGAGG GTGGCCAGCG TGACCTGTAC TTTTAA
 
Protein sequence
MLTTTEKAWL DERLAYIKAY EASEKRPSFV SDAEIEAVLK RKADPERLEV EEVLSKAKEL 
HGLTPDDAAV LLNNRDPELW AEIFATAHWI KQEVYGNRIV LFAPLYISSP CVNNCAYCGF
RHSNDQVAKK TLSPAELEAE VKALITKGHK RLIVVYGEHP ASDVDFMCRT IETIYAVKEG
RGEIRRVNVN AAPLTVEEYR RLKEVGIGTY QVFQETYHLT TYRKMHPANT LKGSFRWRLF
ALHRAQEAGI DDVAVGALFG LYDWRFEVLG LLYHALDLER EFGVGPHTIS VPRLEPALNT
PLTTSSPYRV ADEDFKKAVT VLRCAVPYTG IILTCREKPA LRREVIALGV SQVDAGSRTA
VGGYAEMERE HIPDREQFQL ADTRSLDEYI LELCRDGYIP SFCTAGYRTG RTGCHFMSFA
KQGLIKNFCL PNAVLTFKEY LLDYASPETR DAGEKTIARH VEDFARRMPQ RAEKLKEMLS
RMEGGQRDLY F