Gene Daud_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1089 
Symbol 
ID6027346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1146549 
End bp1147655 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID641593904 
Productradical SAM domain-containing protein 
Protein accessionYP_001717233 
Protein GI169831251 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGACTTCT GGTTCGTGGA CGATAGGCTC GAAGCGGTTG CCGGGAAGGT GCGGAATGGT 
GAGCGGCTCG GGTTGGAGGA CGGGGTGATC CTTTTCCGGT CCCCGGACCT GATCGGAGTC
GGGCAGCTGG CCGACGCCGT CCGGCGCAAG AAACACGGCG ACCGGGTATA CTTCGTGGTG
AATCGGCACA TCAACCACAC CAACATCTGC GTGAACGGTT GCCGGTTTTG CGCTTTCGGC
AAGGAGCCGG GCGAGCCGGG CGGCTACGTG ATGTCCCTGG ACGAGATCGA GGCCCGGGCC
CGCGAATCCT GGGCACTCGG CATCTCCGAA GTGCACGTCG TTGGGGGTCT GCACCCCGAC
CTGAACCTGG ACTACTACCG GGAGATGCTC ACCCGGCTCC GGAACACCGT CCCCGGCGTG
ATCATCCAGG CCCTGACCGC GGTCGAGGTG GACTACCTAG CCGGCCTGCA CGGGCTTGAG
CTGGAGGATG TACTTACCGA ACTCCGGGCG GCTGGCCTTG ATTCCCTGCC TGGCGGCGGG
GCCGAGGTTT TCGCCCCCCG GGTGCGGGAG TCGGTCTGCC CGAAAAAGAT CAGCGGTGCA
CGGTGGCTCG CCGTACACGA GACGGCGCAC CGGCTGGGCA TCCGCACCAA CGCCACCATG
CTCTACGGGC ATGTGGAAAC GCTGGAGGAG CGAGTCGAGC ACCTCCTACA ACTGCGGGAA
CTTCAGGATC GGACCGGGGG CTTTCAGGCT TTCATCCCGC TGGCCTTCCA CCCGTGGAAC
ACCGCCCTCG AACCGGAGGT GCCCGCCGGC ACTACCGGGT ACGACGATCT GAAAATGCTG
GCGGTGGCGC GGCTCATGCT CGACAACTTC GACCACATCA AGGCCTTCTG GGTGATGATC
GGACCCAAGC TGGCCCAGAT TTCCCTAAAC TTCGGGGTCA ACGACATCGA CGGCACGGTG
GTCGAGGAAC GAATCACCCG CGCGGCCGGT GGGCAGACGG CCCATGGTCT GGAGCGCGGG
GAACTCTTGC GGCTCATCCG GGCGGCGGGC CGGGTGCCGG TGGAACGCGA TACGTTGTAT
AACGTGGTCA GGGAGGATTT CGCCTGA
 
Protein sequence
MDFWFVDDRL EAVAGKVRNG ERLGLEDGVI LFRSPDLIGV GQLADAVRRK KHGDRVYFVV 
NRHINHTNIC VNGCRFCAFG KEPGEPGGYV MSLDEIEARA RESWALGISE VHVVGGLHPD
LNLDYYREML TRLRNTVPGV IIQALTAVEV DYLAGLHGLE LEDVLTELRA AGLDSLPGGG
AEVFAPRVRE SVCPKKISGA RWLAVHETAH RLGIRTNATM LYGHVETLEE RVEHLLQLRE
LQDRTGGFQA FIPLAFHPWN TALEPEVPAG TTGYDDLKML AVARLMLDNF DHIKAFWVMI
GPKLAQISLN FGVNDIDGTV VEERITRAAG GQTAHGLERG ELLRLIRAAG RVPVERDTLY
NVVREDFA