Gene Dtox_0178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0178 
Symbol 
ID8427102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp199343 
End bp200674 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content51% 
IMG OID645032567 
Product4Fe-4S ferredoxin iron-sulfur binding domain protein 
Protein accessionYP_003189756 
Protein GI258513534 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAT ATTTTCACTC TGTAAGACTG GACGAAGATA AGTGCAAAGG CTGCACCAAT 
TGCGTCAAAC GCTGCCCCAC CGAGGCTATC CGGGTAAGGG AAGGACGGGC TCTAATTATT
GAAGAACGCT GTATCGACTG CGGTGAGTGC ATAAAAATTT GCCCGAACCG GGCTAAACTG
GCCACAACTG ACGGGCTGGA GCAGCTGCAA AACTTTCATT ACACCATCGC ACTGCCTGCT
CCTTCTCTTT ATGCGCAATT TGAGCCTAAT ACCAGTCCCG AACAAATATT AGGCGCACTT
TTAAACATAG GCTTTGACGA TATTTTCGAA GTAGCCCTGG GAGCCGAGGC AGTTTCACTG
GCTCTGAGAG ATTACCTTAG CCGCGATCAT CCAAGGCCCA TGATTTCTTC CGCCTGTCCG
GCCGTACTCA GGCTGATGCA GGTGCGTTTT CCCGGACTAT TGCAGCACAT AGTGCCAATC
GAGACACCTA TGGAAATTTC AGCCGGGCTG GCCAGGGAAA AGGCCCGTCA GCAGCATGGC
TTCAGTGACA ACCAGATAGG CGTCTTTTTT ATAACTCCCT GCCCGGCCAA AGTAACTGCC
ATAAAACAAC CATTCGGGGA AAAATCCTAT GTGGACGGCG CTATTTCAAT GTCCGTAATC
TACGGCGAAC TGCTGCATAG TCTTGATTCA CCATCAACGA AACAATTACC TTTATTATCA
GGAGCCAGGG GTGTGGGCTG GGGAAAAGCC GGAGGAGAAA ACCATGCCAT CAGGTGCCAC
TCACATCTGG CTGTAGACGG TATTCACAGT ATTATCAACG TGCTGGAGGA AATTGAAAGA
GGTGGCTTAA GCGATATAGA TTACCTGGAA GCACAGGCCT GCACCGGCGG CTGTATCGGC
GGCCCGCTGG CGCCTCAAAA TCTGTTCGTA GCACGCGTTC GCATGGATAA CCTGGTCAAA
AAGCTGAGTT TGCAACCGGA CAGGCCAAAG AGTTTATCAA CGGATCCGGA GTTCTTACGC
CTGAATAAAC CAATTTTGCC CAGACCCGCC CTAAAACTGG ACCAGGATAT TAACCGGGCT
ATTGAAATGC TGGAAGAAGT TGAAAAGATA ACGGAAGATT TACCCGGTCT GGATTGCGGT
TCTTGCGGCT CTCCAAACTG CCGGGCACTG GCGGAGGATA TTGTCAGGGG CTACGCGGAT
ACCAGCTTCT GCATCTTCAA ACTCCGTCAA AGGCTGCAGA TTTTAGCCCG TGAGATAGTG
GACCTATCCC ACAAGCAGCC CCCGGCCATG GGCAGAGAGA GGGAAAAAGA ACTTAAAGAA
GGTGATGCCT GA
 
Protein sequence
MQKYFHSVRL DEDKCKGCTN CVKRCPTEAI RVREGRALII EERCIDCGEC IKICPNRAKL 
ATTDGLEQLQ NFHYTIALPA PSLYAQFEPN TSPEQILGAL LNIGFDDIFE VALGAEAVSL
ALRDYLSRDH PRPMISSACP AVLRLMQVRF PGLLQHIVPI ETPMEISAGL AREKARQQHG
FSDNQIGVFF ITPCPAKVTA IKQPFGEKSY VDGAISMSVI YGELLHSLDS PSTKQLPLLS
GARGVGWGKA GGENHAIRCH SHLAVDGIHS IINVLEEIER GGLSDIDYLE AQACTGGCIG
GPLAPQNLFV ARVRMDNLVK KLSLQPDRPK SLSTDPEFLR LNKPILPRPA LKLDQDINRA
IEMLEEVEKI TEDLPGLDCG SCGSPNCRAL AEDIVRGYAD TSFCIFKLRQ RLQILAREIV
DLSHKQPPAM GREREKELKE GDA