Gene Dtox_3829 details


Gene Information

Locus tag: Dtox_3829
Symbol:
ID: 8430843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4013122
End bp: 4014999
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 49%
IMG OID: 645036057
Product: hydrogenase, Fe-only
Protein accession: YP_003193156
Protein GI: 258516934
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
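
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4013122, 4014999)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```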


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAGG TTCAGGAGTA CACGCCGGGT CCGGTCATTG AGGGTGCGGG CCACCTGTTT 
GAAAAAGATG AAAAGAACTT AGAAAAGGTG CGGCTGTTTA TTGACGGTAA GGAAGTGACG
GCGCTAAAGG GAATTTCTGT ACTGGAAGCA GCCAGATCTG TGGGCATAAC CATACCCAGT
CTATGTTATT TAAAGGGTAT TAATGAGATA GGTTCCTGCC GGGTTTGCAT GGTGGAGGTG
GAAGTTAACG GGGTTATGAC TTTGCAGGCT TCCTGTGTGT ATCCGGTTGC TGAGGGAATC
AGGGTGTACA CCAATACGCC GCAGGCCCGC CGTGTCAGGC GGACTATGGT GGAACTCCTG
CTTTCCGATC ACCACAGGGA GTGCACTGCT TGCATAAGAA ACTTAAACTG TGAACTGCAG
AACCTGGCTG ATGACCTGGG CATCAGAAAT ATTAAATACA CCGGTGAAAC GAATTATGTT
CCCATATTTG CAAATAACCC TTTTATCATG AGGGACTACA ATAAGTGCAT TAAATGCCGC
CGCTGTGAGG CTATCTGCAG CAAGGTTCAG GAGGTAAACG TGTATTCTGC TTTAGGCCGG
GGGTTTGAAG TAAAAATAGC GCCTGCTTTT ATACAGGATT TGTCCGCCGT CGCCTGCATT
ACCTGCGGGC AGTGTGTTAT TGCCTGTCCC ACCGGTTCAC TGGTGGAAAA AGAGTGCATT
GAAGAAGTAT GGGATGCTCT GGCAGATAAG GATAAGTATG TGGTGGTGCA AACCGCCCCG
TCCATACAGG TTACTTTAGG AGAGTCTTTC GGTCTTCCGG TAGGGACAAT TGTTACCGGA
AAACTTGTAT CGGCACTGAG GCGGATGGGG TTTGACAAAG TTTTTGCAAC CGATTTTACT
GCTGATTTAA CTATTATGGA AGAGGCGCAT GAACTGTTAG ACAGACTAAA CGGCCATGGC
AGGCTGCCGC TTTTGTCTTC CTGCAGTCCC GGATGGGTCA AATTCTGCGA GCATTTTTAT
CCCGAGTTCA CGGAAAATAT GTCTACCTGC AAGTCACCGC ATGAAATGTT CGGTGCCTTG
ACCAAGACCT ATTTTGCCGA GAAAGAGGGG CTTGATCCTG AGAAAATTGT GGTGGTGGCC
ATTATGCCTT GCACTGCCAA GAAGTATGAG GCCTCACGTC CGGAATTCGG CAATAAAAGA
TTTAAAGATG TAGATTGGGT GCTGACTACC AGGGAACTGT CCCGCATGAT TCGACAGATG
GGAATAAACT TTACTGAACT TCCTGATGAG GACTATGATC AGCCTATGGG CATGTCCACA
GGGGCGGGAG CAATTTTTGG CGCCACGGGC GGAGTAATTG AAGCCGCTGT GCGTACAGCT
TATTACCTGT CAACCGGAGA AGAAATGGAT TTATTGGACT ACAGTGAATT TGAGGGAATC
AGCGGCTTGA AAATAGCCGA GGTAAAGCTG AAGGACAGAA CTATTAAGGT AGCGATTGCC
CACGGTACCG GCAATGCCCG CCGCCTGCTG GACAGATTGA AGGCCGGAGA AGAATTTCAT
TATGCCGAAA TTATGGCCTG CCCCGGAGGT TGTGTGGGCG GCGGCGGACA GCCTATTTTC
GGAGGCAGGG AGCACAAGGA AATCTCGCTT GATTACAGGC ATAACAGGGC TGACGCCCTG
CACAGGATTG ATCTGAGCAA AGAACTTCGC CGATCACATG AAAACCCGGC GGTCAAGAAG
ATATATGAGG AATATTTAGG GCGTCCCCTG GGGGAGAAAT CCAGGGAATT GCTGCATACC
TGCTTTACCC CAAGGGGCAA AATGCCCGGT TTTGCCTGGC AGGATTTGCC GCCGGTGGAT
CATAAATGTT TGCATTGA
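
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to strip the block formatting and compute a GC percentage; the helper names are hypothetical, and the example uses only the first row of the sequence, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGCAGG TTCAGGAGTA CACGCCGGGT CCGGTCATTG AGGGTGCGGG CCACCTGTTT"
seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` gives a string whose length can be compared with the listed gene length.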
 
Protein sequence
MPQVQEYTPG PVIEGAGHLF EKDEKNLEKV RLFIDGKEVT ALKGISVLEA ARSVGITIPS 
LCYLKGINEI GSCRVCMVEV EVNGVMTLQA SCVYPVAEGI RVYTNTPQAR RVRRTMVELL
LSDHHRECTA CIRNLNCELQ NLADDLGIRN IKYTGETNYV PIFANNPFIM RDYNKCIKCR
RCEAICSKVQ EVNVYSALGR GFEVKIAPAF IQDLSAVACI TCGQCVIACP TGSLVEKECI
EEVWDALADK DKYVVVQTAP SIQVTLGESF GLPVGTIVTG KLVSALRRMG FDKVFATDFT
ADLTIMEEAH ELLDRLNGHG RLPLLSSCSP GWVKFCEHFY PEFTENMSTC KSPHEMFGAL
TKTYFAEKEG LDPEKIVVVA IMPCTAKKYE ASRPEFGNKR FKDVDWVLTT RELSRMIRQM
GINFTELPDE DYDQPMGMST GAGAIFGATG GVIEAAVRTA YYLSTGEEMD LLDYSEFEGI
SGLKIAEVKL KDRTIKVAIA HGTGNARRLL DRLKAGEEFH YAEIMACPGG CVGGGGQPIF
GGREHKEISL DYRHNRADAL HRIDLSKELR RSHENPAVKK IYEEYLGRPL GEKSRELLHT
CFTPRGKMPG FAWQDLPPVD HKCLH
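
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence from the record.
print(translate("ATGCCGCAGG TTCAGGAGTA CACGCCGGGT"))  # MPQVQEYTPG
print(translate("CATAAATGTT TGCATTGA"))               # HKCLH
```

The two outputs match the first and last residues of the protein sequence listed above.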