Gene Dtox_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0403 
Symbol 
ID8427338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp413855 
End bp415147 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content54% 
IMG OID645032791 
Productoxidoreductase/nitrogenase component 1 
Protein accessionYP_003189969 
Protein GI258513747 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.455367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATT ATTTAAACAA TATTACGCCT GATAGCTTTT CAGGCGCACT GTTTGCGCTG 
GAGGGTGTTG CACGATCGGT CGTACTGCTT AACGGCCCCA CCGGCTGCAA ATTCTACCAC
TCGGCGACAT CGGACAACCA ATTAATCCGG CAATTTGAGT TTGATCCGCT TAACTACCCG
GAAAAGTGGT ATTTCGGCCA GCCGCGCGTA CCCTGTACCT ATCTTGACAA CGGCGATTAC
ATCTACGGCG GTGCGGACAA GCTGATTGAG GCGCTTACAT TTTTACGGGA CAATGTCACC
TTTGAACTGC TGTGCATCGT GAACTCCCCT GGTGCCGCGC TGATCGGGGA CGACCTCTGC
GGAATTGCGA AAACCGTCAT ACCGTACAGG CCCGTGGTTG TACTGGAAAC GCCCGGCTTT
TCGAGCGACG TGTGCGCGGG GCACGAGGCG GCGGCTCTGG CGCTTTTAAA GCAGCTGCCG
CCGCCAAAAG CGGACAGGAC AGTCAGCCGG CGTGTCAATC TCCTGGGACT TTCCCTCTTT
CACAGAAACC ACACCGGGGA CGTGGCGGAG CTGCGGCGTA TCTTTTCCCT CTGTGGCCTG
CATCTCGGCT GTGTTCTTTG CGGCGGCGGA AGCCTTGCGG ACATGGCAGT AATGCCGGAA
GCGGCGCTCA ATATTGTCAT TCACCCGGAA TATGGGCTGA AAACAGCGGA ATACCTTAAG
ACGCATTACG GCACACCTTT TTATGTATGC GACGGGCCGC CAATCGGCTT TGCGGCGACA
GAAAAGCTGC TGCGTGAGGT TTGCGACCTG ACAGGTGCGG ACGCATCGGA TGCCATTCGG
GAGAGTGAAC AGGCCCGCGC ACGCGCCTAC GCCTTTATCT CCCGGGTCAA TTCACTGACC
GGCTTGCCGA AGGGGGTTTC CTTTGCGGTG GAAGGAAACT GCTCGGAGCT GTATGTGTAT
GTTGATTTTC TGGTGCGTTA CTTTGGCATG ATCCCCGAGT GTGTCTCAAT TCTGAATCCG
CAAAGCAGCG TATTCAAAGA ACGCCTGACA GAACTTCTAT CGGACTTCGG ACTGACCAAC
GCTATGGAAC GTGACATTCT GAACACAAAC GCAGAGCTTG TTTTCGCAAG CGGCAACACA
ATTTCTAAGC TCAGGCTGAA AAAGCACGTC TTCACCGGCA TTGAAACGGC TCTGCCCACC
CTTGGTTATC TCGACGTGGT ACCCAAAACC CAGCTAGGCA CCGGCGGTGC ATTGCACGTG
ACAGAGCTGA TATTGAACGG ATTAATGTAT TAG
 
Protein sequence
MSNYLNNITP DSFSGALFAL EGVARSVVLL NGPTGCKFYH SATSDNQLIR QFEFDPLNYP 
EKWYFGQPRV PCTYLDNGDY IYGGADKLIE ALTFLRDNVT FELLCIVNSP GAALIGDDLC
GIAKTVIPYR PVVVLETPGF SSDVCAGHEA AALALLKQLP PPKADRTVSR RVNLLGLSLF
HRNHTGDVAE LRRIFSLCGL HLGCVLCGGG SLADMAVMPE AALNIVIHPE YGLKTAEYLK
THYGTPFYVC DGPPIGFAAT EKLLREVCDL TGADASDAIR ESEQARARAY AFISRVNSLT
GLPKGVSFAV EGNCSELYVY VDFLVRYFGM IPECVSILNP QSSVFKERLT ELLSDFGLTN
AMERDILNTN AELVFASGNT ISKLRLKKHV FTGIETALPT LGYLDVVPKT QLGTGGALHV
TELILNGLMY