Gene Dtox_1029 details


Gene Information

Locus tag: Dtox_1029
Symbol:
ID: 8427968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1051402
End bp: 1052802
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 44%
IMG OID: 645033364
Product: Nitrogenase
Protein accession: YP_003190538
Protein GI: 258514316
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
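
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1051402, 1052802)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both values agree with the Gene Length and Protein Length fields of this record.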


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAAAGG AAAACCTGCA TTATAAAAAT GTTAATGAAA ATCCGTGCAA TATGTGTATG 
CCCATGGGGG GTATCCTACC CTTTAAGGGT TTGGAGAATT CCATGGTAAT TATTCACGGT
TCGCAGGGCT GCAGTACCTA TATGCGCAGG CATATGGCGG AGCACTTTAA TGAACCCATT
GATGTAGGCT CTACCTCCTT GAATGAAAAA GGTACAATTT ATGGTGGAGG AAACAACCTG
AAAAGGGGTC TGGATAACAT ATTGAAGGTT TATCAGCCAG GTTTGATCGG TGTGTTGACT
ACTTGTCTGG CCGAAACCAT TGGAGAGGAT ACAGAGAGGC TGTCGGCAGA ATACCTGCTG
GAGAGAGGAA TGCCTGACTA TCCCGTGATT CCTGTACCTA CTCCCGGTTA CGGGGGCAGT
CATGCGGAGG GCTATTGGCT GGCTGTAAGA AAAATAGTAG GTAAGCTTGC TCGTGAGACA
GAACCTCACA ATAAAATCAA CATCATTATT CCCAACATCA GTCCTGCTGA TATAAGGGAA
ATTAAGCGAT TGCTGCAACT GATGCAGGCG GATTATACAC TCTTGCCTGA CTTTTCCGAT
ACACTGGATA GGCCCTATGA ACGAAGCTAC AAGAAGATGC CGGAAGGAGG CACAAAGGTT
TCCGACATAA TACGAATGGC GGGAGCCATG GCTACTGTTC AGCTGGGGCT GACGGTAGAT
GAAAATTATT CACCGGGGCT TTACTTGGAA AGAGAATTTG GCGTACCCTT TTACAACTTA
CCCATACCTA TGGGAGTAGA GTCTGTGGAT TTGTTCCTAA AAGTGCTGTC TGATTTGACT
GGAAATGATG TGCCTGAGTG TTTATTGCAG GAAAGAGGCA GATTGCTGGA TTGCATGATT
GACTCACATA AATATAACTT TCAGGGTAAA AGTGTTATCT TCGGAGAACC GGAACTCGTC
TATGCCATAA GCAGAACCTG TCTGGAAAAC GGTATTAAGC CAGTGGTAGT GGCTACAGGC
AGCAAAACAG GGAGACTCTC CGAATTGCTT AAACCCCTTC TTGATGAAGC AAGTGAAAAG
AATTTTATTC TTGAGGAAAC TGATTTTGTG ACAGTTCGCA GTAAGAGTAA AGAAGCCGGT
GCCAATATTG CTATCGGGCA TTCGGACGGC AAATATTTGA CAGAAAGGGA AAGCATTCCG
TTGGTTCGCA TGGGTTTTCC CATTCATGAC AGGGTTGGTG GACAGAGATT ATTGTCGGTT
GGCTATACCG GAACAACTAT GTTTTTAGAT AGAGTAACCA ATAAGTTATT AGAGAATAAG
CACGGAAATT ACCGCAAGCT AATCTATCAA AATTTTTACC GGGGTACTGG GAGGAAAAAA
CTGTGCTGTC CGGGAAGTTG A
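
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted in as text with the 10-base spacing shown above. The helper name `gc_percent` is hypothetical, and the exact rounding the database applies is not known.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First 10-base block of the gene sequence: three G, no C.
print(gc_percent("TTGAAAAAGG"))  # 30.0
```

Passing the full 1401 bp sequence to the same function gives the value to compare against the reported GC content.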
 
Protein sequence
MKKENLHYKN VNENPCNMCM PMGGILPFKG LENSMVIIHG SQGCSTYMRR HMAEHFNEPI 
DVGSTSLNEK GTIYGGGNNL KRGLDNILKV YQPGLIGVLT TCLAETIGED TERLSAEYLL
ERGMPDYPVI PVPTPGYGGS HAEGYWLAVR KIVGKLARET EPHNKINIII PNISPADIRE
IKRLLQLMQA DYTLLPDFSD TLDRPYERSY KKMPEGGTKV SDIIRMAGAM ATVQLGLTVD
ENYSPGLYLE REFGVPFYNL PIPMGVESVD LFLKVLSDLT GNDVPECLLQ ERGRLLDCMI
DSHKYNFQGK SVIFGEPELV YAISRTCLEN GIKPVVVATG SKTGRLSELL KPLLDEASEK
NFILEETDFV TVRSKSKEAG ANIAIGHSDG KYLTERESIP LVRMGFPIHD RVGGQRLLSV
GYTGTTMFLD RVTNKLLENK HGNYRKLIYQ NFYRGTGRKK LCCPGS