Gene Dtox_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1026 
Symbol 
ID8427965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1046750 
End bp1048387 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content43% 
IMG OID645033361 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_003190535 
Protein GI258514313 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGTAA ACGAAAAAGA TTTGTTGATG GATAAGCAGA TAGAAGAGAT GTTGAACCAG 
TATCCGGCTA AAGTCAAGAG AAACCGGAAA AACCACATAG TGCTCAATGA TACTTCCTTG
GAGCACCAGC AGATTGAAGC CAACACACGT ACGATTCCCG GCATCATAAC CAATCGGGGC
TGTGCTTATG CCGGTTGTAA AGGCGTTGTA TTGGGGCCTC TGAAAGACAT GGTGCATATA
GTTCACGGCC CTATCGGCTG TTCATATTAT GCCTGGATGA CCAGGAGAAA TAAGGCCAAG
TCAGAGGATC CCAAGAAAAA TTTTCTCAAT TATTGTGTTT CAACTGATAT GCAGGAAAGT
GATATTGTTT TTGGCGGGGA GAAAAAACTG GCCAGAATGA TTGACGAAGT AGTGGAGATA
TTCAAACCAA ACGCTATTAC AGTATCCGCT ACTTGTCCGG TAGGCCTGAT CGGAGACGAT
ATACAGTCGA TTGCCAAGGC AGCTGAGGAT AAGTACGGCA TACCTATTAT GGCTTTTAAC
TGCGAGGGCT ACAAGGGTGT GAGTCAGTCG GCCGGACACC ATATCGCCAA TAACGGTTTA
ATGGAGCATG TCATAGGCAG AGGTGATTTG GAAGAGGCCC CGGCCAAATA TACCATAAAT
ATGCTAGGTG AATATAACAT AGGCGGAGAC AGTTGGGAAA TAGAAAGAGT TCTTACGGAG
TTAGGCTATC ATGTACAAAC GGTAATGACG GGAAACGGTG CCTATGAGGA TTTAAAAAAC
GCCCATATAG CTCAATTAAA TTTGGTGCAG TGTCATCGTT CCATTAACTA TATAGCAGAT
ATGCTGGAAA AAAAATATGG TACTCCCTGG TTAAAGGTGA ACTTTATAGG CATTCGCAGC
ACTATTGAAT CACTGCGCAA CATGGCTATG TATTTCGGTG ACAAAGAGTT AATTGATAAA
ACCGAAGAAG TGATTGCCCG TGAATTGGCT GAAATTGAAC CGGCCCTGGA TAAGTACAGA
AAATACACCA ATGGAAAAAC AGCCTTTTGT TTTGTCGGGG GTTCACGCGG CCACCACTAC
CAGATTCTCT ATGAGGAACT GGGTATAAAG ACACTCCTGG CAGGTTACGA GTTTGCGCAT
CGGGATGATT ATGAAGGAAG AGTGGTTCTT CCTGATATCA AACCGGATGC TGACAGTAAA
AACATTCCTG ACCTGCATGT AGAGCCGGAT GAAAGACGTT ACCGGGTTAA GATTCCACAG
GAAAGAATGG CTGAATTGCA GGAAAAAATG CCTTTGTGTA ATTATAAAGG CATGAGGATG
GATATGATTG ACAACACTGT AATGGTGGAT GACTTAAATC ACTACGAAAC AGAAGAATTT
ATCCGGATAT TAAAACCAGA TATCTTCTCC TCGGGTATTA AAGACAAGTA CATTGTGCAG
AAAATGGGTA TTCCATCTAA GCAGCTGCAT AATTATGACT ACAGCGGCCC ATATGCCGGA
TTTAAAGGAG CAGTAAACTT TGCCCGCGAT ATAACCATGG GAATTGTTTC GCCAACCTGG
AAATATATTA CCCCACCCTG GAAGAGCAGT CCTTTAATTG AAGGAACTAT TGAGGAAGGG
AGGTCATGCG GATGCTAG
 
Protein sequence
MAVNEKDLLM DKQIEEMLNQ YPAKVKRNRK NHIVLNDTSL EHQQIEANTR TIPGIITNRG 
CAYAGCKGVV LGPLKDMVHI VHGPIGCSYY AWMTRRNKAK SEDPKKNFLN YCVSTDMQES
DIVFGGEKKL ARMIDEVVEI FKPNAITVSA TCPVGLIGDD IQSIAKAAED KYGIPIMAFN
CEGYKGVSQS AGHHIANNGL MEHVIGRGDL EEAPAKYTIN MLGEYNIGGD SWEIERVLTE
LGYHVQTVMT GNGAYEDLKN AHIAQLNLVQ CHRSINYIAD MLEKKYGTPW LKVNFIGIRS
TIESLRNMAM YFGDKELIDK TEEVIARELA EIEPALDKYR KYTNGKTAFC FVGGSRGHHY
QILYEELGIK TLLAGYEFAH RDDYEGRVVL PDIKPDADSK NIPDLHVEPD ERRYRVKIPQ
ERMAELQEKM PLCNYKGMRM DMIDNTVMVD DLNHYETEEF IRILKPDIFS SGIKDKYIVQ
KMGIPSKQLH NYDYSGPYAG FKGAVNFARD ITMGIVSPTW KYITPPWKSS PLIEGTIEEG
RSCGC