Gene Dtox_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0404 
Symbol 
ID8427339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp415116 
End bp417311 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content57% 
IMG OID645032792 
ProductNifH/frxC-family protein 
Protein accessionYP_003189970 
Protein GI258513748 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1348] Nitrogenase subunit NifH (ATPase) 
TIGRFAM ID[TIGR01287] nitrogenase iron protein 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.394931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAA TCGCTGTTTA CGGCAAGGGC GGTATCGGGA AATCGACCAT CAGTGCAAAC 
CTCTCGGCGG CGTTTGCGCT TGCGGGGCAT CGGGTACTTC AGATTGGATG TGACCCGAAA
CACGATTCCA CGCGGCTTCT TCTGCAGGGG AAGCGGCTGA CGACCGTGCT GGATTATTTG
CGCATCACTA ACCCACTGGA ATACCGGCTG GAGGACATTC TGGAGGAAGG CTTTGCGGGC
ATCGGCTGCG TGGAAGCCGG TGGGCCGAAG CCCGGTGTGG GATGCGCCGG GCGCGGCATT
ATCAGTACGT TTGAGTTACT AGAGAAGTTT AAGCTGCATG ACCGCTATGG TGTGACGGTG
TATGATGTCC TGGGCGACGT GGTCTGCGGC GGCTTTGCCG TCCCGATCCG TCGCGAATAT
GCCGATACTA TCCTGATTGT CACCTCAGGG GAGTTCATGG CTCTGTACGC GGCCAACAAT
ATTCTGCGGG GCATCCGCAA TTATGATGGG ACGGAACGGC GCGTCGCCGG GCTTGTCTAC
AACCGCCGCA ATGTGGAGGG CGAGGACGAA CACGTCGCGC GATTTGCCGC CGCCGTCGGT
TTACCTATCT GTGCCACGGT TCCGCGCAGC GATGCGTTTT CCCACGCCGA ACGTGCCAAC
CGGACGGTTA TAGAGATGGA AGCCGATGCG GAGGTCTGCG CCATCTTTCA CAGGCTTGCC
GCAGCGCTTG ACGGGTGTAC ATTGTATGAG GCCAAGCCGC TTACAGATGA GGCGCTTGAG
GAAACCGTAC TCGGCATTTC GCTTGAACAA ACGCCCACCG AAGCGGCCAA TTCCGGCGCA
GAGCCCGTAG AGAAACTGCC GCCGCCTGCT GAAATTGGTC TGACAGACCA AAACCGGTAT
CTGTCAAAAA ATCTGATTCG CGGTGAGCCC CTGCATGGCT GTGCCTTCAA CGGCGCGGTT
TCCACGGCGG TGCATCTGCG CGATGCCTTG GTACTGGCCC ACGCGCCGAA AAGCTGTGCC
CACATCTCCT ACCAAACCAT CATCTCCAGC GGACGGCGCA AGCTGTTTGA GCGCGGCGCG
CTTTTGCCCG CCTCCCTGAT GCCAAATTTG GAATGCACCG AAATGGGTGA ACCGGAAATC
GTATTCGGCG GGACAGAAAA GCTGGAGGAA AGGCTTTTGG CACTTAAAAA GCAAAAGCCC
CGCGCCATTG TCGTAGTCAG CTCATGCCCT GCCGGTATTA TCGGGGATGA TATCGACCGC
GTAAAGAATT TATCCGAGCC GGGACTGCCC GTCATCCCCG TGCGTGCCGA CGGCAATATG
GCGGGTGACT ATCTTCAAGG TATGTTCATG AGCTATATCG CGCTGGCGCG CAGCATAATT
AAGCGGAATG TTCCTCCTGT GCCGGATACG GTCAATATTG TGGCGGAGAA AATCGTTGTG
ACGAATACGG AGGATAATTT CAGTGTGATC AACGGCTATT TAAGCCAAAT GGGCGTGCGT
GTAAACTGCC GCTTTCTCTA TAACACAACG GCGAACGCGC TGGAGAACTT CTGCGCCGCG
CCACTGAACC TTCCGGCATA CGGCGACTAT ACGGCAAATA CGCTGCGGGG CTTTTTTGAA
AAGGAATATG GGGCCCGATT TTTCCCTCTG GCCTTTCCCG TGGGCCTGGA AGAAACAGCC
GACTGGCTGC GCGGCGTAGC TGCCTTCTTT GGACGCCCGG ATGCCGCGGC AAAAATTATT
GCGGAGCATA CAGAGCGGTA TATGCGGGAA ATTGCGGCGC TGCGCCCACA GCTGTCGGGG
AAAAAGCTGA TGATCATTAC ATACAACCAT GAACTGGACT GGATTCTGAA AACGGCGCTG
GACGCGGGCA TGGAAATTGC CAAAATCGGC GTTCTCAACT ATTCACAGGA CGAGGGCTTC
CGCACCCGGC TTTCCGTCAG CCTCCCTGTG GAGGAAAACT ATGACCGCGA AAGGCGGGCG
GACGACATCG CCGCGTACCG CCCCGATGTG CTGCTGACCA ACTATGCCTC CTCCGTGGCG
GAGGATGTGC CGGTGGCGGA CACTATTCCC ATGTGCCCGA ATGTAGGCTT TGAATCGGGA
CTTACGCTGG CGCGGCGCTG GGCGCGGCTC TTACAGCTCA ATGTGAAAGG AGCGTGGCGG
CAGGATGAGC AATTATTTAA ACAATATTAC GCCTGA
 
Protein sequence
MKRIAVYGKG GIGKSTISAN LSAAFALAGH RVLQIGCDPK HDSTRLLLQG KRLTTVLDYL 
RITNPLEYRL EDILEEGFAG IGCVEAGGPK PGVGCAGRGI ISTFELLEKF KLHDRYGVTV
YDVLGDVVCG GFAVPIRREY ADTILIVTSG EFMALYAANN ILRGIRNYDG TERRVAGLVY
NRRNVEGEDE HVARFAAAVG LPICATVPRS DAFSHAERAN RTVIEMEADA EVCAIFHRLA
AALDGCTLYE AKPLTDEALE ETVLGISLEQ TPTEAANSGA EPVEKLPPPA EIGLTDQNRY
LSKNLIRGEP LHGCAFNGAV STAVHLRDAL VLAHAPKSCA HISYQTIISS GRRKLFERGA
LLPASLMPNL ECTEMGEPEI VFGGTEKLEE RLLALKKQKP RAIVVVSSCP AGIIGDDIDR
VKNLSEPGLP VIPVRADGNM AGDYLQGMFM SYIALARSII KRNVPPVPDT VNIVAEKIVV
TNTEDNFSVI NGYLSQMGVR VNCRFLYNTT ANALENFCAA PLNLPAYGDY TANTLRGFFE
KEYGARFFPL AFPVGLEETA DWLRGVAAFF GRPDAAAKII AEHTERYMRE IAALRPQLSG
KKLMIITYNH ELDWILKTAL DAGMEIAKIG VLNYSQDEGF RTRLSVSLPV EENYDRERRA
DDIAAYRPDV LLTNYASSVA EDVPVADTIP MCPNVGFESG LTLARRWARL LQLNVKGAWR
QDEQLFKQYY A