Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1029 |
Symbol | |
ID | 8427968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1051402 |
End bp | 1052802 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645033364 |
Product | Nitrogenase |
Protein accession | YP_003190538 |
Protein GI | 258514316 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAAAGG AAAACCTGCA TTATAAAAAT GTTAATGAAA ATCCGTGCAA TATGTGTATG CCCATGGGGG GTATCCTACC CTTTAAGGGT TTGGAGAATT CCATGGTAAT TATTCACGGT TCGCAGGGCT GCAGTACCTA TATGCGCAGG CATATGGCGG AGCACTTTAA TGAACCCATT GATGTAGGCT CTACCTCCTT GAATGAAAAA GGTACAATTT ATGGTGGAGG AAACAACCTG AAAAGGGGTC TGGATAACAT ATTGAAGGTT TATCAGCCAG GTTTGATCGG TGTGTTGACT ACTTGTCTGG CCGAAACCAT TGGAGAGGAT ACAGAGAGGC TGTCGGCAGA ATACCTGCTG GAGAGAGGAA TGCCTGACTA TCCCGTGATT CCTGTACCTA CTCCCGGTTA CGGGGGCAGT CATGCGGAGG GCTATTGGCT GGCTGTAAGA AAAATAGTAG GTAAGCTTGC TCGTGAGACA GAACCTCACA ATAAAATCAA CATCATTATT CCCAACATCA GTCCTGCTGA TATAAGGGAA ATTAAGCGAT TGCTGCAACT GATGCAGGCG GATTATACAC TCTTGCCTGA CTTTTCCGAT ACACTGGATA GGCCCTATGA ACGAAGCTAC AAGAAGATGC CGGAAGGAGG CACAAAGGTT TCCGACATAA TACGAATGGC GGGAGCCATG GCTACTGTTC AGCTGGGGCT GACGGTAGAT GAAAATTATT CACCGGGGCT TTACTTGGAA AGAGAATTTG GCGTACCCTT TTACAACTTA CCCATACCTA TGGGAGTAGA GTCTGTGGAT TTGTTCCTAA AAGTGCTGTC TGATTTGACT GGAAATGATG TGCCTGAGTG TTTATTGCAG GAAAGAGGCA GATTGCTGGA TTGCATGATT GACTCACATA AATATAACTT TCAGGGTAAA AGTGTTATCT TCGGAGAACC GGAACTCGTC TATGCCATAA GCAGAACCTG TCTGGAAAAC GGTATTAAGC CAGTGGTAGT GGCTACAGGC AGCAAAACAG GGAGACTCTC CGAATTGCTT AAACCCCTTC TTGATGAAGC AAGTGAAAAG AATTTTATTC TTGAGGAAAC TGATTTTGTG ACAGTTCGCA GTAAGAGTAA AGAAGCCGGT GCCAATATTG CTATCGGGCA TTCGGACGGC AAATATTTGA CAGAAAGGGA AAGCATTCCG TTGGTTCGCA TGGGTTTTCC CATTCATGAC AGGGTTGGTG GACAGAGATT ATTGTCGGTT GGCTATACCG GAACAACTAT GTTTTTAGAT AGAGTAACCA ATAAGTTATT AGAGAATAAG CACGGAAATT ACCGCAAGCT AATCTATCAA AATTTTTACC GGGGTACTGG GAGGAAAAAA CTGTGCTGTC CGGGAAGTTG A
|
Protein sequence | MKKENLHYKN VNENPCNMCM PMGGILPFKG LENSMVIIHG SQGCSTYMRR HMAEHFNEPI DVGSTSLNEK GTIYGGGNNL KRGLDNILKV YQPGLIGVLT TCLAETIGED TERLSAEYLL ERGMPDYPVI PVPTPGYGGS HAEGYWLAVR KIVGKLARET EPHNKINIII PNISPADIRE IKRLLQLMQA DYTLLPDFSD TLDRPYERSY KKMPEGGTKV SDIIRMAGAM ATVQLGLTVD ENYSPGLYLE REFGVPFYNL PIPMGVESVD LFLKVLSDLT GNDVPECLLQ ERGRLLDCMI DSHKYNFQGK SVIFGEPELV YAISRTCLEN GIKPVVVATG SKTGRLSELL KPLLDEASEK NFILEETDFV TVRSKSKEAG ANIAIGHSDG KYLTERESIP LVRMGFPIHD RVGGQRLLSV GYTGTTMFLD RVTNKLLENK HGNYRKLIYQ NFYRGTGRKK LCCPGS
|
| |