Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_0837 |
Symbol | |
ID | 8376492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 909014 |
End bp | 910654 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645000076 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_003157371 |
Protein GI | 256828643 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCAG CTGCAAAAAA GCTCGTCAAC TGGACGCCCA CCGACATCAA GGCGGACATG CTCGCCAAGT ATCCGCCCAA GGTGGCCAGA AAGCGCGCTT CCCAGATTCT CATCAACGAG GCGCTCGGCA ACGAAACGCC TGAAATATCG GCCAATGTGC GCACCATCCC CGGCATCATC ACCATGCGCG GCTGCACCTA CGCCGGATGC AAGGGCGTTA TCCTCGGGCC CACCCGCGAC ATCGTCAACA TCACCCACGG CCCCATCGGC TGCGGCTTCT ACTCCTGGCT GACCCGCCGC AACCAGACCG ACGCATCGGC CGAGGGCGCG GAAAACTTCA TGCCATACTG CTTTTCCACG GACATGCAGG ACCAGGACAT CATCTTCGGC GGCGAGAAGA AACTGCAGGC CGCCATCCAG GAAGCCTACG ACCTCTTCCA TCCCAAGGCC ATCGCCATCT TCGCGACCTG TCCCGTCGGC CTCATCGGCG ACGACATCCA CGCCGTGGCG CGCAAGATGA AGGCCAAATT CGGCGACTGC AACGTCTTCG CCTTCAGCTG TGAAGGATAC AAGGGCGTCA GCCAGTCCGC CGGCCACCAC ATCGCCAACA ACAAGATCTT CAGCGAAGTG GTGGGCGAGA ACGATGCCGA GAAGCCCGGC CAGTTCAAGA TCAACCTCCT GGGCGAATAC AACATCGGCG GCGATGGGTT CGAGATCGAC CGTATCCTTA AAAAATGCGG CATCACCAAC ATCTCCACCT TCTCCGGCAA CTCGACCTAC GACCAGTTCG CCTCGGCCCA CAAGGCCGAC CTGTCCGCGG TCATGTGCCA CCGCTCTATA AACTACGTGG CCGACATGCT TGAAACCAAG TTCGGCATCC CGTGGATCAA GGTCAACTTC ATCGGCGCCA AGTCCACGGC CAAGTCCCTG CGCAAGATCG CGGAATATTT CGGCGATCCG GGCCTGACCG CCCGCGTGGA AGAGGTCATC GCCGAAGAAA TGCCCGCCGT GGAAGCCGTC ATCAGCGACG TGCTGCCCCG CACCACGGGC AAGACGGCCA TGCTCTTCGT CGGCGGCTCC CGCGCCCATC ATTACCAGGA CCTCTTTGCC GAGATGGGCA TGAAGACCCT GGCCGCCGGT TACGAGTTCG CCCACCGCGA CGACTACGAA GGCCGCCACG TCATCCCGAA TTTGAAGGTC GATGCCGACA GCCGCAACAT CGAGGAAATC GAGGTCGAGG CGGACGAGAA GCGTTACGCC CCGCGCAAGA CCTCCGAAGA GATGGCCAGG CTCGAAGCCG CCGGTTTGAA GTTCAAGGAA TACGAAGGCC TGATCCCGGA CATGGACCAC CAGACTCTCG TTATCGACGA CCTCAACCAG TACGAGGCCG AAAAGCTGGT CGAGATCGTG AAGCCCGACA TCTTTTGCGC AGGCGTCAAG GAGAAGTTCT CCATCCAGAA GCTGGGCATC CCCATGAAAC AGCTGCACAG CTACGATTCC GGCGGTCCCT ATGCGGGATT TCAGGGCGCG GTCAATTTCT ATCACGAAAT CGACCGTCTC GTGAACAGCA AGGTCTGGAG CTACATGAAG GCCCCCTGGC AGGAAAGCCC GGAACTGTCC GCCACGTACG TGTGGGAATA A
|
Protein sequence | MSSAAKKLVN WTPTDIKADM LAKYPPKVAR KRASQILINE ALGNETPEIS ANVRTIPGII TMRGCTYAGC KGVILGPTRD IVNITHGPIG CGFYSWLTRR NQTDASAEGA ENFMPYCFST DMQDQDIIFG GEKKLQAAIQ EAYDLFHPKA IAIFATCPVG LIGDDIHAVA RKMKAKFGDC NVFAFSCEGY KGVSQSAGHH IANNKIFSEV VGENDAEKPG QFKINLLGEY NIGGDGFEID RILKKCGITN ISTFSGNSTY DQFASAHKAD LSAVMCHRSI NYVADMLETK FGIPWIKVNF IGAKSTAKSL RKIAEYFGDP GLTARVEEVI AEEMPAVEAV ISDVLPRTTG KTAMLFVGGS RAHHYQDLFA EMGMKTLAAG YEFAHRDDYE GRHVIPNLKV DADSRNIEEI EVEADEKRYA PRKTSEEMAR LEAAGLKFKE YEGLIPDMDH QTLVIDDLNQ YEAEKLVEIV KPDIFCAGVK EKFSIQKLGI PMKQLHSYDS GGPYAGFQGA VNFYHEIDRL VNSKVWSYMK APWQESPELS ATYVWE
|
| |