Gene Dbac_0837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_0837 
Symbol 
ID8376492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp909014 
End bp910654 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content60% 
IMG OID645000076 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_003157371 
Protein GI256828643 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCAG CTGCAAAAAA GCTCGTCAAC TGGACGCCCA CCGACATCAA GGCGGACATG 
CTCGCCAAGT ATCCGCCCAA GGTGGCCAGA AAGCGCGCTT CCCAGATTCT CATCAACGAG
GCGCTCGGCA ACGAAACGCC TGAAATATCG GCCAATGTGC GCACCATCCC CGGCATCATC
ACCATGCGCG GCTGCACCTA CGCCGGATGC AAGGGCGTTA TCCTCGGGCC CACCCGCGAC
ATCGTCAACA TCACCCACGG CCCCATCGGC TGCGGCTTCT ACTCCTGGCT GACCCGCCGC
AACCAGACCG ACGCATCGGC CGAGGGCGCG GAAAACTTCA TGCCATACTG CTTTTCCACG
GACATGCAGG ACCAGGACAT CATCTTCGGC GGCGAGAAGA AACTGCAGGC CGCCATCCAG
GAAGCCTACG ACCTCTTCCA TCCCAAGGCC ATCGCCATCT TCGCGACCTG TCCCGTCGGC
CTCATCGGCG ACGACATCCA CGCCGTGGCG CGCAAGATGA AGGCCAAATT CGGCGACTGC
AACGTCTTCG CCTTCAGCTG TGAAGGATAC AAGGGCGTCA GCCAGTCCGC CGGCCACCAC
ATCGCCAACA ACAAGATCTT CAGCGAAGTG GTGGGCGAGA ACGATGCCGA GAAGCCCGGC
CAGTTCAAGA TCAACCTCCT GGGCGAATAC AACATCGGCG GCGATGGGTT CGAGATCGAC
CGTATCCTTA AAAAATGCGG CATCACCAAC ATCTCCACCT TCTCCGGCAA CTCGACCTAC
GACCAGTTCG CCTCGGCCCA CAAGGCCGAC CTGTCCGCGG TCATGTGCCA CCGCTCTATA
AACTACGTGG CCGACATGCT TGAAACCAAG TTCGGCATCC CGTGGATCAA GGTCAACTTC
ATCGGCGCCA AGTCCACGGC CAAGTCCCTG CGCAAGATCG CGGAATATTT CGGCGATCCG
GGCCTGACCG CCCGCGTGGA AGAGGTCATC GCCGAAGAAA TGCCCGCCGT GGAAGCCGTC
ATCAGCGACG TGCTGCCCCG CACCACGGGC AAGACGGCCA TGCTCTTCGT CGGCGGCTCC
CGCGCCCATC ATTACCAGGA CCTCTTTGCC GAGATGGGCA TGAAGACCCT GGCCGCCGGT
TACGAGTTCG CCCACCGCGA CGACTACGAA GGCCGCCACG TCATCCCGAA TTTGAAGGTC
GATGCCGACA GCCGCAACAT CGAGGAAATC GAGGTCGAGG CGGACGAGAA GCGTTACGCC
CCGCGCAAGA CCTCCGAAGA GATGGCCAGG CTCGAAGCCG CCGGTTTGAA GTTCAAGGAA
TACGAAGGCC TGATCCCGGA CATGGACCAC CAGACTCTCG TTATCGACGA CCTCAACCAG
TACGAGGCCG AAAAGCTGGT CGAGATCGTG AAGCCCGACA TCTTTTGCGC AGGCGTCAAG
GAGAAGTTCT CCATCCAGAA GCTGGGCATC CCCATGAAAC AGCTGCACAG CTACGATTCC
GGCGGTCCCT ATGCGGGATT TCAGGGCGCG GTCAATTTCT ATCACGAAAT CGACCGTCTC
GTGAACAGCA AGGTCTGGAG CTACATGAAG GCCCCCTGGC AGGAAAGCCC GGAACTGTCC
GCCACGTACG TGTGGGAATA A
 
Protein sequence
MSSAAKKLVN WTPTDIKADM LAKYPPKVAR KRASQILINE ALGNETPEIS ANVRTIPGII 
TMRGCTYAGC KGVILGPTRD IVNITHGPIG CGFYSWLTRR NQTDASAEGA ENFMPYCFST
DMQDQDIIFG GEKKLQAAIQ EAYDLFHPKA IAIFATCPVG LIGDDIHAVA RKMKAKFGDC
NVFAFSCEGY KGVSQSAGHH IANNKIFSEV VGENDAEKPG QFKINLLGEY NIGGDGFEID
RILKKCGITN ISTFSGNSTY DQFASAHKAD LSAVMCHRSI NYVADMLETK FGIPWIKVNF
IGAKSTAKSL RKIAEYFGDP GLTARVEEVI AEEMPAVEAV ISDVLPRTTG KTAMLFVGGS
RAHHYQDLFA EMGMKTLAAG YEFAHRDDYE GRHVIPNLKV DADSRNIEEI EVEADEKRYA
PRKTSEEMAR LEAAGLKFKE YEGLIPDMDH QTLVIDDLNQ YEAEKLVEIV KPDIFCAGVK
EKFSIQKLGI PMKQLHSYDS GGPYAGFQGA VNFYHEIDRL VNSKVWSYMK APWQESPELS
ATYVWE