Gene Dhaf_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_1050 
Symbol 
ID7258018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp1144697 
End bp1146130 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content51% 
IMG OID643560964 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_002457546 
Protein GI219667111 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCA GTGAAATGGT CCAGGCCAGA AAAGAACTGG TCAATCAAGT GTTGGAAGTC 
TATCCGGAAA AGGCTAAGAA AAACCGCAGG CAGCATCTTT CTGTCAAAGA AAGTGATTGC
TCAAGCTGCG CTGTTAAGTC CAATGGGAAA ACCGTACCGG GAATCATGAC CGCCCGGGGC
TGCGCTTACG CCGGAGCCAA AGGGGTTGTG TGGGGGCCGG TCAAGGATAT CGTGCATATT
TCCCACGGGC CGGTGGGGTG CGGGTTTTAT TCCTGGGCTA ACCGCCGGAA CCTGGCAGAA
GGTGAAGTGG GCATAGACAA TTTCGTACCC TTCCAATTCA CCTCGGACTT CCAGGAGAGC
GATATCATCT ACGGCGGGGA CAAAAAACTG GAGAAGATCA TTGAAGAGGT CGTCGAACTG
TTCCCCAATG CCAAAGGAGT TTCCGTACTG TCCGAATGCC CTGTGGGGCT GATCGGAGAC
GATATCGAAA GTGTCGCCCG CCGGATGACG GAGAAAACCC AGCGCCCTGT GGTGCCGGTG
CGCTGCGAAG GGTTCAGGGG AATCAGTCAG TCCCTCGGTC ACCATATTGC CAATGATGCA
ATTCGCGATC ATATCATCGG CAAGGGGCCG GAGCGGGAAA TCGGACCCTA CGATATTGGG
ATTATCGGCG ATTACAATAT CGGTGGTGAT GCCTGGGCCA GCAAAAAAAT CCTGGAAGAA
ATCGGCTTGA ATGTGGTGAA TATCTGGACC GGTGATTCTA CTCTGGAAAT GCTTCAGAAC
GGGCATCTGG TCAAACTGAA CTTAATCCAT TGCTACCGGA GTATGAACTA TATGGCCAAC
TATATGGAGG AAACCTACGG AACCCCCTGG CTGGAATTCA ATTTCTTCGG ACCCACGAAG
ATTAAGGAAT CCCTGCTTAA CATTGCAGCT CATTTTGACG ACTCCATTAG GGAGAATACC
CAAAGAGTCA TTGCCAAATA TGAAGCCCAG ATGCAAAAAG TAATCGATAT CTACCGGCCC
CGTTTGGCCG GCAAAAAGGT CATGCTGTAT GTAGGCGGCC TTCGCCCCCG GCATGTGGTG
GGCGCTTATG AAGATTTGGG CATGGAGATT ATCGGTACGG GCTATGAGTT TGCCCATAAG
GAGGATTACG AACGGACTTA CCCCCAATTG AAGGAAGGAA CCCTTATTTA CGATGATGTA
TCCGCTTTGG AACTGGAAGA GTTTGTTAAG GATCTTAAGC CGGATCTGGT GGGCTCGGGA
ATTAAGGAAA AGTATGTCTT TGAAAAAATG GGCCTGCCTT TCCGCCAGAT GCACTCCTGG
GATTACTCCG GACCTTATCA CGGTTATGAT GGATTCCCCA TTTTTGCCCG GGATATGGAT
ATGGCGGTGA ACAGTCCCAC CTGGAAGTCC ATTAAAGCTC CTTGGATGAA ATAA
 
Protein sequence
MSISEMVQAR KELVNQVLEV YPEKAKKNRR QHLSVKESDC SSCAVKSNGK TVPGIMTARG 
CAYAGAKGVV WGPVKDIVHI SHGPVGCGFY SWANRRNLAE GEVGIDNFVP FQFTSDFQES
DIIYGGDKKL EKIIEEVVEL FPNAKGVSVL SECPVGLIGD DIESVARRMT EKTQRPVVPV
RCEGFRGISQ SLGHHIANDA IRDHIIGKGP EREIGPYDIG IIGDYNIGGD AWASKKILEE
IGLNVVNIWT GDSTLEMLQN GHLVKLNLIH CYRSMNYMAN YMEETYGTPW LEFNFFGPTK
IKESLLNIAA HFDDSIRENT QRVIAKYEAQ MQKVIDIYRP RLAGKKVMLY VGGLRPRHVV
GAYEDLGMEI IGTGYEFAHK EDYERTYPQL KEGTLIYDDV SALELEEFVK DLKPDLVGSG
IKEKYVFEKM GLPFRQMHSW DYSGPYHGYD GFPIFARDMD MAVNSPTWKS IKAPWMK