Gene Sfum_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1016 
Symbol 
ID4460952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1251951 
End bp1253606 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content60% 
IMG OID639701779 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_845145 
Protein GI116748458 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAA TCCATGAAGG AAAGGCCGCA CCCGCGGCCG AAGCGATCGA CCCGGCTGCA 
GTGAAGCAGG AACTCATCGC CAAGTACCCG ACCAAGGTTG CCCGCAAGCG CGGGAAGCAG
ATCATCGTCA ACAGGGTGGG CGAAGACTGT TCGGTCCCGG AGATCGGGGC CAACACCCGC
ACCATTCCCG GCATCATTAC CCAGCGCGGG TGCAGCTACG CGGGCTGCAA AGGCGTGGTG
CTGGGACCGA CCAGGGACCT TGTGAACTTG ACCCACGGTC CGATCGGGTG CGGCTTTTAC
AGTTGGCTCA CCCGGAGGAA CCAGACCAGG CCGGCGACTC CCGAGGAGGC GAACTTCATG
CCCTACTGTT TCTCCACGGA TCTTCAGGAC GAGGACATCG TGTTCGGAGG GGAGAAGAAG
CTGAGGGCGG CAATCCTTGA AGCTTACGAC ATCTTTCATC CCAAGGCGAT CAGCATTTTT
GCCACGTGTC CCGTCGGCCT TATCGGCGAC GACATTCACA CCGTGGCGAA GGAGATGAAG
GAGAGACTGG GCATCAACGT CTTTGCGTTC AGCTGCGAGG GCTACAAAGG GGTGAGCCAG
TCGGCGGGCC ACCACATCGC GAACAACGGC ATATTCAAAC ACGTGGTGGG GCTGGACGAC
ACCTCGCGCG AAGGAAAGTA CCGGATCAAC CTGCTCGGCG AATACAACAT CGGCGGAGAT
GCGTTCGAGA TCGAGCGGGT GCTCGAAAAA TGCGGCATCA CGCTGGTGGC CACGTTCAGC
GGCAATTCCA CTTACGAGCA GTTCGCGAGT TCCCACATGG CGGACCTGAA TACTGTGATG
TGCCATCGTT CCATCAATTA CGTCGCCGAG ATGATGGAAC GAAAGTTCGG CATTCCCTGG
ATCAAGACCA ATTTCATCGG CGCCGGATCG GCTGCGAAGT CCCTGCGAAA GATCGCGAAG
TACTTCGAGG ATCGCGAACT GAGCGATCGG GTCGAAGAGA TCATCGCCGA GGAGATGGTG
GAAGTGGAAA AGGTGCAGGC CGAAGTCCGG GCGCGCTGCG AAGGCAAGCT GGCGATGCTC
TTTGTCGGGG GCTCGCGCGC CCACCATTAC CAGGACCTCT TCGCCGAGAT CGGAATGAGG
ACGATCTCCG CGGGGTATGA ATTCGCGCAC CGCGACGACT ACGAAGGCCG GCGCGTCCTC
CCGGACATCA AGGTGGACGC GGACAGCCGC AGCATCGAGG AACTCGAAGT ACATCCCGAC
CCGGACCGGT ATCGTCCCAG GAAGACCGCG GAACAGATTG CGGAACTCAA GAGAGACGGT
CTGTCCTTCA ACGACTACGA GGGGATGATG GTCCGGATGG CGGACGGGAC CCTGGTGATC
GACGACATCA GCCAGTACGA AACGGAGAGG ATGATCGAGA CCTACAAGCC CGCGATCTTC
TGCGCGGGCA TCAAGGAAAA GTACGCGGTC CAGAAGAAGG GCATCCCGAT GAAGCAACTG
CATAGCTACG ATTCGGGCGG TCCCTATGCC GGATTCAAGG GAGCGGTGAA TTTCTACCGA
GAGATCGACC GCATGGTGAA CAGCAGGATC TGGTCGTACC TCAAAGCCCC CTGGCAGACC
AACCCCGAAC TCGCCGCAAC TTACGCGTGC GAATGA
 
Protein sequence
MSAIHEGKAA PAAEAIDPAA VKQELIAKYP TKVARKRGKQ IIVNRVGEDC SVPEIGANTR 
TIPGIITQRG CSYAGCKGVV LGPTRDLVNL THGPIGCGFY SWLTRRNQTR PATPEEANFM
PYCFSTDLQD EDIVFGGEKK LRAAILEAYD IFHPKAISIF ATCPVGLIGD DIHTVAKEMK
ERLGINVFAF SCEGYKGVSQ SAGHHIANNG IFKHVVGLDD TSREGKYRIN LLGEYNIGGD
AFEIERVLEK CGITLVATFS GNSTYEQFAS SHMADLNTVM CHRSINYVAE MMERKFGIPW
IKTNFIGAGS AAKSLRKIAK YFEDRELSDR VEEIIAEEMV EVEKVQAEVR ARCEGKLAML
FVGGSRAHHY QDLFAEIGMR TISAGYEFAH RDDYEGRRVL PDIKVDADSR SIEELEVHPD
PDRYRPRKTA EQIAELKRDG LSFNDYEGMM VRMADGTLVI DDISQYETER MIETYKPAIF
CAGIKEKYAV QKKGIPMKQL HSYDSGGPYA GFKGAVNFYR EIDRMVNSRI WSYLKAPWQT
NPELAATYAC E