Gene Mboo_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2081 
Symbol 
ID5409790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2155631 
End bp2156761 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content56% 
IMG OID640869326 
Productoxidoreductase/nitrogenase, component 1 
Protein accessionYP_001405238 
Protein GI154151620 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAGT GCGGGACTCC GCTGTGGCCC TGCGCCATGA CCGGGGCTGC TGCATGCCTT 
GCCGGTTTTG ACGGGATATC CGTGGTGATC CATGGATCCA GCGGGTGTTA TTACTACCCG
GCCACACTCC TCAACACACA CCTGTACGGG ACGTTTATCC TTGAAAACGA GGTGATTTTT
GGATCAGGCG ACAGGCTTAG AGAGGTGATC AGCGATCTCT CCGGGAAGGG TGACCGGATT
GCGGTTGTCA CCACCTGCGT TCCCTCCGCG CTGGGCGAGG ATATTCGTGA AATGGTGAGC
GGAAGCGAGC TCATTGTTGT GGACAGCCCC GGTTTTGCCG GGGATCTGGA GGCCGGGTAC
CAGGCAGCAC TTGCCGCGCT TGGACCTTCA GTCAACCCGG AGCAATCAGG GGTAAATATC
GATGGCATTT CCCTTGCAGA TCCGTTCCAT GAGGGAAATA TCCAGGAAAT CACCCGTCTG
CTCTATCGGG CCGGTATCCC TGTCGGAACA GTATTTTGCC GGGACCGGGC AGATAAGGTA
AAAACGGCTT CGCCTTTTAC CATCGGTACA AATGAGGATT TTGGATCCGG GGTCGGGACC
TCCCTGGGGG GTACTCTGGG TATACCTTCA CTCCGCACAA CCTTAGGAAA GCTCGCCGGA
ATTTTTGACC ATGCAGATAC GGATCCGGTA TTAAAAGAGC TAGATCTGCA GGAAGAACGA
TTGATTCATG CCGGCGATAA GTTCCTGCGC CGGTACGAGC CTCCCCGTGT TGCGATCTTT
GCCGGTGCAG CCTATGCCCT TTTTGCGGCA GATACCCTGA AACGGTATCT TGATGCGGAG
ATTCTCATTG TCGGCACACG CAACGACCTG CCGGAACAAC CGGGGGGTCA GTACCGGGTG
GAGAAACTGG CCGGGCTCGA AGCGGTTGGT CGCAGGATTA CGGAATCTGA TCCGGATCTT
GTGATCGGCT CCTCATTTGA ACGTTCTTTA TGCCCGGACC GGGCATTTGT AGGAATTATA
CCGCCATTCC GCGGACAGGT CAGGCTTGCC CATTACCCCC TTGCAGGTCT GGGAGGCTCC
CTTTATTTTG TCGAAAATGT CCTGAATGCC TGCATGGACC GGGCACCATA A
 
Protein sequence
MHECGTPLWP CAMTGAAACL AGFDGISVVI HGSSGCYYYP ATLLNTHLYG TFILENEVIF 
GSGDRLREVI SDLSGKGDRI AVVTTCVPSA LGEDIREMVS GSELIVVDSP GFAGDLEAGY
QAALAALGPS VNPEQSGVNI DGISLADPFH EGNIQEITRL LYRAGIPVGT VFCRDRADKV
KTASPFTIGT NEDFGSGVGT SLGGTLGIPS LRTTLGKLAG IFDHADTDPV LKELDLQEER
LIHAGDKFLR RYEPPRVAIF AGAAYALFAA DTLKRYLDAE ILIVGTRNDL PEQPGGQYRV
EKLAGLEAVG RRITESDPDL VIGSSFERSL CPDRAFVGII PPFRGQVRLA HYPLAGLGGS
LYFVENVLNA CMDRAP