Gene Mboo_2010 details

Gene Information

Locus tag: Mboo_2010
Symbol:
ID: 5411891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2080019
End bp: 2081365
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 56%
IMG OID: 640869252
Product: nitrogenase
Protein accession: YP_001405167
Protein GI: 154151549
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
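
The coordinate and length fields above are tied together by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2080019
END_BP = 2081365
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 2081365 - 2080019 + 1 = 1347 bp; 1347 / 3 = 449 codons, minus the stop = 448 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```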


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.696694
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid clonability information:
Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.320293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGAAG CAGAAACCGT ACCTCAGGAT ATTGATACCC GGTACCCCGA GGTTGTGGAG 
GCACCGCGGT GGACCTGCTC ACTTGGGGGA GCATACATTA CGAGTACCGG GGTCTACGGA
GTAGTCCCCA TCCTCCATGC CGGGGCAGGC TGTGGCATTG CTCAACTGCT TGGCGTGTAC
TATGCAGCGG GTGAGAATGC TGCCGGCGGG CAGGGCGGGA CGAGCACTCC CTGTTCATGC
CTTATCGAGA AGCACGTGAT CTTCGGCGGG GAGGACAAAC TCCGGAAGCT GATCGATTCA
ACGATCCAGC TGATGGAGGG CGATCTCTAC GTGGTAATCT CCGGGTGTGT CCCGGCGCTG
ATCGGAGACG ACGTTGACTC CGTAGTCAGG GAGTTCAAAG GGAAAGCAGA TGTCATCTTT
GTTAATACCG CCGGGTTCAA GGGCAACACG TTCGATGGCT ACGAGGAGTT CCTAGGTGCG
GTCATTGACC AGTTCCTGGA ACCAAGGAAA AAGAAGAAGA AGGTTGTCAA CATCCTCGGG
GTCGTTCCCT TCCAGCACGT CTTCTGGAAA GGCGACCTCA ATGTCGTCAA GAACCTGTTG
GCAAAGATCG GCGTGGAGGC AAATATCCTC TTCACCCAGT TCGATGGCGT CAAAAAACTC
AAAGAGATTC CGGAAGCGGA ACTCAACATC GTCCTTTCCA CCTGGAACGG GCACAAGGCT
GCCGGAAAAC TCAAAGAGAG ATTCGGCCAG GAATACCTGA CGTTCCCAAG CCCGCCAATC
GGGCCGAAGC AGACGTCAGA ATTCCTCCGC TCCGTGGCAA AGAAACTCAG GATCCCCAAA
AAGGATGTGG AACGGGTCAT CGCAGAGGAA GAGCGGCACG TATACCGGTT CACCGAGTAC
CTGACCGATG CAATCATCAT AGGCCTTCCC CATCCGTACG CAGCCTTTGT CGGGGATTCC
AACACCGTAA TTGGCATTGC AAAATACCTT GCAAACGAGG TCGGGTACCT TCCCGAGGTG
GTGCAGATTA CCGATGAACC CCCCGAGGAA GCCCGGGAGT GGATCCGGCG GGAGCTGACC
GAGAACATCG AGTCCACGAT CAAACCGGAT ATCCTCTTCG AGAAAGACAC GTTCCGGATC
CGTGAGAACC TGCGGGACAG GAGTTTCCAA GTCATGCTCG CCACCTCGCT TGAAAAGTGG
CCTGCGGCAA AGGAATTCGG CGTTGCCCAC CTGAGTGTTG GTTTCCCGAT GTACGATCGC
GTTATCGTTG ACCGGAACTA TGCCGGGTAC AGAGGCGGGC TTGCTCTTCT GGAAGACCTC
ATCGCCAAAT ATGTCGGCCC TCTTTAA
 
Protein sequence
MTEAETVPQD IDTRYPEVVE APRWTCSLGG AYITSTGVYG VVPILHAGAG CGIAQLLGVY 
YAAGENAAGG QGGTSTPCSC LIEKHVIFGG EDKLRKLIDS TIQLMEGDLY VVISGCVPAL
IGDDVDSVVR EFKGKADVIF VNTAGFKGNT FDGYEEFLGA VIDQFLEPRK KKKKVVNILG
VVPFQHVFWK GDLNVVKNLL AKIGVEANIL FTQFDGVKKL KEIPEAELNI VLSTWNGHKA
AGKLKERFGQ EYLTFPSPPI GPKQTSEFLR SVAKKLRIPK KDVERVIAEE ERHVYRFTEY
LTDAIIIGLP HPYAAFVGDS NTVIGIAKYL ANEVGYLPEV VQITDEPPEE AREWIRRELT
ENIESTIKPD ILFEKDTFRI RENLRDRSFQ VMLATSLEKW PAAKEFGVAH LSVGFPMYDR
VIVDRNYAGY RGGLALLEDL IAKYVGPL
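
The sequences above are laid out in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute a GC fraction, and translate a reading frame. It is an illustration under stated assumptions, not the method used to produce this record: the codon assignments follow the standard layout (which translation table 11 shares), and only ATG, GTG and TTG are treated as start codons, which is a subset of the initiators that table 11 allows. All function names are hypothetical.

```python
import itertools

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiators, read as Met in first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate frame 1 until the first stop codon or the last complete codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon, "X")  # X marks codons with ambiguous bases
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGACAGAAG CAGAAACCGT ACCTCAGGAT ATTGATACCC GGTACCCCGA GGTTGTGGAG"
print(translate(first_line))  # MTEAETVPQDIDTRYPEVVE
```

The printed peptide matches the first twenty residues of the protein sequence listed above, including the GTG start codon being read as methionine.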