Gene Mboo_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2004 
Symbol 
ID5410428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2072781 
End bp2074328 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content52% 
IMG OID640869246 
Productnitrogenase 
Protein accessionYP_001405161 
Protein GI154151543 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.811675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA ATAAATCACC ACATATTATC GATTCCAAGA TCAATCTTGA GGAAGCAACG 
TGCCCGAACC GTGAACAGCG TGCAAATGGG ATTAATGTCT GGTACGGAAA GGCCAGCGAT
CTGGTAAAGG AGGCGCGTGA AGGAACCCTT ACGCGTCGGG AACGAAAATT CCAGCAGACC
TCAGGCTGCG TGCTCAACTT CTATCTCACG GTACGGGTGG GAACGATACG CGATGCAGCG
GTAGTTTACC ATGCACCGGT TGGATGTTCC TCATCCGCTC TTGGGTACCG GGAACTGTAC
CGCGGTGTTC CGGTTGAACT CGGGCGACCG GCAGAATATG ATCTCCACTG GATAACCACC
AACCTCCGGG AAAATGATGT TGTCTATGGC GCAACCGAAA AACTCAAGAG TGCCATATTT
GAGGCACAAC GCCGCTACAA TCCCAAGGCC ATATTCGTTA TGACTTCATG CACCTCCGGG
ATCATCGGAG AGGATATTGA AGGAGTGGTT GCAGAGGTTC AGCCAAAAGT GAAAGCCAGG
ATAGTTCCGG TCCACTGCGA AGGTTCGCGA TCCCGGCTGG TACAGACCGG GTACGATGCG
TTCTGGCATG GGGTCCTGAA GTACCTTGTG AAAAAACCGC AGAAGAAGCA GAAGGATCTG
GTCAATGTTG CAAGCATGCT TTCGTATACC TGGCAGGACA GGCTTGAGAT CAAGAGATTG
CTCGAAAAAG TCGGGCTCCG GGTCAATTTT ATTCCGGAAT TTGCAACTGT CGAGCAGCTG
GAACAGCTCA GTGAAGCTGC GGTAACGGCA CCCTTGTGCC CGACCTACAC TGATTACCTC
TCCCGGGGAC TTGAACAGGA ATACGGTGTC CCCTATTTCC TCTATCCTTC CCCGATGGGT
ATTGCAAACA CGGACGCCTG GCTGCGGGAG ATTGGAAAAC ATACCGGGAA ATCAAAAGAG
ATTGAGAAAC TCATCGAGGA TGAACATAAA GTCTGGATTC CCAAGCTTAA GGCAATTCAG
GAAGAATTTG CAAAAGTTAA AGCCGACGGT AAGAAAGTGG AAGTGCTTGG CGCACTCGGC
CAGGGGCGGC TTCTTGCCCA GCTCCCGTAC TTCGATGAAC TGGGACTCAA ATCTTCGGCA
GCCATGTGCC AGGATTTTGA TAACCTGATT CTCGGGGATC TCGAAAACCT GATCAAAAAT
GTAGGAGACT TTGATATCCT GGTCAACACG TTCCAGGCAG CGGAACAGTC ACACATAACA
AGAAAACTTG ATCCGGATAT TGCTCTCACC TGTCCGTTCC AGGGAGGAGC GTTCAAGCGG
GATAAAGGTA TGACCAGGAT TCACGCACTC CGGGGCGATC CGGATCCCTG GAGCCGGCAA
AGCGGGTACA CAGGTGCGAT CGCATTTGGG AATTTCCTGC TTCAGTCGCT CAAAAGCAGT
GCGTTCCAGC GGACCATGCT CGAAAAAACA GAGAACACCT ACAAGGACTG GTGGTACCGT
CAGCCCGATC CCCTCCACTA CCTGATAAAA GAGGATGGAG AGCCATGA
 
Protein sequence
MTENKSPHII DSKINLEEAT CPNREQRANG INVWYGKASD LVKEAREGTL TRRERKFQQT 
SGCVLNFYLT VRVGTIRDAA VVYHAPVGCS SSALGYRELY RGVPVELGRP AEYDLHWITT
NLRENDVVYG ATEKLKSAIF EAQRRYNPKA IFVMTSCTSG IIGEDIEGVV AEVQPKVKAR
IVPVHCEGSR SRLVQTGYDA FWHGVLKYLV KKPQKKQKDL VNVASMLSYT WQDRLEIKRL
LEKVGLRVNF IPEFATVEQL EQLSEAAVTA PLCPTYTDYL SRGLEQEYGV PYFLYPSPMG
IANTDAWLRE IGKHTGKSKE IEKLIEDEHK VWIPKLKAIQ EEFAKVKADG KKVEVLGALG
QGRLLAQLPY FDELGLKSSA AMCQDFDNLI LGDLENLIKN VGDFDILVNT FQAAEQSHIT
RKLDPDIALT CPFQGGAFKR DKGMTRIHAL RGDPDPWSRQ SGYTGAIAFG NFLLQSLKSS
AFQRTMLEKT ENTYKDWWYR QPDPLHYLIK EDGEP