Gene Mboo_2008 details


Gene Information

Locus tag: Mboo_2008
Symbol:
ID: 5411889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2077493
End bp: 2078467
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 54%
IMG OID: 640869250
Product: nitrogenase iron protein
Protein accession: YP_001405165
Protein GI: 154151547
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1348] Nitrogenase subunit NifH (ATPase)
TIGRFAM ID: [TIGR01287] nitrogenase iron protein
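
The length fields in this record are consistent with one another under the usual conventions. The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. Neither assumption is stated by the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2077493, 2078467)
print(length, protein_length_aa(length))  # prints: 975 324
```

Both values match the Gene Length and Protein Length fields above.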


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.221735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.365396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAC AGAGAAACAT TGCGATATAT GGAAAAGGCG GCATCGGAAA GTCCACCACC 
TCGTCCAATA TCAGTGCCGC ACTCTCGGAA CTTGGCCTCA AAGTGATGCA GATCGGATGT
GATCCCAAGA GCGATTCAAC AAACACGCTC CGGGGAGGCC GGTTCATCCC GACAGTTCTG
GATTCCCTGA GAAGCGGAAA GAGGGTTGAG ACAAGTGACA TTATCCACGA AGGATTCAAT
GGTGTCCTCT GCGTTGAAGC CGGGGGCCCT GAGCCCGGCG TCGGCTGTGC CGGCCGCGGC
ATCATCACCG CAATTGAACT CCTCCGCCAG CGGAAGGTCT TTGAAGAGTT CAAACCGGAT
GTCGTTATTT ACGATGTCCT TGGGGATGTG GTCTGCGGGG GATTTGGTAT ACCCATCCGT
GAAGGCGTGG CCGAACAGGT CTACACCGTG AGTTCGTCCG ACTTCATGGC GATTTATGCG
GCAAACAATC TCTTCAAAGG CATCAAGAAG TATGCGAACA GCGGGGGAGC ATTGTTCTCC
GGTATCATAG CAAATTCCAC AAACCTGCCT GTACAGCGCG AGATTGTCGA GGACTTTGCC
GCTTCCACAA AGACAACTAT TGCAGAATAT GTTCCCCGTT CGCTTACCGT AACCAAAAGT
GAGCTCCAGG GAAAGACTGT GATAGAGGCG GCTCCGGAGT CGGAGCAGGC CGAGGTCTAC
CGTGTCCTTG CAAAGAAGAT CCTCAGTAAT CAGGACCGGT ATGTCCCGGC TCCGCTGGAT
ACCGACCAGC TCAAAGACTG GGCCGAGAGC TGGTCTGACA AACTGCTCGA GCAGCGCGAC
TCCCCGACCC ATGTGAAGTG TGAACTCGAA TGCATTGTCC CCGGCGCAGA GCCAATTATT
GCAAAACCCC GATCCCAGCG ATCTCCTGCA GTAAAAGCGG CCACTCCCCG GAGAACTGCC
AAAGCAAAAG GATGA
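
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do so in Python; it assumes the sequence is pasted as a string in the space-grouped layout used here, and `gc_percent` is a hypothetical helper name, not part of any database tooling.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are ignored, so the space-grouped
    blocks of 10 bases can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first sequence line only; pass the full gene sequence
# to compare against the GC content field of the record.
first_line = "ATGGCAAAAC AGAGAAACAT TGCGATATAT GGAAAAGGCG GCATCGGAAA GTCCACCACC"
print(f"{gc_percent(first_line):.1f}%")
```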
 
Protein sequence
MAKQRNIAIY GKGGIGKSTT SSNISAALSE LGLKVMQIGC DPKSDSTNTL RGGRFIPTVL 
DSLRSGKRVE TSDIIHEGFN GVLCVEAGGP EPGVGCAGRG IITAIELLRQ RKVFEEFKPD
VVIYDVLGDV VCGGFGIPIR EGVAEQVYTV SSSDFMAIYA ANNLFKGIKK YANSGGALFS
GIIANSTNLP VQREIVEDFA ASTKTTIAEY VPRSLTVTKS ELQGKTVIEA APESEQAEVY
RVLAKKILSN QDRYVPAPLD TDQLKDWAES WSDKLLEQRD SPTHVKCELE CIVPGAEPII
AKPRSQRSPA VKAATPRRTA KAKG
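
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python illustration, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first. A codon containing anything other than
    A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGGCAAAAC AGAGAAACAT TGCGATATAT"))  # prints: MAKQRNIAIY
print(translate("AAAGCAAAAG GATGA"))                  # prints: KAKG
```

The two fragments are the first 30 bases and the last 15 bases of the gene sequence; their translations match the first ten and last four residues of the protein sequence.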