Gene Mbar_A0168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A0168 
Symbol 
ID3624471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp191343 
End bp192944 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content44% 
IMG OID637699057 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_303733 
Protein GI73667718 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0488812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.836234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGGAG CAGAAATAGA TTCCAGAATT GAAGAAGAAC AAAAGCTTGT CGATGATATG 
TTAAAAGTGC TCCCGGAAAA AGCTTCGAGA AACAGAAGAA AACACATAGT TGTCAGGAAT
TGTTCCACTG AACAACACAT CGAAGCCGAC GACAAGGTAA TTCCTGGCAT CCTTACAAAC
CGCGGATGCG CTTTTGCAGG TACCAAGGGT GTGGTTTTCG GGCCGATTAA GGATATGGTA
CACCTCGTTC ACGGTCCTAT AGGCTGTGCA TTCTATACCT GGGGCACCAG ACGAAACTTT
GCAAAGGCAG AAGAGGGTGG AGATAACTTC ATGAATTACT GCGTATGCAC TGACATGAAG
GAAACCGATA TTGTCTTTGG AGGAGAAAAG AAGCTCAAAA CAGCTATTGA CGAAGTTGTG
AAAATTTTCC ATCCTGGAGC TATCACTATC TGCGCAACCT GTCCTGTAGG ACTTATCGGA
GATGACATTG AATCTGTTGC CAGAGAAGCT GAAATGGAGC ATGGGATCAA GGTAATTCCC
GCCCGTTGCG AGGGATACCG GGGAGTTAGC CAGTCGGCAG GCCACCATAT CGCAAGCAAC
GCCCTGATGG AACACCTCAT TGGGACTGAA GAAATTAAAA GTCCAACACC TTTTGATATA
AACGTCTTTG GAGAGTATAA TATTGGAGGA GATCTCTGGG AAGTCAAACC AATTTTTGAA
AAAATTGGGT ACAGAATTGT CTCAAGTTTA ACCGGAGACG GCTCATTCCA CAGGATCTCA
CAGGCTCATC AGGCAAAACT AAGTATTCTG CTTTGCCACC GTTCCATTAA CTATACTAAC
CGGATGATGG AAGAAAAATA CGGTGTTCCC TGGCTGAAAG TAAATTACAT CGGTACGAAA
GGAACTGAAA AATCCCTGCG GAAAATGGCA GAGTTCTTTG ACGATCCGGA AATTACCCGG
AAGACAGAAG AAGTCATTGC CGAAGAAAAA GCAAAATATG CAGATGATAT CGAAAAATAC
AGGAAAAAAC TTCAGGGAAA AACTGCTTTC ATCTATGCCG GAGGTTCTAG ATCGCATCAC
TATATAAATC TCTTTGAGGA ACTTGGCATG AAAGTGATTG CTGCAGGCTA TCAATTTGCT
CACAGAGACG ACTATGAAGG CAGGCAGATT ATCCCTCATA TGAAAGAAAA AGCTCTTGGC
TCAATTCTTG AAGATGTGCA CTACGAAAGG GATGAAAACG TCAAATCCGC AGTCAGTCCT
GAAAGGATTG AAGAACTGAA AACAAAAATT GGACTCATGG ACTATAAAGG CCTTTTCCCG
GATGCGGAAG ACGGAACCAT TGTTATTGAT GATCTTAACC ACCATGAGAC TGAAGTCCTT
GTAAAGACCC TTAAGCCGGA TATTTTCTGC TCTGGAATCA AGGATAAGTA CTGGGCTCAG
AAACTTGGGG TCCCGTCAAG ACAGATCCAT TCGTACGATT ACAGTGGAAG GTATACAGGT
TTTTCCGGAG TTTTGAACTT TGCAAGGGAC ATTGACATGG CTCTGCACAG CCCAACCTGG
AAGTTCATAC GCCCACCCTG GAAAGCAGAA GATGTAGAAT AA
 
Protein sequence
MMGAEIDSRI EEEQKLVDDM LKVLPEKASR NRRKHIVVRN CSTEQHIEAD DKVIPGILTN 
RGCAFAGTKG VVFGPIKDMV HLVHGPIGCA FYTWGTRRNF AKAEEGGDNF MNYCVCTDMK
ETDIVFGGEK KLKTAIDEVV KIFHPGAITI CATCPVGLIG DDIESVAREA EMEHGIKVIP
ARCEGYRGVS QSAGHHIASN ALMEHLIGTE EIKSPTPFDI NVFGEYNIGG DLWEVKPIFE
KIGYRIVSSL TGDGSFHRIS QAHQAKLSIL LCHRSINYTN RMMEEKYGVP WLKVNYIGTK
GTEKSLRKMA EFFDDPEITR KTEEVIAEEK AKYADDIEKY RKKLQGKTAF IYAGGSRSHH
YINLFEELGM KVIAAGYQFA HRDDYEGRQI IPHMKEKALG SILEDVHYER DENVKSAVSP
ERIEELKTKI GLMDYKGLFP DAEDGTIVID DLNHHETEVL VKTLKPDIFC SGIKDKYWAQ
KLGVPSRQIH SYDYSGRYTG FSGVLNFARD IDMALHSPTW KFIRPPWKAE DVE