Gene Mbar_A2703 details

Gene Information

Locus tag: Mbar_A2703
Symbol:
ID: 3624451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3428171
End bp: 3430411
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 49%
IMG OID: 637701557
Product: nitrogen fixation protein NifH/NifE
Protein accession: YP_306187
Protein GI: 73670172
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1348] Nitrogenase subunit NifH (ATPase); [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01287] nitrogenase iron protein
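
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3428171, 3430411)
print(length)                     # 2241
print(protein_length_aa(length))  # 746
```

Both results agree with the Gene Length and Protein Length fields of this record.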


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.15509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAATAAGC GGGTCCTTCA GGTCGGCTGC GACCCAAAAC ACGATTCTAC CCGTCTGCTC 
CTTGGAGGAG CTGACATGCC CACAGTATTG GACTATATGC GTGAAACTCC TCAGGAAGCA
CAGAAGCTTG AAGATCTGGT ATTTGAAGGC TACGGAGGTA CTGCTTGCGT GGAAGCCGGG
GGTCCAAAAC CCGGGATCGG ATGTGCAGGT AGAGGGATAC TGAGCAGTTT TGAGGTCTTA
AAACGTGTGG GACTCAGGGC TTCTTCCTTT GACATCGTAC TTTATGATGT GCTTGGAGAT
GTTGTCTGCG GGGGATTTGC AGTTCCCCTG AGAAAAGAAT ATGCCGATGC TGTTTTTCTG
GTAACTTCAG GGGAATACAT GGCCTTGTAC GCTGCTAACA ATATCCTCAG AGGAATCCGA
AATTTTGAGG ATTCAGTTCC CAGGGTTGCA GGTATTATCT TCAACAGACG AGGGCTTTTT
GAGGAGGAGG AAAGGGTCTT TGCCTTTGCC CGGGCAACAG GTTTGCCTGT GCTGGCTTCT
ATCCCAAGGG ATGAAGTCTT TTCTCAGGCT GAAAAAGTAG GGAAAACCCT GGCTGAAGCT
TTTCCTTCCT CTGCTCCTGC AGGTATTTTC AGAGAGCTGG CATTGTATGT GAAAAACCTG
GAAAAGGACC GGAACCTGCT GCATCCTGCC AACCCTCTCG GTGACATTGA GCTTGAGGCT
CTGGTGCTGG GAAGGCGGGA ACGGCCCTGT ATAAAGCCAT TCGGAACAGG TTCTATATCC
AGAAAAATAT CCTGTGCTGA ACCTCTTTCT CTCAGCCAAG AAAATAGACC TTTCGCAGAA
ATCCATATGG AGATACCAGA AACAAGTTCC GAGAAAATCT GTCCGGTGAT TTCAGAAGCG
CAGTCTGCAG AAATTTCCTC CAGAACTCCA ACAGAAGAAG TATCCCCATC ATCCCAGGCA
AAACACCTCT CCAGACCTCT CCTTTGCGGC TGTGCCTTTT CAGGAGCTGT GACTGCTACT
TTTCAGGTTA ACGATGCCGC CACGGTCCTG CATGGGCCTA GGAGTTGCAC TTATATAACG
GCCGATGCCC TTACACGCTC GTTTTTATTA GGAGATGGAA GGAATATCAC AGATAAGGAA
AAACAAGTCC CTGGCCTCCT GCCGACAGAC ATGAAGGAAG AGGACATTAT CTTTGGTGGA
CTCGAAAAAC TTGAGAATAG AATAGAGAAA GCCCTTTTAG CTGGCTGGCA AACGGTCTTT
GTGGTGAGCA CCTGTCCAGG AGGGATCATA GGAGACGATA TAAAGGAAGC AGTATCAAGA
GCCAAAGTTC GTTTTCCTGG AGCCAGGGTC ATTCCTATCC CTGTGGAAGG CGTCATTACG
GGAGATTTTT CGGCTGGTAT GCTCGAGGGT CGCAAACGCG TTGCTGATCT GATCGACCCC
TCCGTAAGGC CCGAAAAAGG CCTGGTTAAT ATAATAGGAG AAAGGAATTT TTCCTCTCAA
GAAGAAAAAA ACTTCTGTAT TGTTGAGAAA TTGCTTCAAA GACTGGGATA CAGGGTCAAC
TGCAGGTTTT TAAAAGAGAC AGATACCCTT TCCCTTCGCT CTTTTAAAAA AGCGGAGCTC
AACCTTTTAG CCTGTACCGA CCCTGATACC CGGGCCATCC AGGAATACCT CTCCGAGCGT
TTCAGGCTTG AGTTTTTTGA GCTTCCATTC CCTGTGGGTT TTCGGGAAAG TTCCATTTGG
GTAAAAGCTC TGGCAGCACG CTTGCTTCCC GGAAAGGATG TCTCTTCCTT GCTGCAGGAA
CAGGAAAAGG TGTACAAAGC AGGGATCGAG AAATATGTTC CCAATCTCTC GGGAAAGCGT
GTACTCTTGG TGAGTTACAC CGAAGATATC AGCTGGGTTC TGGACACAAT CCGAGACCTG
GGAATGGAAA TCCTCAAGGT TGGTTTTTCA GTCTCGACTT TCAGGAAGGA ACTTCCGGAC
CTCCTGTCCG ATGAAGGATT CCCTGTGGAG AGAAACTATA CGGATGAAAT GAGGGCTAAG
GATGTCAGGG ATCTCAGACC GGACCTTGTT CTTTCGAGTT ATGTCCCTTC GGTCCCTGAA
GGTGGAGTGC ATTATGATAC CATACCCGTC TCCCCGCAGG TTGGGTTCCT TAGCGGGCTT
GAACTTGCAA AACGCTGGAG TACGCTTTTG AGGCTGCCAG TTGTGGAGGG ATGGAAGTAT
GATGGAGGTG ATGAAAGTTG A
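
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and whether the database rounds or truncates its reported percentage is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "TTGAATAAGC GGGTCCTTCA GGTCGGCTGC GACCCAAAAC ACGATTCTAC CCGTCTGCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Applying `gc_percent` to the full cleaned sequence rather than the first line gives the value to compare against the record.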
 
Protein sequence
MNKRVLQVGC DPKHDSTRLL LGGADMPTVL DYMRETPQEA QKLEDLVFEG YGGTACVEAG 
GPKPGIGCAG RGILSSFEVL KRVGLRASSF DIVLYDVLGD VVCGGFAVPL RKEYADAVFL
VTSGEYMALY AANNILRGIR NFEDSVPRVA GIIFNRRGLF EEEERVFAFA RATGLPVLAS
IPRDEVFSQA EKVGKTLAEA FPSSAPAGIF RELALYVKNL EKDRNLLHPA NPLGDIELEA
LVLGRRERPC IKPFGTGSIS RKISCAEPLS LSQENRPFAE IHMEIPETSS EKICPVISEA
QSAEISSRTP TEEVSPSSQA KHLSRPLLCG CAFSGAVTAT FQVNDAATVL HGPRSCTYIT
ADALTRSFLL GDGRNITDKE KQVPGLLPTD MKEEDIIFGG LEKLENRIEK ALLAGWQTVF
VVSTCPGGII GDDIKEAVSR AKVRFPGARV IPIPVEGVIT GDFSAGMLEG RKRVADLIDP
SVRPEKGLVN IIGERNFSSQ EEKNFCIVEK LLQRLGYRVN CRFLKETDTL SLRSFKKAEL
NLLACTDPDT RAIQEYLSER FRLEFFELPF PVGFRESSIW VKALAARLLP GKDVSSLLQE
QEKVYKAGIE KYVPNLSGKR VLLVSYTEDI SWVLDTIRDL GMEILKVGFS VSTFRKELPD
LLSDEGFPVE RNYTDEMRAK DVRDLRPDLV LSSYVPSVPE GGVHYDTIPV SPQVGFLSGL
ELAKRWSTLL RLPVVEGWKY DGGDES