Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A2703 |
Symbol | |
ID | 3624451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 3428171 |
End bp | 3430411 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637701557 |
Product | nitrogen fixation protein NifH/NifE |
Protein accession | YP_306187 |
Protein GI | 73670172 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01287] nitrogenase iron protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.15509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATAAGC GGGTCCTTCA GGTCGGCTGC GACCCAAAAC ACGATTCTAC CCGTCTGCTC CTTGGAGGAG CTGACATGCC CACAGTATTG GACTATATGC GTGAAACTCC TCAGGAAGCA CAGAAGCTTG AAGATCTGGT ATTTGAAGGC TACGGAGGTA CTGCTTGCGT GGAAGCCGGG GGTCCAAAAC CCGGGATCGG ATGTGCAGGT AGAGGGATAC TGAGCAGTTT TGAGGTCTTA AAACGTGTGG GACTCAGGGC TTCTTCCTTT GACATCGTAC TTTATGATGT GCTTGGAGAT GTTGTCTGCG GGGGATTTGC AGTTCCCCTG AGAAAAGAAT ATGCCGATGC TGTTTTTCTG GTAACTTCAG GGGAATACAT GGCCTTGTAC GCTGCTAACA ATATCCTCAG AGGAATCCGA AATTTTGAGG ATTCAGTTCC CAGGGTTGCA GGTATTATCT TCAACAGACG AGGGCTTTTT GAGGAGGAGG AAAGGGTCTT TGCCTTTGCC CGGGCAACAG GTTTGCCTGT GCTGGCTTCT ATCCCAAGGG ATGAAGTCTT TTCTCAGGCT GAAAAAGTAG GGAAAACCCT GGCTGAAGCT TTTCCTTCCT CTGCTCCTGC AGGTATTTTC AGAGAGCTGG CATTGTATGT GAAAAACCTG GAAAAGGACC GGAACCTGCT GCATCCTGCC AACCCTCTCG GTGACATTGA GCTTGAGGCT CTGGTGCTGG GAAGGCGGGA ACGGCCCTGT ATAAAGCCAT TCGGAACAGG TTCTATATCC AGAAAAATAT CCTGTGCTGA ACCTCTTTCT CTCAGCCAAG AAAATAGACC TTTCGCAGAA ATCCATATGG AGATACCAGA AACAAGTTCC GAGAAAATCT GTCCGGTGAT TTCAGAAGCG CAGTCTGCAG AAATTTCCTC CAGAACTCCA ACAGAAGAAG TATCCCCATC ATCCCAGGCA AAACACCTCT CCAGACCTCT CCTTTGCGGC TGTGCCTTTT CAGGAGCTGT GACTGCTACT TTTCAGGTTA ACGATGCCGC CACGGTCCTG CATGGGCCTA GGAGTTGCAC TTATATAACG GCCGATGCCC TTACACGCTC GTTTTTATTA GGAGATGGAA GGAATATCAC AGATAAGGAA AAACAAGTCC CTGGCCTCCT GCCGACAGAC ATGAAGGAAG AGGACATTAT CTTTGGTGGA CTCGAAAAAC TTGAGAATAG AATAGAGAAA GCCCTTTTAG CTGGCTGGCA AACGGTCTTT GTGGTGAGCA CCTGTCCAGG AGGGATCATA GGAGACGATA TAAAGGAAGC AGTATCAAGA GCCAAAGTTC GTTTTCCTGG AGCCAGGGTC ATTCCTATCC CTGTGGAAGG CGTCATTACG GGAGATTTTT CGGCTGGTAT GCTCGAGGGT CGCAAACGCG TTGCTGATCT GATCGACCCC TCCGTAAGGC CCGAAAAAGG CCTGGTTAAT ATAATAGGAG AAAGGAATTT TTCCTCTCAA GAAGAAAAAA ACTTCTGTAT TGTTGAGAAA TTGCTTCAAA GACTGGGATA CAGGGTCAAC TGCAGGTTTT TAAAAGAGAC AGATACCCTT TCCCTTCGCT CTTTTAAAAA AGCGGAGCTC AACCTTTTAG CCTGTACCGA CCCTGATACC CGGGCCATCC AGGAATACCT CTCCGAGCGT TTCAGGCTTG AGTTTTTTGA GCTTCCATTC CCTGTGGGTT TTCGGGAAAG TTCCATTTGG GTAAAAGCTC TGGCAGCACG CTTGCTTCCC GGAAAGGATG TCTCTTCCTT GCTGCAGGAA CAGGAAAAGG TGTACAAAGC AGGGATCGAG AAATATGTTC CCAATCTCTC GGGAAAGCGT GTACTCTTGG TGAGTTACAC CGAAGATATC AGCTGGGTTC TGGACACAAT CCGAGACCTG GGAATGGAAA TCCTCAAGGT TGGTTTTTCA GTCTCGACTT TCAGGAAGGA ACTTCCGGAC CTCCTGTCCG ATGAAGGATT CCCTGTGGAG AGAAACTATA CGGATGAAAT GAGGGCTAAG GATGTCAGGG ATCTCAGACC GGACCTTGTT CTTTCGAGTT ATGTCCCTTC GGTCCCTGAA GGTGGAGTGC ATTATGATAC CATACCCGTC TCCCCGCAGG TTGGGTTCCT TAGCGGGCTT GAACTTGCAA AACGCTGGAG TACGCTTTTG AGGCTGCCAG TTGTGGAGGG ATGGAAGTAT GATGGAGGTG ATGAAAGTTG A
|
Protein sequence | MNKRVLQVGC DPKHDSTRLL LGGADMPTVL DYMRETPQEA QKLEDLVFEG YGGTACVEAG GPKPGIGCAG RGILSSFEVL KRVGLRASSF DIVLYDVLGD VVCGGFAVPL RKEYADAVFL VTSGEYMALY AANNILRGIR NFEDSVPRVA GIIFNRRGLF EEEERVFAFA RATGLPVLAS IPRDEVFSQA EKVGKTLAEA FPSSAPAGIF RELALYVKNL EKDRNLLHPA NPLGDIELEA LVLGRRERPC IKPFGTGSIS RKISCAEPLS LSQENRPFAE IHMEIPETSS EKICPVISEA QSAEISSRTP TEEVSPSSQA KHLSRPLLCG CAFSGAVTAT FQVNDAATVL HGPRSCTYIT ADALTRSFLL GDGRNITDKE KQVPGLLPTD MKEEDIIFGG LEKLENRIEK ALLAGWQTVF VVSTCPGGII GDDIKEAVSR AKVRFPGARV IPIPVEGVIT GDFSAGMLEG RKRVADLIDP SVRPEKGLVN IIGERNFSSQ EEKNFCIVEK LLQRLGYRVN CRFLKETDTL SLRSFKKAEL NLLACTDPDT RAIQEYLSER FRLEFFELPF PVGFRESSIW VKALAARLLP GKDVSSLLQE QEKVYKAGIE KYVPNLSGKR VLLVSYTEDI SWVLDTIRDL GMEILKVGFS VSTFRKELPD LLSDEGFPVE RNYTDEMRAK DVRDLRPDLV LSSYVPSVPE GGVHYDTIPV SPQVGFLSGL ELAKRWSTLL RLPVVEGWKY DGGDES
|
| |