Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A0166 |
Symbol | |
ID | 3626694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 188326 |
End bp | 189882 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637699055 |
Product | nitrogenase associated protein E |
Protein accession | YP_303731 |
Protein GI | 73667716 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA AAATTGGAAT AGTGGATACT CTCGAAGAAA GGAAGCCATA TATTACCAGA AAACAAGAAA AAGGGCAGGA AATTCCTCTT GCCTGTGATA ATAACTCTCT TGCAGGAGCT ATCAGCCAGC GAGCCTGTGT TTATTCGGGA GCCAGGGTTG TCTTAAACCC TGTAACCGAT GCCGTTCACC TTGTCCACGG CCCAATCGGC TGTGCAGGCT ATACCTGGGA CATAAGAGGT GCAAAGTCCA GTGGGATTGA AACAAACCGC ACTAGCTTTA GCACGGATAT GAAAGAGATC GATGTAGTCT TCGGAGGAGA GAAAAAGCTT TCAAGTGCAA TTGATGAACT GGTGGAGCTC TACCACCCTC CTGTTATTTT CGTTTATTCC ACGTGCATAG TTGGAATCAT TGGGGATGAT CTGGAGTCCG TGTGCAAAAC TGCAAGCCAG AAACACAATA TCCATGTAAT TCCTGTAAAA TCCGAAGGAT TCAAAGGCAA TAAGTCCGAC GGATATAAAG CTGCCTGTGA CGCCTTAAAA AGGTTGATCA AAAGACCTTC CGAAGATGAA ATTAAGAAGA AAGGTCCCAG AGTTCCTGAT AACATAAAGC CAAAGATTAA CATTTTAGGG GACTTTAACG TAGCCGGAGA TGTCTGGCTC GTAAAGCCTC TCTTTGAGCA GATGGGAATT GAGGTTATAG TCTCAATGAC AGGAGATTCA ACTGCAAAAG CCATATCAAG GGCAGCTGAA GCTGACCTTA ACCTTGTCCA GTGCAGTGGG TCCATGACCT ATCTTGCAAA ATGGATGCAG ACGGAATATG GAATTCCCTA CTTAAACGCA AGTTTCTTTG GAATTGAAGA TATCTCCTTA GCCTTGCGAA GAACTGCGGA TTATTTTGGT TCCGAAAAGA TGAGAGAACG GGCTGATAGA ATTCTGGAAG CTGAAATAAA CCGTATAATG CCTGAAATTT CCAGAGTTCG GGAAAGGGTC AAAGGAAAGA AGGCCGCCAT TTACATGGGA GGGCCTGCAA AAGCTCTCAC GCTTATCAAA GGTTTTGCCG AACTTGGCAT GGAAGTCGTT ATTATCGGGA CCCAGACAGG GAAAAAAGAG GATTACGAGC AAATCAGTTA TTCGGTAAGG GATGGGACAG TTATTGTTGA TGATGCGAAC CCCCTTGAAC TTGCCGAACT GCTCATTAGA CAGAAAGCTG ACCTGATGGT TGCAGGCGTA AAGGAGAGAT TTATTGCATA CAAGCTTGGA ATTGCTTTCT GTGACTTCAA CCATGACAGG GTGGTGGAGT TCGAAGGTTT TGATGGCTTT GTAAATTTTG CACGAGAAGT GGACGCTTCC ATCAGTAGCC CTGTATGGAA AGCTGTTAAA GAAAGAATTC TGAAACCCGA AGCAGTGGAA TCAGAACAAA AATTAGGTAA AATAGAGAAA GTGGCAGTAA AAGACATGAC TTCTGGAGAA AATTACGCAA AAGAGTGTAA AGGCATGCTT CTGAAACCTG AACTTTTGCA CCAGAAATCC GAGGCTGCAG TTGAGAGTCA GGTATGA
|
Protein sequence | MKEKIGIVDT LEERKPYITR KQEKGQEIPL ACDNNSLAGA ISQRACVYSG ARVVLNPVTD AVHLVHGPIG CAGYTWDIRG AKSSGIETNR TSFSTDMKEI DVVFGGEKKL SSAIDELVEL YHPPVIFVYS TCIVGIIGDD LESVCKTASQ KHNIHVIPVK SEGFKGNKSD GYKAACDALK RLIKRPSEDE IKKKGPRVPD NIKPKINILG DFNVAGDVWL VKPLFEQMGI EVIVSMTGDS TAKAISRAAE ADLNLVQCSG SMTYLAKWMQ TEYGIPYLNA SFFGIEDISL ALRRTADYFG SEKMRERADR ILEAEINRIM PEISRVRERV KGKKAAIYMG GPAKALTLIK GFAELGMEVV IIGTQTGKKE DYEQISYSVR DGTVIVDDAN PLELAELLIR QKADLMVAGV KERFIAYKLG IAFCDFNHDR VVEFEGFDGF VNFAREVDAS ISSPVWKAVK ERILKPEAVE SEQKLGKIEK VAVKDMTSGE NYAKECKGML LKPELLHQKS EAAVESQV
|
| |