Gene Mbar_A0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A0166 
Symbol 
ID3626694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp188326 
End bp189882 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content44% 
IMG OID637699055 
Productnitrogenase associated protein E 
Protein accessionYP_303731 
Protein GI73667716 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA AAATTGGAAT AGTGGATACT CTCGAAGAAA GGAAGCCATA TATTACCAGA 
AAACAAGAAA AAGGGCAGGA AATTCCTCTT GCCTGTGATA ATAACTCTCT TGCAGGAGCT
ATCAGCCAGC GAGCCTGTGT TTATTCGGGA GCCAGGGTTG TCTTAAACCC TGTAACCGAT
GCCGTTCACC TTGTCCACGG CCCAATCGGC TGTGCAGGCT ATACCTGGGA CATAAGAGGT
GCAAAGTCCA GTGGGATTGA AACAAACCGC ACTAGCTTTA GCACGGATAT GAAAGAGATC
GATGTAGTCT TCGGAGGAGA GAAAAAGCTT TCAAGTGCAA TTGATGAACT GGTGGAGCTC
TACCACCCTC CTGTTATTTT CGTTTATTCC ACGTGCATAG TTGGAATCAT TGGGGATGAT
CTGGAGTCCG TGTGCAAAAC TGCAAGCCAG AAACACAATA TCCATGTAAT TCCTGTAAAA
TCCGAAGGAT TCAAAGGCAA TAAGTCCGAC GGATATAAAG CTGCCTGTGA CGCCTTAAAA
AGGTTGATCA AAAGACCTTC CGAAGATGAA ATTAAGAAGA AAGGTCCCAG AGTTCCTGAT
AACATAAAGC CAAAGATTAA CATTTTAGGG GACTTTAACG TAGCCGGAGA TGTCTGGCTC
GTAAAGCCTC TCTTTGAGCA GATGGGAATT GAGGTTATAG TCTCAATGAC AGGAGATTCA
ACTGCAAAAG CCATATCAAG GGCAGCTGAA GCTGACCTTA ACCTTGTCCA GTGCAGTGGG
TCCATGACCT ATCTTGCAAA ATGGATGCAG ACGGAATATG GAATTCCCTA CTTAAACGCA
AGTTTCTTTG GAATTGAAGA TATCTCCTTA GCCTTGCGAA GAACTGCGGA TTATTTTGGT
TCCGAAAAGA TGAGAGAACG GGCTGATAGA ATTCTGGAAG CTGAAATAAA CCGTATAATG
CCTGAAATTT CCAGAGTTCG GGAAAGGGTC AAAGGAAAGA AGGCCGCCAT TTACATGGGA
GGGCCTGCAA AAGCTCTCAC GCTTATCAAA GGTTTTGCCG AACTTGGCAT GGAAGTCGTT
ATTATCGGGA CCCAGACAGG GAAAAAAGAG GATTACGAGC AAATCAGTTA TTCGGTAAGG
GATGGGACAG TTATTGTTGA TGATGCGAAC CCCCTTGAAC TTGCCGAACT GCTCATTAGA
CAGAAAGCTG ACCTGATGGT TGCAGGCGTA AAGGAGAGAT TTATTGCATA CAAGCTTGGA
ATTGCTTTCT GTGACTTCAA CCATGACAGG GTGGTGGAGT TCGAAGGTTT TGATGGCTTT
GTAAATTTTG CACGAGAAGT GGACGCTTCC ATCAGTAGCC CTGTATGGAA AGCTGTTAAA
GAAAGAATTC TGAAACCCGA AGCAGTGGAA TCAGAACAAA AATTAGGTAA AATAGAGAAA
GTGGCAGTAA AAGACATGAC TTCTGGAGAA AATTACGCAA AAGAGTGTAA AGGCATGCTT
CTGAAACCTG AACTTTTGCA CCAGAAATCC GAGGCTGCAG TTGAGAGTCA GGTATGA
 
Protein sequence
MKEKIGIVDT LEERKPYITR KQEKGQEIPL ACDNNSLAGA ISQRACVYSG ARVVLNPVTD 
AVHLVHGPIG CAGYTWDIRG AKSSGIETNR TSFSTDMKEI DVVFGGEKKL SSAIDELVEL
YHPPVIFVYS TCIVGIIGDD LESVCKTASQ KHNIHVIPVK SEGFKGNKSD GYKAACDALK
RLIKRPSEDE IKKKGPRVPD NIKPKINILG DFNVAGDVWL VKPLFEQMGI EVIVSMTGDS
TAKAISRAAE ADLNLVQCSG SMTYLAKWMQ TEYGIPYLNA SFFGIEDISL ALRRTADYFG
SEKMRERADR ILEAEINRIM PEISRVRERV KGKKAAIYMG GPAKALTLIK GFAELGMEVV
IIGTQTGKKE DYEQISYSVR DGTVIVDDAN PLELAELLIR QKADLMVAGV KERFIAYKLG
IAFCDFNHDR VVEFEGFDGF VNFAREVDAS ISSPVWKAVK ERILKPEAVE SEQKLGKIEK
VAVKDMTSGE NYAKECKGML LKPELLHQKS EAAVESQV