Gene Mpal_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0555 
Symbol 
ID7271971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp547235 
End bp548605 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content61% 
IMG OID643569202 
ProductNitrogenase 
Protein accessionYP_002465651 
Protein GI219851219 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.376218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA CACGGGTACG ACAGGTGAAC GAGAACCAGT GCCAGATGTG TATGCCCCTC 
GGGGGCGTCG TCGCCTTCAA GGGGATCGAA GGGGCGATGG TGCTGGTGCA CGGTTCCCAG
GGATGCAGCA CCTACATGCG GCTCGCAAAT GTCGAACATT ACAACGAACC GATCGATGTG
GCGTCATCAG CTCTCAACGA GAAACAGACC ATCTACGGCG GGGAGAAGAA CCTGAAGAAG
GCACTGGACA ACGTGATCAG GGTTTATGAG CCGAAGGTGC TCGGGATCGT CACCTCCTGC
CTCGCAGAGA CGATGGGAGA GGACCTCACG CGGATGATCG AGTCCTACAC CAGGGAACGG
AGTACCGAGG GGATAGACAT CATCCCGGTG GCCACACCGA GTTATGCGGG GAGCCACACC
GAGGGATTCT GGGCGGCGAC AAGAGACCTC ATCGCCTACT TTGCCAGACC GACCGAACCG
CACCAGCGGA TCAATGTGAT CATCCCCCAT ATCAGCCCGG CGGATATTCG TGAGATCAAG
CGGATCTTCG ATCTGATGGG GCTTGAGTAC ATGCTGATCC CCGACTACTC CATGACCCTG
GACCGCCCCT TTGGGGGACG GTACCAGAAG ATCCCGCCAG GCGGCACCAG CACCGCCGAC
ATCGCAGCGA TGCCCGGGGC ACGGGCTACC GTCCAGTTCG GGCTGACCTG CCCGGACGAC
CTGTCGCCGG GGCTGTACCT GCAGAAGCAG TTCGGCGTCC CGCTGATCAC CCTGCCGTTA
CCGATTGGCC TCCAGAACAC CGACCGGCTG ATGGAGACCC TGCAGAGACT GAGCGGCCGG
CCGCTGCCCG AAACCCTGGC CCTGGAGCGG GGATGGCTCC TCGATGGGAT GGCGGACTCC
CACAAGTACA ATGCAGAAGG ACGCCCGGTC ATCTATGGTG AGCCTGAACT GGTCAACGCC
TGTGTCAGCC TTTGCCTGGA GAACGGAGCC ATTCCAGCAG TCATCGCCAG CGGAACCAGG
AACAGCCGGC TGGAGGAGGT GCTCACACCC CAACTGGCAG ATGCTGATGA AGCGCCGGTG
CTCCTTGAGG AGGCCGACAT CGCCGCCATT TCAGAGGCAG CCTGCACGAC GAAGGCAAAT
ATCGCCATCG GCCATTCAGG GGGACGGTCC CTGACCGAAC GACAGGGGAT CCCCATCGTC
AGGGTGGGAT TTCCCATACA TGACCGGGTT GGAGGTCAGC GGCTTCTCTC CGCTGGATAT
GCGGGGACAC TGGCATTCCT CGACCGGTTC ACCAACACGC TGCTGGAGGC AAAGTACAGT
TCCTATCGGC AGCAGCGAAA AGACGAGATG ATCACCAGAG GAGGTATCTG A
 
Protein sequence
MSETRVRQVN ENQCQMCMPL GGVVAFKGIE GAMVLVHGSQ GCSTYMRLAN VEHYNEPIDV 
ASSALNEKQT IYGGEKNLKK ALDNVIRVYE PKVLGIVTSC LAETMGEDLT RMIESYTRER
STEGIDIIPV ATPSYAGSHT EGFWAATRDL IAYFARPTEP HQRINVIIPH ISPADIREIK
RIFDLMGLEY MLIPDYSMTL DRPFGGRYQK IPPGGTSTAD IAAMPGARAT VQFGLTCPDD
LSPGLYLQKQ FGVPLITLPL PIGLQNTDRL METLQRLSGR PLPETLALER GWLLDGMADS
HKYNAEGRPV IYGEPELVNA CVSLCLENGA IPAVIASGTR NSRLEEVLTP QLADADEAPV
LLEEADIAAI SEAACTTKAN IAIGHSGGRS LTERQGIPIV RVGFPIHDRV GGQRLLSAGY
AGTLAFLDRF TNTLLEAKYS SYRQQRKDEM ITRGGI