Gene Mpal_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0236 
Symbol 
ID7270622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp269351 
End bp270397 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content61% 
IMG OID643568887 
Productoxidoreductase/nitrogenase component 1 
Protein accessionYP_002465343 
Protein GI219850911 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.690347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCGA AGACCTCGCA TTCAAGTACA TATCGATATG AAGGGTGCAC GCTGACCGGA 
GCACTCTCGG TCACGACCGC GATCACCGAT GGGATCAGTA TTATCCATGG CCCGGCGGGC
TGTGCCCATC ATAACCTTTC GCTGATCTAT GCGACGCTGC TCGACCATGA GAGCGGCCCG
CTTCCTGCGC TCTGCTCTGA CCGGATTGGC GAAGAGGAGA TCATCTTCGG CGGCGAGGAG
CAGTTGACCG CCGTAATCAG AAACGCCGTC AGGGACGGGT ACACCTCTGT CATGGTCCTC
GGCACCTGCG TGACCGCTGC CATCGGCGAC GATATCGATT CGATCTGTGG ACAGGACTGG
CCGGTTCCGG TGATCCCGGT GAAGACCCAG GGGTTTCTGG GAGGGGTCTT CTCGACCGGG
TTTTTCAACG CCCTCTCAGC CTTGGCCGGC CTCGCACCGA CAGGCCAGGA GAAGAGGGAT
GGGCAAGGAG TTGAACCTCG CGTGAACCTT ATCGGGGAGA AGAACCTGGA GTACGAAGTG
GATGAGAACG CTGCCGAGGT GACCCGGTTG CTCGATCGGG CAGGGATCGA AGTGAACCTC
AGGTTTGTCA GGGGGATCAG TACCGACGAG ATCGCCAGGC TTGGGAGGGC TGACCTGAAC
ATCCTCCGTG AACCTTCGCT TGTCGCGTTC GGCGAGGAAC TGCAGCAGCA GTTCTCCATC
CCGTACCTGG AAGGGTTCCC GGTCGGCCTT GCGGGAACTC TCCGATTCGT CCAGGAGACG
GCTGACCGCT GTGCTGTCGA CGGCACCACC GCAGTCGAGG AGGAGGAGAT CTTCCAGGCT
CAGATGCTCG ATCAGTTCGA GCGGATCAGA GGCGCCCGGG TCCGGTTCAG CCAGCCCTCC
GACCGATTCA CTGCAGAATT GGTAGACCGA CTCGGTCTCA TCATCAGTAG CGATGGCGCA
CCGGTCCGGC TTCCGGTGCC GCTCCCGGTC GGGACCGCCG GCATCCGCCG GATGCTGCAG
CAGTGGAGGC GGACGATCGA TGCCTGA
 
Protein sequence
MNSKTSHSST YRYEGCTLTG ALSVTTAITD GISIIHGPAG CAHHNLSLIY ATLLDHESGP 
LPALCSDRIG EEEIIFGGEE QLTAVIRNAV RDGYTSVMVL GTCVTAAIGD DIDSICGQDW
PVPVIPVKTQ GFLGGVFSTG FFNALSALAG LAPTGQEKRD GQGVEPRVNL IGEKNLEYEV
DENAAEVTRL LDRAGIEVNL RFVRGISTDE IARLGRADLN ILREPSLVAF GEELQQQFSI
PYLEGFPVGL AGTLRFVQET ADRCAVDGTT AVEEEEIFQA QMLDQFERIR GARVRFSQPS
DRFTAELVDR LGLIISSDGA PVRLPVPLPV GTAGIRRMLQ QWRRTIDA