Gene Mpal_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1686 
Symbol 
ID7271249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1745866 
End bp1749036 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content51% 
IMG OID643570302 
Productperiplasmic copper-binding 
Protein accessionYP_002466718 
Protein GI219852286 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat
[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.482609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCATT CCCTCCTGTT ATCAGGCACC ATCATTCTCC TGTTCTGCCT GATCATAATC 
CCAACAGAAG CGCTCGTCCA GTCTGAATCT GCAGTACGGC CGACTATGTT CGCCTGGGAG
GAGAAGATGA GTTCTGATCT CCTGCCCCTG GTCGATGAGC GGTTTCTTTC ACCAGGAGAG
ACTCAGGGGG AGGCCGCAAA GGGAACAGTT CGACAGACTG GAATCAATGG GACTCTCAAC
GATGAGGTCT TCATCTATAT CAGGGTTACA GAGACCATAT CAGTAAAGGA TCTCGATCGC
TGGTGCGTCG AGATCACTGA CCGCGATGAT GGGGAGCATC GTGTGGCAGC CTGGGTCAGA
GTCAGTGACC TTCCCAGTCT CGCCTCATTG GAATGGATCA GGTCAATCCA GACAGTACTT
CCACCAATTG TCAATACCGG CTCAGTAGAC ACACAGGGAG ATCAGATTAT CAGGGCAGAT
CAGATCCGGA GAACCACGGG GTCCTCGGGC CAGGGTGTAA AGGTCGGTGT GATCTCCACA
GGAGTTGACC ACTGGCGAGA CGCTGCAGCA ACCGGAGACC TGCCCTCGAC CCTCCACGTT
CTCTCAAACT CATTTGGCGG TGATGAGGGA ACGGCCATGC TTGAGATCAT TCATGACATT
GCACCAGATG CTGACCTATA TTTCTATGAT ACCGGTACAA ACACAATCGC GTTCAATCGA
GCTGTGGACG CATTGACAGA GGCCGGCTGT TCGGTGATCG TCGACGACAT CTCCTGGTTG
GGTGAACCAT TCTTTGAAGA CGGTTCGGTC GCGACACATA TCCAGGAGAA GATCCAGAAT
GGGAACCTCG TTTATGTCAC CTCTGCCGGC AACTACGCAC AGAAACATTA TCAGGGGACC
TATTTCAATG ACGGAAGCGG GTGGCATGAC TTCAGTGCAG GAAGTTCATC CAGAAAAAAG
ATCTACCTGA GCATCCCGCC AGGCGGAAGC GTCAGAGCTG TTCTACAGTG GGATGATCCC
TTCGGCACTT CAGCGAACGA CTACGATCTG TACCTGAACG AACAATATCC ATATTCTGGA
ATCACCTTAA AAAAAAGCAC GAATGCCCAG ACCGGAAGCG CGGACCCGAT TGAGTGGATC
ACCTATACCA ACAGCAACTC CTACACGATC AACGCCGAGC TCGACGTCAA CAACTACATG
AACCTCGCGG AGGAACGAAC CCTTGAACTA CAGATGTACA TCAGTTCGGG AACCACGATC
AGTCCAGATA ATCTGGTCTC TGCTGATTCT ATCAGTGGTC AGGCTGCTGT ACCCGAGGTC
CTGACCATTG GTGCACTTGG CGCAACCACA CCCAATCAAC TGGAACCGTT CTCCTCCCAA
GGCCCTGTCA CGATCGTATA TCCACACCGC GAGACACGTC AGAAGCCCGA TCTATGCGGG
ATCGATGGGG TCGCCGTGAC CGGGGCGGGA GGGTTTCCCA CTCGGTTCTA TGGGACCAGT
GCCGCAGCTC CGCATATCGC AGGGGTTGCA GCACTTCTCT GGAGTCTGAA CCCATCCCTC
ACTCCTGACC AGATCCGAAC AGTGCTTATT GAAGGCGCGG TCGACCTTGG TGATCCTGGA
TGGGACACGC TCTATGGATC TGGCCGAGCC GACGCACTTG CCTCAAAGGA TTTGATCAGA
ACTGATGGGA AGATCACGGT ATCGGGGCCG GTAGTGATCG ATAAACCAGG CACCTATGTA
CTCGACCGTG ACATCATAGA TTGCCAGAAT ACGGTAGGAA TTGAGATAAA GGCGTCAAAT
GTCGTCTTCG ACGGTCAGGG TCATCTGATC AGCGGACAGA ATAGAGGTGG ATCGGCAGGG
ATCTTTGTCT CGAAGGATCC AGATAATCCA CTGACTGGCG TTATCATAAA AAATGTGGAT
GTGGACCACT GGGACTATGG GATCTATTAC CTGAATGCTA TCGAGGGAAT GATCCAGACG
ATCAGAACTA CAGGGAATGA TAAATATGGG ATTGTTCTCT ACGCGGGGAG CAGTGGGAAT
ACAGTCGCTG ACAGCACGCT CACCGATAAT GGAGACGGTA TCTACCTAAC CGCCTCAAGC
GACAGGAATA CAATCCAAAA CACGTCGATC AGAGAGAACC GGAACCATGG AATCTCTATC
TATGACTCAA CCAGCAACCT ACTGGAGGGT AACAGCATCA CTGGCGCAAC CGCAGTTGGG
GTCCAGTTCT TCACCACGAA TAACAACACC CTGACCAGCA GCACGATTAC CGGGGACACC
CTGTATGGTG TCCAGATTTA CCATTCTGAT GGCAACATTC TGAAGAATAA CACCATCACC
GGCAGCACCA GTGCCGGAGT ATACCTCAAC CAATCCCAGG AGAACAGTAT CTACAACAAT
TACTTCAACA ATCCAAATAA TGCCCTTGTT GAAGGAACGA CGAGCCAAAA CACCTGGAAC
CACGATCCAA TGCTTGGAAC CAACATCGTC AGAGGTCCAT CGATTGGAGG AAATTATTGG
GCGACTCCAT CGAGAAATGG TTTTTCAGAG ACTCATCCGG ACAACAACCG TGATGGGTTC
TGTGACGAGG GATATATCAT CACATCTGAA CAGAAGGGAA AAAACATCGA CGAGGCCCCC
CTCATATTAC CGCAATCGTC CAATATGACC AGACCGACCG CAGGTTTTTG GGCAATCCCA
ACCAAGGGGA CTGCCCCGCT CTCTGTCCAG TTCACTGACA CCTCAACTGG TATCCCAGCA
CGGTGGACCT GGGACTTCGG AGATGGTCAG AATTCAACAG ATCAAATGAC AGAGGTTGTT
AACAGGACAA TACAGAACCC GGTACATTCC TATACACAAC CAGGTACCTA CTCTGTCACC
CTGACTGTCT CAAATCCACT GGGTGACGAC TCTATGGAAA GAACCGGATA TATCATCGTC
TCTGGATCAG TCATTCCAAT ACAATCTACG AACGTAATTC CAAAAGATCT GAACGGCGAC
GGGAAGTACG AAGATCTCAA CGGGAATGGT AGACCTGACT TTGAAGACGT GGTGCTGTTC
TTCGATCAGA TGGACTGGAT TGAACAATAT GAACCAATCG ATGCGTTTGA CTTCAACAAT
AACAATGCGA TCGACTTCAA CGACATCGTC GTCCTCTTCA ACCAGTTATG A
 
Protein sequence
MRHSLLLSGT IILLFCLIII PTEALVQSES AVRPTMFAWE EKMSSDLLPL VDERFLSPGE 
TQGEAAKGTV RQTGINGTLN DEVFIYIRVT ETISVKDLDR WCVEITDRDD GEHRVAAWVR
VSDLPSLASL EWIRSIQTVL PPIVNTGSVD TQGDQIIRAD QIRRTTGSSG QGVKVGVIST
GVDHWRDAAA TGDLPSTLHV LSNSFGGDEG TAMLEIIHDI APDADLYFYD TGTNTIAFNR
AVDALTEAGC SVIVDDISWL GEPFFEDGSV ATHIQEKIQN GNLVYVTSAG NYAQKHYQGT
YFNDGSGWHD FSAGSSSRKK IYLSIPPGGS VRAVLQWDDP FGTSANDYDL YLNEQYPYSG
ITLKKSTNAQ TGSADPIEWI TYTNSNSYTI NAELDVNNYM NLAEERTLEL QMYISSGTTI
SPDNLVSADS ISGQAAVPEV LTIGALGATT PNQLEPFSSQ GPVTIVYPHR ETRQKPDLCG
IDGVAVTGAG GFPTRFYGTS AAAPHIAGVA ALLWSLNPSL TPDQIRTVLI EGAVDLGDPG
WDTLYGSGRA DALASKDLIR TDGKITVSGP VVIDKPGTYV LDRDIIDCQN TVGIEIKASN
VVFDGQGHLI SGQNRGGSAG IFVSKDPDNP LTGVIIKNVD VDHWDYGIYY LNAIEGMIQT
IRTTGNDKYG IVLYAGSSGN TVADSTLTDN GDGIYLTASS DRNTIQNTSI RENRNHGISI
YDSTSNLLEG NSITGATAVG VQFFTTNNNT LTSSTITGDT LYGVQIYHSD GNILKNNTIT
GSTSAGVYLN QSQENSIYNN YFNNPNNALV EGTTSQNTWN HDPMLGTNIV RGPSIGGNYW
ATPSRNGFSE THPDNNRDGF CDEGYIITSE QKGKNIDEAP LILPQSSNMT RPTAGFWAIP
TKGTAPLSVQ FTDTSTGIPA RWTWDFGDGQ NSTDQMTEVV NRTIQNPVHS YTQPGTYSVT
LTVSNPLGDD SMERTGYIIV SGSVIPIQST NVIPKDLNGD GKYEDLNGNG RPDFEDVVLF
FDQMDWIEQY EPIDAFDFNN NNAIDFNDIV VLFNQL