Gene Mpal_1823 details


Gene Information

Locus tag: Mpal_1823
Symbol:
ID: 7270369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1932265
End bp: 1935135
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 60%
IMG OID: 643570438
Product: excinuclease ABC, A subunit
Protein accession: YP_002466852
Protein GI: 219852420
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
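
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1932265, 1935135)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two results agree with the listed Gene Length (2871 bp) and Protein Length (956 aa).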


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.668358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTCCG TCGGCCCGTG CGTCAAAAGA AGTCTCTATA TGGGAGCGAA CCCTCTAATT 
TCTGACGATA CCATGAGGGA GATCATCATC AAAGGGGCGA GGGAGCATAA CCTCAAGAAT
ATCAGCGTGG TGCTCCCCCG TGACAAACTG ATCGTCTTCA CCGGCCTGTC CGGTTCAGGA
AAATCGACGC TCGCGTTCGA TACGCTCTAT GCCGAAGGGC AGCGGCGGTA TGTGGAATCC
CTCTCCACCT ATGCCCGGCA GTTTCTCGGG GTGATGCACA AGCCGGATGT GGACTCGATC
GAAGGGCTCT CTCCTGCGAT CTCGATCGAG CAGAAGACCA CCTCCAAGAA CCCGCGCTCC
ACGGTCGGCA CGATCACCGA GATCTACGAC TACCTCCGGC TTCTGTATGC GAGGATCGGA
ACACCCTACT GCCCGGTGCA CAACATCAAG ATCGAATCGC AGACCCCGGA ACGGATCGCC
GATACGATCA CCGCGGAGCA GGCCGGCATG GTGACGATTC TCGCCCCGAT CATCAGGCAG
AAGAAGGGGA CCTACCAGCA ACTCTTCAAG GACCTGAACC GGGAGGGGTT TGCCAGGGTC
CGGGTGAACG GGACGATCGT TCGGACCGAC GAGGAGATCA CCCTCGACCG GTACAAGAAG
CACACTATCG ATATCGTGCT CGACCGGTTC GATACGATCG ACCGGACCCG CCTCGTCGAG
AGCATCGAGG TCGGGCTGAA GCGAGCCGAG GGGCTGATCA TCGTTGTGGA CGAGGAGGGG
AAAGAGACGA CCTACTCCTC GCTGATGGCC TGCCCGATCT GCGGGATCTC CTTCGAAGAA
CTCCAGCCGC GGATGTTCTC GTTCAACAGT CCGTTCGGCG CCTGCGAGGA GTGCAACGGT
CTCGGTTTCC GGATGGTCTT CGATCCGGAC CTGATCATCC CTGACAAGAG CCTCTGTATC
GTGGACGGGG CGATCGCCCT GTATCGGAAT GTCCTGGAGG GCTTCCGGGG TCAGCAGCTC
GACACCGTCG CCAAGAGTTT CGGTTTCGAC CTCTTCACGC CCATCCAGGA TCTGACTGAA
GAGCAGTACA ATGGGCTGAT GTTCGGTTCT GACAAGCAGA TCGACTTCTC GGTCACGATG
AAGCAGGGGG ATGTACACTG GTCTCACCGG GGTACCTGGG AGGGGCTCCT CCCGCAGGCT
GAACGGTTGT ATCATCAGAC GCAGTCTGAA TACCGGAAGA AGGAACTTGA GAAGTTCATG
CGGATCTATG AGTGCCCCAC CTGTAAGGGA GCCCGGCTGA AGGAGAAGAT CCGGGCGGTC
CGGATCAATG ATCGATCCAT CGTCGATGTG ACCCGTCTCT CGGTCACCGC CTGTCGGGAT
TTCTTTGCTA ACCTGACGCT GACCCCGAAA CAGGCAGAGA TCGCCATGCT GGTGGTCAAG
GAGATCACCG ACCGGCTGAA CTTCCTCGAA CGGGTCGGGC TCGGGTACCT GAACCTCTCG
CGGTCGGCAG GGACGCTCTC TGGTGGAGAA GCCCAGCGGA TCCGGCTGGC GACTCAGATC
GGGGCGAACC TGATGGGGGT GCTGTACGTG CTGGACGAGC CCTCGATCGG TCTCCATCAG
AGGGATAACC AGCGACTGAT CGATTCACTC TGTGCGCTCC GCGATCTGGG CAACACGCTG
ATCGTCGTCG AGCATGACGA GGAGACGATC CGTCATGCCG ACTACGTGGT CGACATCGGC
CCCGGGGCCG GGGTGCACGG CGGTCAGGTG GTGGCCAAGG GGACGCCGCT TCAGATCGAG
CGGTCGATCA ACTCGCTGAC CGGGCTGTAC CTCGCCGGCT CGCTCAAGAT CGATACTCCG
AAATGGCGGC GGTCCAGCGA CCACTTCATC AGAATCACCG GGGCGGCCGA GAACAACCTC
AAGGGGATCG ATGTCCAGTT CCCGATCGGG GTGCTGACGG TGGTGACCGG TGTCTCCGGC
TCCGGGAAAT CGACGCTGGT CTATGATATC CTGTACAAGG CCCTGCAGAA GAAACTGAAG
AGAAGCAGCG AACCGGCTGG AAAACACGAG TCGTTGACGC TCGATTCCGA GATCGACAAG
GTGATCGTGA TCGACCAGAG TCCAATCGGT CGGACCCCCC GGTCCAACCC TGCCACGTAT
ACCAAGATCT TCGACGAGAT CCGGTCGGTC TTCGCCGGGG TGCCGGAGGC GAAGGTGCGG
GGGTACCAAC CCGGCCGGTT CTCGTTCAAT GTCAAGGGCG GACGGTGCGA GGCCTGCCAG
GGCGACGGGC TGATCAAGAT CGAGATGAAC TTTCTGCCCG AGGTCTATGT GGAGTGCGAG
GAGTGCAAGG GGAAGCGGTA CAACCGCGAG ACCCTTGAGG TGAAGTACAA GGGTCATTCC
ATCGCCGATG TGCTGGATAT GTCCGTCGAT GAGGCGCTCC ATCTCTTCGA GTCGCTTCCG
GCGATCAGAA CCAAACTCGA GACCCTCTCC CGGGTCGGCC TCGACTATAT CAAACTCGGA
CAGTCGTCGA CGACTCTCTC CGGCGGTGAA GCGCAGCGGA TCAAACTGAC CCGGGAACTG
GCCAAGCGGG CGACCGGCAA GACGCTCTAC CTGCTCGACG AGCCGACGAC CGGCCTCCAC
TTCCATGATG TGAAGAAACT GATCCAGGTG CTGGACGACC TGGTCAAGAA GGGGAACTCG
GTGCTGGTGA TCGAGCACAA CCTGGACGTC ATCAAGTCTG CCGACCATGT GATAGACCTC
GGACCGGACG GAGGTGACCG GGGCGGACAG GTGATCGCGA CCGGAACGCC GGAGGAGATC
GCGGCGACGC CGGGCAGTTA CACAGGGGAG TACCTGAAGA AGGTGTTATG A
 
Protein sequence
MWSVGPCVKR SLYMGANPLI SDDTMREIII KGAREHNLKN ISVVLPRDKL IVFTGLSGSG 
KSTLAFDTLY AEGQRRYVES LSTYARQFLG VMHKPDVDSI EGLSPAISIE QKTTSKNPRS
TVGTITEIYD YLRLLYARIG TPYCPVHNIK IESQTPERIA DTITAEQAGM VTILAPIIRQ
KKGTYQQLFK DLNREGFARV RVNGTIVRTD EEITLDRYKK HTIDIVLDRF DTIDRTRLVE
SIEVGLKRAE GLIIVVDEEG KETTYSSLMA CPICGISFEE LQPRMFSFNS PFGACEECNG
LGFRMVFDPD LIIPDKSLCI VDGAIALYRN VLEGFRGQQL DTVAKSFGFD LFTPIQDLTE
EQYNGLMFGS DKQIDFSVTM KQGDVHWSHR GTWEGLLPQA ERLYHQTQSE YRKKELEKFM
RIYECPTCKG ARLKEKIRAV RINDRSIVDV TRLSVTACRD FFANLTLTPK QAEIAMLVVK
EITDRLNFLE RVGLGYLNLS RSAGTLSGGE AQRIRLATQI GANLMGVLYV LDEPSIGLHQ
RDNQRLIDSL CALRDLGNTL IVVEHDEETI RHADYVVDIG PGAGVHGGQV VAKGTPLQIE
RSINSLTGLY LAGSLKIDTP KWRRSSDHFI RITGAAENNL KGIDVQFPIG VLTVVTGVSG
SGKSTLVYDI LYKALQKKLK RSSEPAGKHE SLTLDSEIDK VIVIDQSPIG RTPRSNPATY
TKIFDEIRSV FAGVPEAKVR GYQPGRFSFN VKGGRCEACQ GDGLIKIEMN FLPEVYVECE
ECKGKRYNRE TLEVKYKGHS IADVLDMSVD EALHLFESLP AIRTKLETLS RVGLDYIKLG
QSSTTLSGGE AQRIKLTREL AKRATGKTLY LLDEPTTGLH FHDVKKLIQV LDDLVKKGNS
VLVIEHNLDV IKSADHVIDL GPDGGDRGGQ VIATGTPEEI AATPGSYTGE YLKKVL
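
The gene and protein listings above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the Python below strips that layout and translates DNA codon by codon. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. The names `clean` and `translate` are hypothetical helpers, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codons in TCAG order, paired with their amino acids ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used in the blocked listing."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0 up to the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGTGGTCCG TCGGCCCGTG CGTCAAAAGA AGTCTCTATA TGGGAGCGAA CCCTCTAATT"
print(translate(clean(first_line)))
```

Run on the first line of the gene sequence, this yields the first 20 residues of the protein listing (MWSVGPCVKR SLYMGANPLI once the block spacing is removed).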