Gene Information |
Locus tag | Mpal_1823 |
Symbol | |
ID | 7270369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1932265 |
End bp | 1935135 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643570438 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_002466852 |
Protein GI | 219852420 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
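
The coordinates and lengths listed above agree with each other, and the check is simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Both assumptions match the numbers in this record but are not stated by the source. The function names are illustrative only.

```python
# Consistency check for the values in the Gene Information table.
# Assumption: coordinates are 1-based and inclusive.
# Assumption: the gene length counts the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1932265, 1935135)
print(length_bp)                  # 2871
print(protein_length(length_bp))  # 956
```

The strand (`-`) does not change the length calculation; it only determines which strand of the replicon the reported coding sequence is read from.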
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.668358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGGTCCG TCGGCCCGTG CGTCAAAAGA AGTCTCTATA TGGGAGCGAA CCCTCTAATT TCTGACGATA CCATGAGGGA GATCATCATC AAAGGGGCGA GGGAGCATAA CCTCAAGAAT ATCAGCGTGG TGCTCCCCCG TGACAAACTG ATCGTCTTCA CCGGCCTGTC CGGTTCAGGA AAATCGACGC TCGCGTTCGA TACGCTCTAT GCCGAAGGGC AGCGGCGGTA TGTGGAATCC CTCTCCACCT ATGCCCGGCA GTTTCTCGGG GTGATGCACA AGCCGGATGT GGACTCGATC GAAGGGCTCT CTCCTGCGAT CTCGATCGAG CAGAAGACCA CCTCCAAGAA CCCGCGCTCC ACGGTCGGCA CGATCACCGA GATCTACGAC TACCTCCGGC TTCTGTATGC GAGGATCGGA ACACCCTACT GCCCGGTGCA CAACATCAAG ATCGAATCGC AGACCCCGGA ACGGATCGCC GATACGATCA CCGCGGAGCA GGCCGGCATG GTGACGATTC TCGCCCCGAT CATCAGGCAG AAGAAGGGGA CCTACCAGCA ACTCTTCAAG GACCTGAACC GGGAGGGGTT TGCCAGGGTC CGGGTGAACG GGACGATCGT TCGGACCGAC GAGGAGATCA CCCTCGACCG GTACAAGAAG CACACTATCG ATATCGTGCT CGACCGGTTC GATACGATCG ACCGGACCCG CCTCGTCGAG AGCATCGAGG TCGGGCTGAA GCGAGCCGAG GGGCTGATCA TCGTTGTGGA CGAGGAGGGG AAAGAGACGA CCTACTCCTC GCTGATGGCC TGCCCGATCT GCGGGATCTC CTTCGAAGAA CTCCAGCCGC GGATGTTCTC GTTCAACAGT CCGTTCGGCG CCTGCGAGGA GTGCAACGGT CTCGGTTTCC GGATGGTCTT CGATCCGGAC CTGATCATCC CTGACAAGAG CCTCTGTATC GTGGACGGGG CGATCGCCCT GTATCGGAAT GTCCTGGAGG GCTTCCGGGG TCAGCAGCTC GACACCGTCG CCAAGAGTTT CGGTTTCGAC CTCTTCACGC CCATCCAGGA TCTGACTGAA GAGCAGTACA ATGGGCTGAT GTTCGGTTCT GACAAGCAGA TCGACTTCTC GGTCACGATG AAGCAGGGGG ATGTACACTG GTCTCACCGG GGTACCTGGG AGGGGCTCCT CCCGCAGGCT GAACGGTTGT ATCATCAGAC GCAGTCTGAA TACCGGAAGA AGGAACTTGA GAAGTTCATG CGGATCTATG AGTGCCCCAC CTGTAAGGGA GCCCGGCTGA AGGAGAAGAT CCGGGCGGTC CGGATCAATG ATCGATCCAT CGTCGATGTG ACCCGTCTCT CGGTCACCGC CTGTCGGGAT TTCTTTGCTA ACCTGACGCT GACCCCGAAA CAGGCAGAGA TCGCCATGCT GGTGGTCAAG GAGATCACCG ACCGGCTGAA CTTCCTCGAA CGGGTCGGGC TCGGGTACCT GAACCTCTCG CGGTCGGCAG GGACGCTCTC TGGTGGAGAA GCCCAGCGGA TCCGGCTGGC GACTCAGATC GGGGCGAACC TGATGGGGGT GCTGTACGTG CTGGACGAGC CCTCGATCGG TCTCCATCAG AGGGATAACC AGCGACTGAT CGATTCACTC TGTGCGCTCC GCGATCTGGG CAACACGCTG ATCGTCGTCG AGCATGACGA GGAGACGATC CGTCATGCCG ACTACGTGGT CGACATCGGC CCCGGGGCCG GGGTGCACGG CGGTCAGGTG GTGGCCAAGG GGACGCCGCT TCAGATCGAG 
CGGTCGATCA ACTCGCTGAC CGGGCTGTAC CTCGCCGGCT CGCTCAAGAT CGATACTCCG AAATGGCGGC GGTCCAGCGA CCACTTCATC AGAATCACCG GGGCGGCCGA GAACAACCTC AAGGGGATCG ATGTCCAGTT CCCGATCGGG GTGCTGACGG TGGTGACCGG TGTCTCCGGC TCCGGGAAAT CGACGCTGGT CTATGATATC CTGTACAAGG CCCTGCAGAA GAAACTGAAG AGAAGCAGCG AACCGGCTGG AAAACACGAG TCGTTGACGC TCGATTCCGA GATCGACAAG GTGATCGTGA TCGACCAGAG TCCAATCGGT CGGACCCCCC GGTCCAACCC TGCCACGTAT ACCAAGATCT TCGACGAGAT CCGGTCGGTC TTCGCCGGGG TGCCGGAGGC GAAGGTGCGG GGGTACCAAC CCGGCCGGTT CTCGTTCAAT GTCAAGGGCG GACGGTGCGA GGCCTGCCAG GGCGACGGGC TGATCAAGAT CGAGATGAAC TTTCTGCCCG AGGTCTATGT GGAGTGCGAG GAGTGCAAGG GGAAGCGGTA CAACCGCGAG ACCCTTGAGG TGAAGTACAA GGGTCATTCC ATCGCCGATG TGCTGGATAT GTCCGTCGAT GAGGCGCTCC ATCTCTTCGA GTCGCTTCCG GCGATCAGAA CCAAACTCGA GACCCTCTCC CGGGTCGGCC TCGACTATAT CAAACTCGGA CAGTCGTCGA CGACTCTCTC CGGCGGTGAA GCGCAGCGGA TCAAACTGAC CCGGGAACTG GCCAAGCGGG CGACCGGCAA GACGCTCTAC CTGCTCGACG AGCCGACGAC CGGCCTCCAC TTCCATGATG TGAAGAAACT GATCCAGGTG CTGGACGACC TGGTCAAGAA GGGGAACTCG GTGCTGGTGA TCGAGCACAA CCTGGACGTC ATCAAGTCTG CCGACCATGT GATAGACCTC GGACCGGACG GAGGTGACCG GGGCGGACAG GTGATCGCGA CCGGAACGCC GGAGGAGATC GCGGCGACGC CGGGCAGTTA CACAGGGGAG TACCTGAAGA AGGTGTTATG A
Protein sequence | MWSVGPCVKR SLYMGANPLI SDDTMREIII KGAREHNLKN ISVVLPRDKL IVFTGLSGSG KSTLAFDTLY AEGQRRYVES LSTYARQFLG VMHKPDVDSI EGLSPAISIE QKTTSKNPRS TVGTITEIYD YLRLLYARIG TPYCPVHNIK IESQTPERIA DTITAEQAGM VTILAPIIRQ KKGTYQQLFK DLNREGFARV RVNGTIVRTD EEITLDRYKK HTIDIVLDRF DTIDRTRLVE SIEVGLKRAE GLIIVVDEEG KETTYSSLMA CPICGISFEE LQPRMFSFNS PFGACEECNG LGFRMVFDPD LIIPDKSLCI VDGAIALYRN VLEGFRGQQL DTVAKSFGFD LFTPIQDLTE EQYNGLMFGS DKQIDFSVTM KQGDVHWSHR GTWEGLLPQA ERLYHQTQSE YRKKELEKFM RIYECPTCKG ARLKEKIRAV RINDRSIVDV TRLSVTACRD FFANLTLTPK QAEIAMLVVK EITDRLNFLE RVGLGYLNLS RSAGTLSGGE AQRIRLATQI GANLMGVLYV LDEPSIGLHQ RDNQRLIDSL CALRDLGNTL IVVEHDEETI RHADYVVDIG PGAGVHGGQV VAKGTPLQIE RSINSLTGLY LAGSLKIDTP KWRRSSDHFI RITGAAENNL KGIDVQFPIG VLTVVTGVSG SGKSTLVYDI LYKALQKKLK RSSEPAGKHE SLTLDSEIDK VIVIDQSPIG RTPRSNPATY TKIFDEIRSV FAGVPEAKVR GYQPGRFSFN VKGGRCEACQ GDGLIKIEMN FLPEVYVECE ECKGKRYNRE TLEVKYKGHS IADVLDMSVD EALHLFESLP AIRTKLETLS RVGLDYIKLG QSSTTLSGGE AQRIKLTREL AKRATGKTLY LLDEPTTGLH FHDVKKLIQV LDDLVKKGNS VLVIEHNLDV IKSADHVIDL GPDGGDRGGQ VIATGTPEEI AATPGSYTGE YLKKVL