Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mhun_3131 |
Symbol | |
ID | 3923959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanospirillum hungatei JF-1 |
Kingdom | Archaea |
Replicon accession | NC_007796 |
Strand | - |
Start bp | 3414332 |
End bp | 3417622 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637898740 |
Product | peptidase C1A, papain |
Protein accession | YP_504536 |
Protein GI | 88604358 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.34084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000341484 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCAAGC CAATAGTATT AATATTCATT CTGATATTGC TCATAAGTAT CGGTTTAGAG TCGGGAGCGG CAACATCAGT TACGATATTG CCCGGTTCAC AGGCGGTGAT GAAACCTCCT GATAGTGGTG TCGCTCCTCC ATATTCCCTG CCGTCGATTG TAATCCCAGT CACTCCAAAG TCAACTTTAC AAGGAGATGG GCCGGTATCA GGCATTGTAC AGTCACCAGT TCTGTCAGGT ATAATATCCT CCAAGGTGGA ACCTGAACAA TTACTCCCTT CTACAAACAT TACACAAAAT AGTTCTGTAT GTGCACAAGC ATATATTCCT GACCGGGTTA TTGTAAAGTT TAAGACAGAC CACTTTTCTC CCTTATCTTC TGTAAATCAG ATTCAGGCAG AGGCACACGC AGCCATGGGC GCAACGGTTC TTGCAGACCC ATCCACTCTG GGTGTTGAGG GAATGCAGGT TGTCAGTGTT CCCAACACTA CCGGGACCAT GAAAGCGATT GAACTATATC GGATGAACCC GATGGTCGAG TATGCCCAGC CGGATTACCT GTATTCTATA AATTCAACAA TTATTGATCC TCCGGTGTCT GTAAATCAGC AGACAGTCCC CTATCCTGAA ACTATTCAGG TAAAACCGTC TACAGTTCAG GTTCCGAATG TTCCTCCTCC ATCATCACCG CCTTCCTGGT CGACCTGTTT ATCATCGGGA TATCAGAGCG GGGTAGTAGG ATTTGAAGAT CCGTTGAGCG AAGAAGAGCG ATATAATGCA GCACAAGCAG AAGTGGATGA CATAAATGCC TATGTGAAAG AACATAATCT CTCCTGGACT GCAGCGGTAA ATCCAATCAT GCTCATGAGT CCAGAAGAGC GTGAACATCT GAAAGGACTC CGGCATGATC TGAAAAGCAG CACGATAGTG AGTGGCGCCG GTATCACACC AATGGAAGGA CTTCCCACTT CGTTTGACTG GCGGAACAAT GGTGGAGATT ATACCACCCC TATTAAGAAT CAGGGAAGTT GCGGGAGTTG CTGGGCATTT GCAACAACCG GTGCCTTTGA ATCATATAAA GAGATAAAAT CCGGAAATCC GGGTATGAAC CCTGATTATG CTGAGCAGTA CCTGGTGAAC TGTGCAGGTG ATCAGCGTGG ATGTAATGGC GGACTCTTCA CGGCAATGGC ATACTTTGTA AATAAGGCGG GTTTGAGTGG TGGAGTCGGG ACGGTTACCG AGGCGAACTA TCCCTATACC GGTTCGGATG GTACGTGTAA GAGTCTGTCC GGGTATACCA GGTATTCGGT AGATACGGCC GCAGGAGAGA CCTGGGGGTA TGTCGGTGGA GGGAATGAGT GGAGTATCCC ATCTGATGAT GCGATAAAGA CGGCGATTTA TCTCTATGGT CCGGTTGCCG CCGGAGTCTA TGCAGAGAGC ACCTTTGATT CATATCGATC AGGTATACTT GACAGTACGT CCAGTGCATC CTATGCAAAT CATGCAATTA TTATTGTGGG ATGGGGAACG TTAAATGGCC GGACTTACTG GATTTGTAAG AACAGTTGGG GGACATCCTG GGGCGAATCA GGGTGGTTTA GAATTTTCTC AGGAAGGCTC CGTATCGGGG AAGGTGCTGC ATATTTTAAA TATACAGCCT CAAATCCTTC TGGCGGGACG ATTGCTTTCA ATTCTAATCC ATCCGGGGCC CAAATCTGGA TTGATGGGGT GAACACCGGT CAGGTTACTC CCTATACTCA AACATCAGTT CCAATCGGAA CCTATTCGGT GACCTTGAAA CTCAGCGGGT ACCAGGAGTA TACTCGTTCG GTATCTGTAA CTTCAGGACA GACAACTGTC ATATCTGCCA CCCTTTCACC AATACCTACC GGGAGTATTG CAGTTAGTTC AACTCCTTCC GGTGCACGGA TCTGGCTTGA CGGGGTTGAT ACCACAAAGA GCACACCTGC GACGTTATCT TCCGTGCCGA TTGGTTCACA TGCTGTATCA CTGGTTCTCT CTGGATATAA TTCATATTCG ACCGTTGTGA TGGTTCATGA GGGTCAGACA AGTATAGTCT CCGGAACTTT GAATCAAATT AACCCTGGCA CCCAGGTTCT TCCGAATGAT CCTTCATTTA GCAGTCTGTG GGGACTTCAT AATACCGGAC AGAGTGGAGG AACAGGTGAT GCCGATATAG ATGCTCCGGA AGCATGGAGC ATAACCACCG GTTCACTAGG GGTTATTGTC GCGGTTGTTG ATACCGGTGT GGATTACAAT CACCCAGATC TGGTCGCAAA TATCTGGAGG GATCCGGTAA CAAATACTCC CGGGTATGAT TTCTATGGTT CTAATGATCC AAATCCCATG GATGAACATG GTCATGGGAC TCATTGTGCC GGGACGATTG GAGCAGTCGG AAATAATGGG ATTGGTGTGA CCGGGGTGAA CTGGAATGTG AAAATTATGC CTCTTCGGTT CCTTGGAGCT GACGGGTATG GCTCTACAAG TGATGCGATT GAGGCTTTTG CCTGGGGATA TGCAAAGGGA GCGAGAATTT TCTCAAACTC ATGGGGAGCC TATGGTATCG ATTATGCTCT TCGTGATTCC ATTAATCTCT ATCCCGATGC ACTCTTTGTA TGTGCGGCCG GAAATGGCGA TATATATGGA AATCCCTATA ATACTGATTC ATATCCCCAT TCTCCATCAT CGCTGGCTAA TGTGAATATT CTCTCTGTAA CAGCAACAAA CCGATATGAT CAAAGAGCTT CCTGGGCAAA TTATGGTGCG ACAACGGTTG ATGTTGCTGC ACCGGGTGTG TCTATCATGA GTACTACCAA AGGCAACTCC TATGGCACCA TGAGCGGGAC CTCTATGGCG ACCCCGCATG TGGCAGGTGT TGCAGCCCTC ATAAAGGCAC AGAACCCTTC ATATTCAGCA TCCCAGATAA AATCCGCCAT AATGAATAAT GTTGATCTCA AATCCGGACT ATCTGGTAGA TGTGTTACCG GAGGTCGGAT CAATGCGTTC GCCAGTCTGG CCTCCTCCCT TCCACTTAAG GCAAAGTTTT ACGGGGTTCC TGATACAACA ATCAAACCTC TTCGAATCCG GTTCTATGAT GTCTCTGAAG GGATAATCTC CTCAAGACTC TGGAACTTTG GGGATGGAAA TACTACTGGT GAAGTTAATC CATCGCACAC CTATTATAAT CCAGGCATCT ATACTGTCAC GCTTCAGGTA AATGACGGTG TTGGTACTCA TGCCTCTGTG CTGGAGATAC AGGGAGGTTG A
|
Protein sequence | MSKPIVLIFI LILLISIGLE SGAATSVTIL PGSQAVMKPP DSGVAPPYSL PSIVIPVTPK STLQGDGPVS GIVQSPVLSG IISSKVEPEQ LLPSTNITQN SSVCAQAYIP DRVIVKFKTD HFSPLSSVNQ IQAEAHAAMG ATVLADPSTL GVEGMQVVSV PNTTGTMKAI ELYRMNPMVE YAQPDYLYSI NSTIIDPPVS VNQQTVPYPE TIQVKPSTVQ VPNVPPPSSP PSWSTCLSSG YQSGVVGFED PLSEEERYNA AQAEVDDINA YVKEHNLSWT AAVNPIMLMS PEEREHLKGL RHDLKSSTIV SGAGITPMEG LPTSFDWRNN GGDYTTPIKN QGSCGSCWAF ATTGAFESYK EIKSGNPGMN PDYAEQYLVN CAGDQRGCNG GLFTAMAYFV NKAGLSGGVG TVTEANYPYT GSDGTCKSLS GYTRYSVDTA AGETWGYVGG GNEWSIPSDD AIKTAIYLYG PVAAGVYAES TFDSYRSGIL DSTSSASYAN HAIIIVGWGT LNGRTYWICK NSWGTSWGES GWFRIFSGRL RIGEGAAYFK YTASNPSGGT IAFNSNPSGA QIWIDGVNTG QVTPYTQTSV PIGTYSVTLK LSGYQEYTRS VSVTSGQTTV ISATLSPIPT GSIAVSSTPS GARIWLDGVD TTKSTPATLS SVPIGSHAVS LVLSGYNSYS TVVMVHEGQT SIVSGTLNQI NPGTQVLPND PSFSSLWGLH NTGQSGGTGD ADIDAPEAWS ITTGSLGVIV AVVDTGVDYN HPDLVANIWR DPVTNTPGYD FYGSNDPNPM DEHGHGTHCA GTIGAVGNNG IGVTGVNWNV KIMPLRFLGA DGYGSTSDAI EAFAWGYAKG ARIFSNSWGA YGIDYALRDS INLYPDALFV CAAGNGDIYG NPYNTDSYPH SPSSLANVNI LSVTATNRYD QRASWANYGA TTVDVAAPGV SIMSTTKGNS YGTMSGTSMA TPHVAGVAAL IKAQNPSYSA SQIKSAIMNN VDLKSGLSGR CVTGGRINAF ASLASSLPLK AKFYGVPDTT IKPLRIRFYD VSEGIISSRL WNFGDGNTTG EVNPSHTYYN PGIYTVTLQV NDGVGTHASV LEIQGG
|
| |