Gene Mhun_3131 details

Gene Information

Locus tag: Mhun_3131
Symbol:
ID: 3923959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 3414332
End bp: 3417622
Gene Length: 3291 bp
Protein Length: 1096 aa
Translation table: 11
GC content: 47%
IMG OID: 637898740
Product: peptidase C1A, papain
Protein accession: YP_504536
Protein GI: 88604358
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
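
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the listed values imply) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3414332, 3417622)
print(length)                  # 3291
print(protein_length(length))  # 1096
```

Both results agree with the Gene Length and Protein Length fields of this record.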


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.34084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000341484
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAAGC CAATAGTATT AATATTCATT CTGATATTGC TCATAAGTAT CGGTTTAGAG 
TCGGGAGCGG CAACATCAGT TACGATATTG CCCGGTTCAC AGGCGGTGAT GAAACCTCCT
GATAGTGGTG TCGCTCCTCC ATATTCCCTG CCGTCGATTG TAATCCCAGT CACTCCAAAG
TCAACTTTAC AAGGAGATGG GCCGGTATCA GGCATTGTAC AGTCACCAGT TCTGTCAGGT
ATAATATCCT CCAAGGTGGA ACCTGAACAA TTACTCCCTT CTACAAACAT TACACAAAAT
AGTTCTGTAT GTGCACAAGC ATATATTCCT GACCGGGTTA TTGTAAAGTT TAAGACAGAC
CACTTTTCTC CCTTATCTTC TGTAAATCAG ATTCAGGCAG AGGCACACGC AGCCATGGGC
GCAACGGTTC TTGCAGACCC ATCCACTCTG GGTGTTGAGG GAATGCAGGT TGTCAGTGTT
CCCAACACTA CCGGGACCAT GAAAGCGATT GAACTATATC GGATGAACCC GATGGTCGAG
TATGCCCAGC CGGATTACCT GTATTCTATA AATTCAACAA TTATTGATCC TCCGGTGTCT
GTAAATCAGC AGACAGTCCC CTATCCTGAA ACTATTCAGG TAAAACCGTC TACAGTTCAG
GTTCCGAATG TTCCTCCTCC ATCATCACCG CCTTCCTGGT CGACCTGTTT ATCATCGGGA
TATCAGAGCG GGGTAGTAGG ATTTGAAGAT CCGTTGAGCG AAGAAGAGCG ATATAATGCA
GCACAAGCAG AAGTGGATGA CATAAATGCC TATGTGAAAG AACATAATCT CTCCTGGACT
GCAGCGGTAA ATCCAATCAT GCTCATGAGT CCAGAAGAGC GTGAACATCT GAAAGGACTC
CGGCATGATC TGAAAAGCAG CACGATAGTG AGTGGCGCCG GTATCACACC AATGGAAGGA
CTTCCCACTT CGTTTGACTG GCGGAACAAT GGTGGAGATT ATACCACCCC TATTAAGAAT
CAGGGAAGTT GCGGGAGTTG CTGGGCATTT GCAACAACCG GTGCCTTTGA ATCATATAAA
GAGATAAAAT CCGGAAATCC GGGTATGAAC CCTGATTATG CTGAGCAGTA CCTGGTGAAC
TGTGCAGGTG ATCAGCGTGG ATGTAATGGC GGACTCTTCA CGGCAATGGC ATACTTTGTA
AATAAGGCGG GTTTGAGTGG TGGAGTCGGG ACGGTTACCG AGGCGAACTA TCCCTATACC
GGTTCGGATG GTACGTGTAA GAGTCTGTCC GGGTATACCA GGTATTCGGT AGATACGGCC
GCAGGAGAGA CCTGGGGGTA TGTCGGTGGA GGGAATGAGT GGAGTATCCC ATCTGATGAT
GCGATAAAGA CGGCGATTTA TCTCTATGGT CCGGTTGCCG CCGGAGTCTA TGCAGAGAGC
ACCTTTGATT CATATCGATC AGGTATACTT GACAGTACGT CCAGTGCATC CTATGCAAAT
CATGCAATTA TTATTGTGGG ATGGGGAACG TTAAATGGCC GGACTTACTG GATTTGTAAG
AACAGTTGGG GGACATCCTG GGGCGAATCA GGGTGGTTTA GAATTTTCTC AGGAAGGCTC
CGTATCGGGG AAGGTGCTGC ATATTTTAAA TATACAGCCT CAAATCCTTC TGGCGGGACG
ATTGCTTTCA ATTCTAATCC ATCCGGGGCC CAAATCTGGA TTGATGGGGT GAACACCGGT
CAGGTTACTC CCTATACTCA AACATCAGTT CCAATCGGAA CCTATTCGGT GACCTTGAAA
CTCAGCGGGT ACCAGGAGTA TACTCGTTCG GTATCTGTAA CTTCAGGACA GACAACTGTC
ATATCTGCCA CCCTTTCACC AATACCTACC GGGAGTATTG CAGTTAGTTC AACTCCTTCC
GGTGCACGGA TCTGGCTTGA CGGGGTTGAT ACCACAAAGA GCACACCTGC GACGTTATCT
TCCGTGCCGA TTGGTTCACA TGCTGTATCA CTGGTTCTCT CTGGATATAA TTCATATTCG
ACCGTTGTGA TGGTTCATGA GGGTCAGACA AGTATAGTCT CCGGAACTTT GAATCAAATT
AACCCTGGCA CCCAGGTTCT TCCGAATGAT CCTTCATTTA GCAGTCTGTG GGGACTTCAT
AATACCGGAC AGAGTGGAGG AACAGGTGAT GCCGATATAG ATGCTCCGGA AGCATGGAGC
ATAACCACCG GTTCACTAGG GGTTATTGTC GCGGTTGTTG ATACCGGTGT GGATTACAAT
CACCCAGATC TGGTCGCAAA TATCTGGAGG GATCCGGTAA CAAATACTCC CGGGTATGAT
TTCTATGGTT CTAATGATCC AAATCCCATG GATGAACATG GTCATGGGAC TCATTGTGCC
GGGACGATTG GAGCAGTCGG AAATAATGGG ATTGGTGTGA CCGGGGTGAA CTGGAATGTG
AAAATTATGC CTCTTCGGTT CCTTGGAGCT GACGGGTATG GCTCTACAAG TGATGCGATT
GAGGCTTTTG CCTGGGGATA TGCAAAGGGA GCGAGAATTT TCTCAAACTC ATGGGGAGCC
TATGGTATCG ATTATGCTCT TCGTGATTCC ATTAATCTCT ATCCCGATGC ACTCTTTGTA
TGTGCGGCCG GAAATGGCGA TATATATGGA AATCCCTATA ATACTGATTC ATATCCCCAT
TCTCCATCAT CGCTGGCTAA TGTGAATATT CTCTCTGTAA CAGCAACAAA CCGATATGAT
CAAAGAGCTT CCTGGGCAAA TTATGGTGCG ACAACGGTTG ATGTTGCTGC ACCGGGTGTG
TCTATCATGA GTACTACCAA AGGCAACTCC TATGGCACCA TGAGCGGGAC CTCTATGGCG
ACCCCGCATG TGGCAGGTGT TGCAGCCCTC ATAAAGGCAC AGAACCCTTC ATATTCAGCA
TCCCAGATAA AATCCGCCAT AATGAATAAT GTTGATCTCA AATCCGGACT ATCTGGTAGA
TGTGTTACCG GAGGTCGGAT CAATGCGTTC GCCAGTCTGG CCTCCTCCCT TCCACTTAAG
GCAAAGTTTT ACGGGGTTCC TGATACAACA ATCAAACCTC TTCGAATCCG GTTCTATGAT
GTCTCTGAAG GGATAATCTC CTCAAGACTC TGGAACTTTG GGGATGGAAA TACTACTGGT
GAAGTTAATC CATCGCACAC CTATTATAAT CCAGGCATCT ATACTGTCAC GCTTCAGGTA
AATGACGGTG TTGGTACTCA TGCCTCTGTG CTGGAGATAC AGGGAGGTTG A
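
The gene sequence is listed in space-separated blocks of ten bases, so whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical), the GC fraction of a listing like this could be computed as follows. The example runs on the first line of the listing only; the same functions can be applied to the full sequence and compared with the GC content reported for this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing.
first_line = "ATGTCCAAGC CAATAGTATT AATATTCATT CTGATATTGC TCATAAGTAT CGGTTTAGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```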
 
Protein sequence
MSKPIVLIFI LILLISIGLE SGAATSVTIL PGSQAVMKPP DSGVAPPYSL PSIVIPVTPK 
STLQGDGPVS GIVQSPVLSG IISSKVEPEQ LLPSTNITQN SSVCAQAYIP DRVIVKFKTD
HFSPLSSVNQ IQAEAHAAMG ATVLADPSTL GVEGMQVVSV PNTTGTMKAI ELYRMNPMVE
YAQPDYLYSI NSTIIDPPVS VNQQTVPYPE TIQVKPSTVQ VPNVPPPSSP PSWSTCLSSG
YQSGVVGFED PLSEEERYNA AQAEVDDINA YVKEHNLSWT AAVNPIMLMS PEEREHLKGL
RHDLKSSTIV SGAGITPMEG LPTSFDWRNN GGDYTTPIKN QGSCGSCWAF ATTGAFESYK
EIKSGNPGMN PDYAEQYLVN CAGDQRGCNG GLFTAMAYFV NKAGLSGGVG TVTEANYPYT
GSDGTCKSLS GYTRYSVDTA AGETWGYVGG GNEWSIPSDD AIKTAIYLYG PVAAGVYAES
TFDSYRSGIL DSTSSASYAN HAIIIVGWGT LNGRTYWICK NSWGTSWGES GWFRIFSGRL
RIGEGAAYFK YTASNPSGGT IAFNSNPSGA QIWIDGVNTG QVTPYTQTSV PIGTYSVTLK
LSGYQEYTRS VSVTSGQTTV ISATLSPIPT GSIAVSSTPS GARIWLDGVD TTKSTPATLS
SVPIGSHAVS LVLSGYNSYS TVVMVHEGQT SIVSGTLNQI NPGTQVLPND PSFSSLWGLH
NTGQSGGTGD ADIDAPEAWS ITTGSLGVIV AVVDTGVDYN HPDLVANIWR DPVTNTPGYD
FYGSNDPNPM DEHGHGTHCA GTIGAVGNNG IGVTGVNWNV KIMPLRFLGA DGYGSTSDAI
EAFAWGYAKG ARIFSNSWGA YGIDYALRDS INLYPDALFV CAAGNGDIYG NPYNTDSYPH
SPSSLANVNI LSVTATNRYD QRASWANYGA TTVDVAAPGV SIMSTTKGNS YGTMSGTSMA
TPHVAGVAAL IKAQNPSYSA SQIKSAIMNN VDLKSGLSGR CVTGGRINAF ASLASSLPLK
AKFYGVPDTT IKPLRIRFYD VSEGIISSRL WNFGDGNTTG EVNPSHTYYN PGIYTVTLQV
NDGVGTHASV LEIQGG
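
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one way to do this without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to the 64 codons as the standard code (the two tables differ only in permitted start codons). The `translate` helper is illustrative and does not handle ambiguous bases; an unknown codon raises `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace in the input."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Last line of the gene sequence listing (it starts on a codon boundary).
last_line = "AATGACGGTG TTGGTACTCA TGCCTCTGTG CTGGAGATAC AGGGAGGTTG A"
print(translate(last_line))  # NDGVGTHASVLEIQGG*
```

The output matches the final line of the protein sequence, followed by the stop codon.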