Gene Mhun_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_0644 
Symbol 
ID3923096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp746842 
End bp750120 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content47% 
IMG OID637896283 
Productperiplasmic copper-binding 
Protein accessionYP_502119 
Protein GI88601941 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.167347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0259302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGTCAG GGAAGAGTCT GTGCATTCTT CTTATTATTG CATTAATAGC AGGAGGGACT 
CACTGCTATG CGGATGGAGA GGATTTCAGT GAGGGGGGGT TTGCTCCCTT GTCACTTCAA
TGGCAGAAGA TAACAGAATC TGAGAATACC TCAGCAGACA GCCAATATTG CTGCGGAAGC
GGGCCGCTTC GGGAGAGCCC CGTGGATCTC TCCCACCTGA TAGGAAAGAA GATTCGAAGT
CTGTCAATTC TGGCAGATTA TCCATCAAAA TTTGATCTTC GGGATTCAAA ACGGGTCCCT
GCTATCCGTG ACCAGGGGCA GAGCGGGAGC TGCTGGGATT TTGCTGCAGT AAAATCTCTT
GAATCATCTC TTCTTCCTGA AGTTGCCAAA GATTTTTCTG AAAACAACCT GAAAAACCAT
GTCTCTATAT ATGATCCGGA AGGGTTCGAC TTTGCCGATG GGGGAAATGA TCTCATGGCT
GCCGCCTATT TCCTTCGGGG TTCTGGGCCG GTACCGGAAA AACTTGACCC ATATAATCCG
TATTCTTTTA TTTCACCCCA GAATCTCCCG GTAGAGACAC GGGTCAGCGA TGTTCCGATG
ATACCCGGAC GCTCCGGTCC GACCGATAAT GAAAATGTAA AATGGGCTCT GGTGAATCTT
GGCTGTCTTT ACTCAACTAT CCTCTATGAT GACCAGTACT TCAATACCGG AACAAATGGG
TATTATAATC CTGATGGAAC AACCCCCAAC CATGCGATCT CCATAATCGG CTGGGATGAT
ACCTATCCTG CATCGAATTT TAATACTGAA CCGGCAGGAG ACGGGGCATT TATCTGTGCA
AACAGCTGGG GTACAAACTG GGGAGATAAA GGGTTCTTTT ATATCTCATA TTACGACACT
CTGATTGCAA AGCGGATCTC TGCATTTACT GCAGCAACCG ACAATACTGA CGGTAGTTAT
GGCTATGACA CATTAGGCTG GGTAAACAGT TTCGGATTTG GATTTGCCGA TGCCTCTGCT
GCAAATGTTT TCACTGCTGA TGCTGATATC GAGGTTACGA GTGCCGGGTT TTACATCCCA
CAGGTCGGTA CCAGTGTTAC CGCATCAGTA TATCTCAATC CAGATAATGG ACCGGTAGGA
AGCGCTCCGG TGGCAACGTC AAATGGTGAA GTATATCAGA TTCCCGGGTA CCACTCACTG
CAATTTGATC TCCCGGTAAA GGTAAAGAAA GGTGAGAAAT TCTCTATAGT TGTTGATTTT
AACACCCCGG ACTATGGATT TCCTATCCCG GTTGAATATC CGGTGCCAGG GTACAGCAGC
AAAGCGACTG CACAGCCTGG ACAGAGTTAT GTGAAAACAT CCACTGGAAA ATGGTCTGAT
CTGACAACCT GGGATCCTCA GGGGAATGCA TGTATTCGGG CAGGATACCG GCTTATTTCC
GGGCCAAAAG CAGATTTTTA CGCAGAACCA ACCAGTGGGA GTGCACCGCT GACGGTAACG
TTCCATGATA TCTCAACCGG AAACCCTGAA CGCTGGCTCT GGCAATTTGG TGACGGCGCT
ACTTCTACTG AGCAAAATCC CATCCATACT TATACCTATA ACGGAGTATA CTCCGTCACC
CTCACAAGTG ATACCCCGGC CGGAGAATCT ATGATGGTTA AGAAGAATTA TATAACCGTC
TCCGAACCGA CAAAGATAAT CGTTCCCGAT GACCATTCGC TTATTCAGGA AGCGATCAAC
ACCGCTCCAC CAGGATCGGC GATTCTCGTC AGATATGGGT ATTATCCTGA AAAATTGACT
ATAAACAAAC CGATAACCCT TATCGGAGAG AGCAGTTCCG ATGGACAAAA ACCGATCATT
GATGCACAAT TTACCGGAAC ACCAGTCAGC ATCACTGCCG CCGGAGTAAC GGTTGAGAAC
TTTTCCCTGA CAGGAGCATG GTCTGAAACT GCCATCCGCC CTGGTGTTGC AGTCAGAGGG
AATAAAGCGG TTATCAGGAA TAACTGGATC TTTGAGAATT ACGCAGGAGT AAGGTTTGAA
AGTGTTGTCG GAGGAATTCT TGATGGAAAC ATCATCTGGA ACTCAACCAG CAATGCAATC
TACGGAGAGT CCAGTTCATA TCTTGATATA TCAAATAACA CCGTTGTCTG GACAAAAGAT
TCATCAGCGG TCAGGCTTGT ATCTGCATAC AACTCCGTTC TCAAAGGCAA TGCAATCGCA
GAAAATAAAA AAACGGGACT TTCTGTTACC GGAACGGGGA TGACCATCTA TGATAATTAC
CTGAATAATT CCCAAAATGT TGCACTGACA CCAGATACGA AAGTTACCTG GAATATTCCA
AAGACAACCG GACCAAATAT TGTTCTTGGC CCGTATATCG GCGGTAATTT CTGGGCTACA
CCGGATGGAA CCGGATTTTC AGAGACGCAC AAAGATGAAA ACGGCGATGG TTTTTGTGAT
GAGGTCTACC GGATTGGGGG AGACAATGTA GATGAACTAC CGCTCGCTAT TCCCGATTCC
GTGCCTCCAT CAGCATCATT TGAGGCTGAA CCAAGGACCG GGAGTCTGCC ACTGACTGTC
CAGTTCAGAG ATACCTCACT CGGCACCATT GAGAGCTGGC TCTGGGATTT CGGAGATGGT
ACGTCCTCAT CCGAGCAGCA CCCCGTTCAT GTATACGAGA ACATTGGATC ATATAATGTA
AACCTGACAG TTACCGGCCC GAAAGGGAGT GATGCCGAAT TAAAACTCTC ATATATCACG
GTCACCGGCA CCGGAAACAA ATATATCTTG ACCTTGATGC CTGGCTGGAA TTTCTTCACT
CCTCCAAAAT CACTCTCACC AGGAAGTGAT ACTGCCGCAC TATTTGGATC CATAGAAACC
AGCGGGCACT CAATATTCGA ATTTCCAAAC CAGACATGGG GCTGGACAAA AGTAAACAGG
GATACAGTTC TGCATCCGGT GACGGGGTAC TGGATTTACT CCAAGAACCG GGTGGACACA
ACTCTCTGGC TAGATCCGGT TAGTGGCGGG AAGAAAACGG TTGAACCTGG ATGGAACGCT
ATCGGATCTC CGGGAATCGG ACCGATTAAA GCTAAGGATG TAATGAGTAC TCTGGGGGAC
AGCTGGACAT ATCTCATTGG ATATGATGAG AGTATGCAAA AATACGAGGA CGTGATAATC
AGGAAGGGCT CAGGGATTCA TTCGGATGAC CGCCTTCTCA AATCAGGACA TGGATACTGG
CTTTATGCGA CCGGGGCGGG AGATATCTAT GCTGCCTAA
 
Protein sequence
MWSGKSLCIL LIIALIAGGT HCYADGEDFS EGGFAPLSLQ WQKITESENT SADSQYCCGS 
GPLRESPVDL SHLIGKKIRS LSILADYPSK FDLRDSKRVP AIRDQGQSGS CWDFAAVKSL
ESSLLPEVAK DFSENNLKNH VSIYDPEGFD FADGGNDLMA AAYFLRGSGP VPEKLDPYNP
YSFISPQNLP VETRVSDVPM IPGRSGPTDN ENVKWALVNL GCLYSTILYD DQYFNTGTNG
YYNPDGTTPN HAISIIGWDD TYPASNFNTE PAGDGAFICA NSWGTNWGDK GFFYISYYDT
LIAKRISAFT AATDNTDGSY GYDTLGWVNS FGFGFADASA ANVFTADADI EVTSAGFYIP
QVGTSVTASV YLNPDNGPVG SAPVATSNGE VYQIPGYHSL QFDLPVKVKK GEKFSIVVDF
NTPDYGFPIP VEYPVPGYSS KATAQPGQSY VKTSTGKWSD LTTWDPQGNA CIRAGYRLIS
GPKADFYAEP TSGSAPLTVT FHDISTGNPE RWLWQFGDGA TSTEQNPIHT YTYNGVYSVT
LTSDTPAGES MMVKKNYITV SEPTKIIVPD DHSLIQEAIN TAPPGSAILV RYGYYPEKLT
INKPITLIGE SSSDGQKPII DAQFTGTPVS ITAAGVTVEN FSLTGAWSET AIRPGVAVRG
NKAVIRNNWI FENYAGVRFE SVVGGILDGN IIWNSTSNAI YGESSSYLDI SNNTVVWTKD
SSAVRLVSAY NSVLKGNAIA ENKKTGLSVT GTGMTIYDNY LNNSQNVALT PDTKVTWNIP
KTTGPNIVLG PYIGGNFWAT PDGTGFSETH KDENGDGFCD EVYRIGGDNV DELPLAIPDS
VPPSASFEAE PRTGSLPLTV QFRDTSLGTI ESWLWDFGDG TSSSEQHPVH VYENIGSYNV
NLTVTGPKGS DAELKLSYIT VTGTGNKYIL TLMPGWNFFT PPKSLSPGSD TAALFGSIET
SGHSIFEFPN QTWGWTKVNR DTVLHPVTGY WIYSKNRVDT TLWLDPVSGG KKTVEPGWNA
IGSPGIGPIK AKDVMSTLGD SWTYLIGYDE SMQKYEDVII RKGSGIHSDD RLLKSGHGYW
LYATGAGDIY AA