Gene Mhun_2781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_2781 
Symbol 
ID3922518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp3050056 
End bp3053184 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content49% 
IMG OID637898392 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_504195 
Protein GI88604017 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.759966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA CGGGATTATG TGAATCTGAC GTAGAAGAAA CAGCACTTGA TATCTTTCGT 
GACCTTGGGT ACGAGGTCCT CTATGGGTAT GAGGTCGCAC CCGGAGAACG GGATCAGATC
CGAAACTCGT TTGAAGAATA CCTCCTTGGG TCCAGACTCG AAGACGCCAT CTTCCGGCTC
AACAAAACGC TCCCCTGTGA TGCCCATGAA GCAGCACAGA AGATCCTCAC CAATCCCGGT
CATACCACCC TCATACAGAA TAATAAGACC GTTCATGAGA TGCTCTCCGG TGGAATCACG
GTTGAATACC GGCGGAAGGA TGGATCCGTT GGTGGAGATC AGGTCAGGCT TATTGACTTT
GAACACCCGG AGAACAACGA GTTCCTTGTC GTCAATCAGT TCACCATTCA GGAAGGGAAA
AACCACCGAA GGCCTGATAT CGTCCTCTTC ATCAACGGAA TCCCGGTCAT CATCTTTGAA
CTCAAAAACC CCACCGATGA ACGGGCCACC CTGACCATTG CCCATAACCA GCTCCAGACG
TATATGCAGG AGATCCCCTC ATTCTTCTCA TACAATACCA TGGCGATCAT CTCAGACGGA
ATCCAGACAC GGATTGGAAC CCTCTCCGCA TCACTTGAAC GGTTCTCCCG GTGGAGGACG
ATTGACGGAG CACAGGAAGC CCCGAATCAT ATCCCTGAGA TCGAAGTGCT CATCCGGGGT
GTCTGTAACA AAAAGCGGCT TCTTGACCTG ATCCGGTTCT TTATCGTATT TGAAGAGGAT
GAACGGGGAA ACACCATCAA GAAACTGGCC GGATACCACC AGTATCATGC CGTAAACCTG
GCGGTCAGGT CTACCGAGAC AGCTATCAGT GACGTCGGAG ACCGCCGGTG CGGAGTCGTC
TGGCATACCC AGGGGTCAGG AAAAAGTCTC ACCATGGTCT TTTACTCAGG AAAGATCATC
CAGACTCCAT CACTCCAGAA CCCGACGATC CTGGTCATCA CCGACCGGAA TGATCTTGAC
GATCAACTCT TCGGAACCTT CTCCCGGTGC AGAAAACTCC TCCGCCAGAC GCCCATCCAG
GCAGAGGGCC GGGACCATCT GAAGACACTG CTCAGTGTCG CATCTGGTGG TGTCGTCTTC
ACTACCATTC AGAAGTTCTT CCCCGAAGAT GAGAACAAGG AGACCTATCC CCTCCTCTCT
GACCGGAAGA ACATCATTGT CATCGCAGAT GAGGCACACC GGAGCCAGTA CGGGATGCAT
GGAAAAGTAA GCAAGAAAGG AGAGATAAGT TACGGATTTG CCAAGTACAT CCGTGATGCT
CTCCCAAACG CCTCATTTAT CGGATTTACC GGGACACCCA TCGACCTTCA GGATAAAGTC
ACCAGAAACG TCTTTGGTGA ATACATCAGC GTCTATGATA TCCAGCAGGC CATCAAGGAC
AAGGCAACCG TTCCTATCTA TTACGAAAGC CGTATCGTCG ATATCAGGAT GAATGAGCAG
ATAAAACCGC TCATTGACCA GGAGTTTGAA GAAGTCACTG AGAACGAAGA AGTCATCAGA
AAAGAGAAAC TCCGGACAAA ATGGGCTGCC CTTGAGGCCA TCGTCGGAGA TGAGAAACGA
ATCAATGAGG TTGCTGATGA TCTCATCCGC CACTTTAAGC AGCGGTCAGA TGCCATCGAA
GGGAAAGCGA TGATCGTCTG CATGAGCAGG CGGATCTGTG TGGAGCTCTA CAAAGCACTG
GTTGAGAGGC AACCTGATTG GGATTCTCAT TCAGACAATG AGGGGTTCCT GAAGGTGATC
ATGACCGGTT CAGCAGGCGA TGAGAAAGAT TTCCAGAAAC ACATCAGACC CAAATCAGGC
CGGGAACTCC TTGCAAAACG GTTTAAAGAT CCGGAAGATC CATTCAAAAT CGCAATTGTC
CGTGATATGT GGCTGACCGG ATTTGATGTT CCCTGTCTGC ATACTATGTA TATCGACAAG
CCCATGAAGG GCCATACTCT TATGCAGGCA ATCGCCAGGG TAAACCGGGT ATGGAGAGAT
AAGCCTGGTG GACTTATTGT TGATTACATC GGTCTTGCCG AAGAACTGAA AAAAGCCCTG
GTCACCTACA CGGAGAGTAA AGGCAAAGGG GATCCCGTAT TTGACCAGGA GAGGATAGTC
GATAAGATGA TGGAGAAGTA CGAGGTCTGT TGTCATCTCT TCCATGGATT TGACTGGTCA
GGCTGGCGGG GAGCTCCGGC AGCGACTCGT CTTTCCATCC TTCAGGCAGG ATTCAACTAT
GTCCTTGATG ATCCTGAACG GAAATCAGAC TTTATGCAGC ACGTGCTTGA ACTCTCCCAG
GCTTTTGCCC TTGCTCTCCC CCATGAAAAA GCCATTGAGA TCCGGGAAGA TGTCGCGTAT
TTCCAGGCCG TTCGGGCACA AATCATAAAG ACATCAGAAG GAGCAGGAAA ATCAGAGTAC
GAACTGAATC TGGCGGTAAA GCAGATAGTT TCAAAATCCA TTGTCCCCCT TGGTATTGTC
GATATCTTCA AGGAGATGGG AAAGGATGTC GGAGATATCT CAGTCCTCTC CGAAGAGTTC
CTCAATGACC TCCTGAAATA CAAAAATCAG AATGTGGCAA TTGCTGCCAT GACCCGGCTG
CTGAATGATC AGATCCGGGC ACGAACCCGG AAAAACACAA CCGAAGCAAG AAAATTCTCC
GAGATGCTTG AGCAGACGAT AAACAAGTAC AATAACCGGA AGGTCACGAC CCAGGAGATC
TTAAAAGAAC TTGCGCAGAT TGCACGGGAG ATCAGGGAGG CTCAGCAGAG AGGAGAGGAT
CTGGGACTTA CCGAGGCAGA ACTTGCATTT TACGATGCAC TCGGTGTGAA TGACAGTGCG
GTGGTGATTC TCGGTGATGA CATACTCAAA AAGATAGCCC AGGACCTGGT AAAGACCATC
AGGGAGAATG TTACGATCGA CTGGTCATCA CGTGAATCGG TTCGGGCAAA GATGCGGGTT
GCGATAAAGA GAATCCTCCG GGTGAACGGG TATCCTCCGG ACAAACAGGA GAATGCTATT
CAGACGGTGA TGCAACCTAT ATTCGTCAGA TCTATTAAAA TTTTATATAA TAGTAAACTA
TATAACTAA
 
Protein sequence
MATTGLCESD VEETALDIFR DLGYEVLYGY EVAPGERDQI RNSFEEYLLG SRLEDAIFRL 
NKTLPCDAHE AAQKILTNPG HTTLIQNNKT VHEMLSGGIT VEYRRKDGSV GGDQVRLIDF
EHPENNEFLV VNQFTIQEGK NHRRPDIVLF INGIPVIIFE LKNPTDERAT LTIAHNQLQT
YMQEIPSFFS YNTMAIISDG IQTRIGTLSA SLERFSRWRT IDGAQEAPNH IPEIEVLIRG
VCNKKRLLDL IRFFIVFEED ERGNTIKKLA GYHQYHAVNL AVRSTETAIS DVGDRRCGVV
WHTQGSGKSL TMVFYSGKII QTPSLQNPTI LVITDRNDLD DQLFGTFSRC RKLLRQTPIQ
AEGRDHLKTL LSVASGGVVF TTIQKFFPED ENKETYPLLS DRKNIIVIAD EAHRSQYGMH
GKVSKKGEIS YGFAKYIRDA LPNASFIGFT GTPIDLQDKV TRNVFGEYIS VYDIQQAIKD
KATVPIYYES RIVDIRMNEQ IKPLIDQEFE EVTENEEVIR KEKLRTKWAA LEAIVGDEKR
INEVADDLIR HFKQRSDAIE GKAMIVCMSR RICVELYKAL VERQPDWDSH SDNEGFLKVI
MTGSAGDEKD FQKHIRPKSG RELLAKRFKD PEDPFKIAIV RDMWLTGFDV PCLHTMYIDK
PMKGHTLMQA IARVNRVWRD KPGGLIVDYI GLAEELKKAL VTYTESKGKG DPVFDQERIV
DKMMEKYEVC CHLFHGFDWS GWRGAPAATR LSILQAGFNY VLDDPERKSD FMQHVLELSQ
AFALALPHEK AIEIREDVAY FQAVRAQIIK TSEGAGKSEY ELNLAVKQIV SKSIVPLGIV
DIFKEMGKDV GDISVLSEEF LNDLLKYKNQ NVAIAAMTRL LNDQIRARTR KNTTEARKFS
EMLEQTINKY NNRKVTTQEI LKELAQIARE IREAQQRGED LGLTEAELAF YDALGVNDSA
VVILGDDILK KIAQDLVKTI RENVTIDWSS RESVRAKMRV AIKRILRVNG YPPDKQENAI
QTVMQPIFVR SIKILYNSKL YN