Gene Mhun_0617 details

Gene Information

Locus tag: Mhun_0617
Symbol:
ID: 3922564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 713000
End bp: 716125
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 49%
IMG OID: 637896254
Product: HEAT repeat-containing PBS lyase
Protein accession: YP_502092
Protein GI: 88601914
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
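
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 713000
end_bp = 716125

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (3126 bp) and Protein Length (1041 aa).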


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.639116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.314613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCTA TTTTCAGACC GAATCCGGAA CGGATGAGAC TTAAAGGGAA TTTAAAAGGC 
CTTCTCCGGT TACTGGACGG GAAAAATGAT ACTGCATTAC GGATAGAAGC AATATTAGCG
CTCGGCCGGA TGAAAACTCC GGTTGCAGTT GAAGAACTTA TCAGACTTTT TCAGGACCCT
GAGTATGGGA TCAGGAATGC TGCATCTCAT GCGCTTGTTC AAATTGGATC TGATGCTATA
AACCCGCTGA TTCGGGCTTT GTCTGTATCT GATGAAGAGA CCGCGAGGAT GATTCATACC
ACTCTTACCG CGATGGGTGA TGATGCAGCC AGGGAAATAG TAAAGAATAT CACCACACTT
CAGGGTATTG GATATGAACG AGCAGGATAT ATCCTGAATT CAATGGGTAC TCGGATTATT
CCGGTTCTTA TAGATGCCTA CGGGACAGAG GATCAGACGA CGACCAGGTT CATCGAGGGT
CTGTTTGAGT CATTTGGCAG ATCAGCAATC CAGCCGTTAA TCAAGGGGCT GAATCATGAC
CATGAAGAAG TCAGAGCACG TATCTCTGCT TATCTTATTA TTCTGGGTGA CCAGGTAGTT
GGAGATCTCC TCTCATCATG CGGTCAGGAT GAGGAATGGC TGCGAGAACT GAAATTTTAC
ATCATCAGTG AGATAGGAAA GCCGGCACTC GACCCCCTTT ATCAGGCACT CAAAGATCCA
AATCCGGTCA CTTCTTCAAT GGCACAAAAG GCATTCCTTG AGTTTGGAGA ATCGGCAATT
ATGCCTCTGA TATCCGGTCT GTATGATCAG GACCCTGAAG TCCGGAAGGT ATCAGAAAAT
GCCCTGACCA GGATAGGAGA GCCTGTCATC CCTCACCTGC TTGAGGAGAT GTCTGTACGC
CGTGATACTG ATCGCGAACC GATTATCTCT GTCATTCAGC ATATCGGTGA ACCTGCTATC
CCTTACCTTA TCGGGAATCT GATTCATTCG AAGGGTGATC GTTCAAAACA GATGGTTCAG
ATTCTGTCAC GAATGGGAGC TGTCACCATT CCCTATCTTA TCAATTCTGT CAGGGAGCAA
TCAGATACGT CCGGAATAAA AGAGGCAATT CTCTCGATGG GGCGTATTGC ATTCCCCTTT
TTGGAAGAGG CATCTGAGCG TGAGCGGGGA AAAACTTCTG TATTCGCTAT TGATCTTCTC
AGACAGATTG ATCCGGTCAG ATCCATTGAG CCGATGATTA GTGCTCTGTA TCATACTGAC
CGGGAGGTAC GGGAGACTGC TCTTGATAAC CTGGTTGCTT CAGGTGAGAT TGCGATCCCC
CGTCTGATTC AGGTTCTTGG TTCTGGCGAT GAAGAGGCGG TCGATCTGGC AAAGATTGCT
CTCATGAAGA ATGGAGAACT GGCAATCCCC CATCTTGTAG ATTCGTTGTC AGATCCCATG
GGTGCCAATC ATGCTCTTAT CCTGGAGATT CTCCGTGAGA ACGAGACTGC TGCTCTTCCG
TACCTTATCC CGTTCATGGC ACCGGGAAAG GACGGTTATG ATGAGGCGAT GACACTTATT
CGTGAGATTG GTGCAGATGC AACCGCTCCT CTTCTGCAGG CTCTTTCAGG TTCAGGGCCC
GAACTGGCCA GGGAGATCAC GTCTTACCTG TCAGAGTTAT TCAGTCAGGA TCCCAGGAAA
TTTATCATGC GGCTCTTTTC CGGTCAGGTA CCGGACACCG ATCTGATGTA CGAACTGGTG
CATACCTCAC CTGAGGTTGT CATTCCGGAG CTTATCGATA TATTTCAGGG AGATGACGGG
TCCCGGGCAC TTATTGCAGG AGATCTGTTG TCCAGGTTTG GAAAGGATGC TATTCCACCG
TTTATTGATG CCCTCCGGAG TGAAACTGAT GATGACCGGA AGCTTGAGAT AACGAGTTTC
CTGATTCGAA TCGGAGAAGA TGCCATTCCT GATCTGGTAG CACGGCTTGG TGACGAGGAT
ATTGCTCCTT ATGCCATGGC AGCATTAAGC GCTATTGGCG AACCTGCAGT ACCCGCTCTT
TTGCCTTTGC TGAAAAGTCC TGACCTGATT GCACAACAAT ATGCGATTCA TGCTCTTACC
AGAATTGGTA CTCCGGCTGC TTCGGCACTG ATGACCCTGA TGCAGGAAGA TGAGAGTCTT
GTGCCATTGA TTAGCCGGAT AATGGCCGGA ATGGGAGGTT CAGCCCTGCC AGAGCTTATT
CAGGAACTTG AGACGCTTCA AGGATCAGGA CAGGAGGGTA GCAGCAGAGG TATTGCGGTC
ATGTCGCTGA TAACCGAGAT AGCTCTTTCA AACAGAGACG ACCTCAGGCA CCTCTTCTCC
ATACAAAATC CGATCCTCAT CACGATGTTT GAGCGTATAT TTATCAGTAA GGGTGAGCAG
ATCCTGGCCC CTCTTCTGGA TGCCGTCATG TATGAACAGG CTGTACCTGA CATGGCCGCC
AATATCATCA CCTCCATGCG CCCACAGGCA CAGACCGCCG TTACCAGGCT TTTAAAATCC
ATAGGTCCTG GAGACAGACG ACGGATTGCC CTTTTAAAGG TGCTTGGTGT TCTGAAAGAT
CCGGCATCTG CCCCTCTTAT GTATGAGGCA TTGCAGGATC CGGATCATGA AATCAGGATG
ACTGCCATCA GGGAACTTGG AAAGTTTGGC AGGGAAGCTC TGGGACCACT GACCGATGCG
ATGCATGACC CGGACCCTCA TGTTCGTGCA GCAGCGGTTG AATCATTGGG TGATATCGGA
CTTCCTGTCC TTGATCAGCT GATTGCAGCA TTAAAAGATC CCGACGGTTC AATCCGTGCA
GCAGCACTCA AAGGGATATC CAAGATCGGG GAACCAGGTC AGTTTATGCT GGTTCAGACA
CTTGATGACA AAGACAGAAA AGTCAGAAAT GCGGTGGCAA GGTTATTGGA AGAGAGCGGG
TGGAAACCCA AGTATACGAC TGATCGGCTG AGTTACCTCT TTGCCAAAGA GAAATTTGAT
GATCTCATCC GTATCGGACC TCCAAGTGTT GATACCCTTG CCAAGGGGCT TCATGATGAT
GACCCTGAAA TCAGGGAACG ATCCCGGGAT GCACTCGCTG TCATAAGGGA TTCTATTCAA
ACATAA
 
Protein sequence
MKSIFRPNPE RMRLKGNLKG LLRLLDGKND TALRIEAILA LGRMKTPVAV EELIRLFQDP 
EYGIRNAASH ALVQIGSDAI NPLIRALSVS DEETARMIHT TLTAMGDDAA REIVKNITTL
QGIGYERAGY ILNSMGTRII PVLIDAYGTE DQTTTRFIEG LFESFGRSAI QPLIKGLNHD
HEEVRARISA YLIILGDQVV GDLLSSCGQD EEWLRELKFY IISEIGKPAL DPLYQALKDP
NPVTSSMAQK AFLEFGESAI MPLISGLYDQ DPEVRKVSEN ALTRIGEPVI PHLLEEMSVR
RDTDREPIIS VIQHIGEPAI PYLIGNLIHS KGDRSKQMVQ ILSRMGAVTI PYLINSVREQ
SDTSGIKEAI LSMGRIAFPF LEEASERERG KTSVFAIDLL RQIDPVRSIE PMISALYHTD
REVRETALDN LVASGEIAIP RLIQVLGSGD EEAVDLAKIA LMKNGELAIP HLVDSLSDPM
GANHALILEI LRENETAALP YLIPFMAPGK DGYDEAMTLI REIGADATAP LLQALSGSGP
ELAREITSYL SELFSQDPRK FIMRLFSGQV PDTDLMYELV HTSPEVVIPE LIDIFQGDDG
SRALIAGDLL SRFGKDAIPP FIDALRSETD DDRKLEITSF LIRIGEDAIP DLVARLGDED
IAPYAMAALS AIGEPAVPAL LPLLKSPDLI AQQYAIHALT RIGTPAASAL MTLMQEDESL
VPLISRIMAG MGGSALPELI QELETLQGSG QEGSSRGIAV MSLITEIALS NRDDLRHLFS
IQNPILITMF ERIFISKGEQ ILAPLLDAVM YEQAVPDMAA NIITSMRPQA QTAVTRLLKS
IGPGDRRRIA LLKVLGVLKD PASAPLMYEA LQDPDHEIRM TAIRELGKFG REALGPLTDA
MHDPDPHVRA AAVESLGDIG LPVLDQLIAA LKDPDGSIRA AALKGISKIG EPGQFMLVQT
LDDKDRKVRN AVARLLEESG WKPKYTTDRL SYLFAKEKFD DLIRIGPPSV DTLAKGLHDD
DPEIRERSRD ALAVIRDSIQ T
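
Both sequences above are printed in blocks of 10 residues, 60 per line, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a simple GC fraction calculation; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the example input is only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGAAGTCTA TTTTCAGACC GAATCCGGAA CGGATGAGAC TTAAAGGGAA TTTAAAAGGC"

seq = clean_sequence(first_line)
print(len(seq), seq[:3])
print(round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to the full gene sequence and then `gc_fraction` to the result would give a value to compare against the GC content field of the record.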