Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mhun_0617 |
Symbol | |
ID | 3922564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanospirillum hungatei JF-1 |
Kingdom | Archaea |
Replicon accession | NC_007796 |
Strand | + |
Start bp | 713000 |
End bp | 716125 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637896254 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_502092 |
Protein GI | 88601914 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.639116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.314613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCTA TTTTCAGACC GAATCCGGAA CGGATGAGAC TTAAAGGGAA TTTAAAAGGC CTTCTCCGGT TACTGGACGG GAAAAATGAT ACTGCATTAC GGATAGAAGC AATATTAGCG CTCGGCCGGA TGAAAACTCC GGTTGCAGTT GAAGAACTTA TCAGACTTTT TCAGGACCCT GAGTATGGGA TCAGGAATGC TGCATCTCAT GCGCTTGTTC AAATTGGATC TGATGCTATA AACCCGCTGA TTCGGGCTTT GTCTGTATCT GATGAAGAGA CCGCGAGGAT GATTCATACC ACTCTTACCG CGATGGGTGA TGATGCAGCC AGGGAAATAG TAAAGAATAT CACCACACTT CAGGGTATTG GATATGAACG AGCAGGATAT ATCCTGAATT CAATGGGTAC TCGGATTATT CCGGTTCTTA TAGATGCCTA CGGGACAGAG GATCAGACGA CGACCAGGTT CATCGAGGGT CTGTTTGAGT CATTTGGCAG ATCAGCAATC CAGCCGTTAA TCAAGGGGCT GAATCATGAC CATGAAGAAG TCAGAGCACG TATCTCTGCT TATCTTATTA TTCTGGGTGA CCAGGTAGTT GGAGATCTCC TCTCATCATG CGGTCAGGAT GAGGAATGGC TGCGAGAACT GAAATTTTAC ATCATCAGTG AGATAGGAAA GCCGGCACTC GACCCCCTTT ATCAGGCACT CAAAGATCCA AATCCGGTCA CTTCTTCAAT GGCACAAAAG GCATTCCTTG AGTTTGGAGA ATCGGCAATT ATGCCTCTGA TATCCGGTCT GTATGATCAG GACCCTGAAG TCCGGAAGGT ATCAGAAAAT GCCCTGACCA GGATAGGAGA GCCTGTCATC CCTCACCTGC TTGAGGAGAT GTCTGTACGC CGTGATACTG ATCGCGAACC GATTATCTCT GTCATTCAGC ATATCGGTGA ACCTGCTATC CCTTACCTTA TCGGGAATCT GATTCATTCG AAGGGTGATC GTTCAAAACA GATGGTTCAG ATTCTGTCAC GAATGGGAGC TGTCACCATT CCCTATCTTA TCAATTCTGT CAGGGAGCAA TCAGATACGT CCGGAATAAA AGAGGCAATT CTCTCGATGG GGCGTATTGC ATTCCCCTTT TTGGAAGAGG CATCTGAGCG TGAGCGGGGA AAAACTTCTG TATTCGCTAT TGATCTTCTC AGACAGATTG ATCCGGTCAG ATCCATTGAG CCGATGATTA GTGCTCTGTA TCATACTGAC CGGGAGGTAC GGGAGACTGC TCTTGATAAC CTGGTTGCTT CAGGTGAGAT TGCGATCCCC CGTCTGATTC AGGTTCTTGG TTCTGGCGAT GAAGAGGCGG TCGATCTGGC AAAGATTGCT CTCATGAAGA ATGGAGAACT GGCAATCCCC CATCTTGTAG ATTCGTTGTC AGATCCCATG GGTGCCAATC ATGCTCTTAT CCTGGAGATT CTCCGTGAGA ACGAGACTGC TGCTCTTCCG TACCTTATCC CGTTCATGGC ACCGGGAAAG GACGGTTATG ATGAGGCGAT GACACTTATT CGTGAGATTG GTGCAGATGC AACCGCTCCT CTTCTGCAGG CTCTTTCAGG TTCAGGGCCC GAACTGGCCA GGGAGATCAC GTCTTACCTG TCAGAGTTAT TCAGTCAGGA TCCCAGGAAA TTTATCATGC GGCTCTTTTC CGGTCAGGTA CCGGACACCG ATCTGATGTA CGAACTGGTG CATACCTCAC CTGAGGTTGT CATTCCGGAG CTTATCGATA TATTTCAGGG AGATGACGGG TCCCGGGCAC TTATTGCAGG AGATCTGTTG TCCAGGTTTG GAAAGGATGC TATTCCACCG TTTATTGATG CCCTCCGGAG TGAAACTGAT GATGACCGGA AGCTTGAGAT AACGAGTTTC CTGATTCGAA TCGGAGAAGA TGCCATTCCT GATCTGGTAG CACGGCTTGG TGACGAGGAT ATTGCTCCTT ATGCCATGGC AGCATTAAGC GCTATTGGCG AACCTGCAGT ACCCGCTCTT TTGCCTTTGC TGAAAAGTCC TGACCTGATT GCACAACAAT ATGCGATTCA TGCTCTTACC AGAATTGGTA CTCCGGCTGC TTCGGCACTG ATGACCCTGA TGCAGGAAGA TGAGAGTCTT GTGCCATTGA TTAGCCGGAT AATGGCCGGA ATGGGAGGTT CAGCCCTGCC AGAGCTTATT CAGGAACTTG AGACGCTTCA AGGATCAGGA CAGGAGGGTA GCAGCAGAGG TATTGCGGTC ATGTCGCTGA TAACCGAGAT AGCTCTTTCA AACAGAGACG ACCTCAGGCA CCTCTTCTCC ATACAAAATC CGATCCTCAT CACGATGTTT GAGCGTATAT TTATCAGTAA GGGTGAGCAG ATCCTGGCCC CTCTTCTGGA TGCCGTCATG TATGAACAGG CTGTACCTGA CATGGCCGCC AATATCATCA CCTCCATGCG CCCACAGGCA CAGACCGCCG TTACCAGGCT TTTAAAATCC ATAGGTCCTG GAGACAGACG ACGGATTGCC CTTTTAAAGG TGCTTGGTGT TCTGAAAGAT CCGGCATCTG CCCCTCTTAT GTATGAGGCA TTGCAGGATC CGGATCATGA AATCAGGATG ACTGCCATCA GGGAACTTGG AAAGTTTGGC AGGGAAGCTC TGGGACCACT GACCGATGCG ATGCATGACC CGGACCCTCA TGTTCGTGCA GCAGCGGTTG AATCATTGGG TGATATCGGA CTTCCTGTCC TTGATCAGCT GATTGCAGCA TTAAAAGATC CCGACGGTTC AATCCGTGCA GCAGCACTCA AAGGGATATC CAAGATCGGG GAACCAGGTC AGTTTATGCT GGTTCAGACA CTTGATGACA AAGACAGAAA AGTCAGAAAT GCGGTGGCAA GGTTATTGGA AGAGAGCGGG TGGAAACCCA AGTATACGAC TGATCGGCTG AGTTACCTCT TTGCCAAAGA GAAATTTGAT GATCTCATCC GTATCGGACC TCCAAGTGTT GATACCCTTG CCAAGGGGCT TCATGATGAT GACCCTGAAA TCAGGGAACG ATCCCGGGAT GCACTCGCTG TCATAAGGGA TTCTATTCAA ACATAA
|
Protein sequence | MKSIFRPNPE RMRLKGNLKG LLRLLDGKND TALRIEAILA LGRMKTPVAV EELIRLFQDP EYGIRNAASH ALVQIGSDAI NPLIRALSVS DEETARMIHT TLTAMGDDAA REIVKNITTL QGIGYERAGY ILNSMGTRII PVLIDAYGTE DQTTTRFIEG LFESFGRSAI QPLIKGLNHD HEEVRARISA YLIILGDQVV GDLLSSCGQD EEWLRELKFY IISEIGKPAL DPLYQALKDP NPVTSSMAQK AFLEFGESAI MPLISGLYDQ DPEVRKVSEN ALTRIGEPVI PHLLEEMSVR RDTDREPIIS VIQHIGEPAI PYLIGNLIHS KGDRSKQMVQ ILSRMGAVTI PYLINSVREQ SDTSGIKEAI LSMGRIAFPF LEEASERERG KTSVFAIDLL RQIDPVRSIE PMISALYHTD REVRETALDN LVASGEIAIP RLIQVLGSGD EEAVDLAKIA LMKNGELAIP HLVDSLSDPM GANHALILEI LRENETAALP YLIPFMAPGK DGYDEAMTLI REIGADATAP LLQALSGSGP ELAREITSYL SELFSQDPRK FIMRLFSGQV PDTDLMYELV HTSPEVVIPE LIDIFQGDDG SRALIAGDLL SRFGKDAIPP FIDALRSETD DDRKLEITSF LIRIGEDAIP DLVARLGDED IAPYAMAALS AIGEPAVPAL LPLLKSPDLI AQQYAIHALT RIGTPAASAL MTLMQEDESL VPLISRIMAG MGGSALPELI QELETLQGSG QEGSSRGIAV MSLITEIALS NRDDLRHLFS IQNPILITMF ERIFISKGEQ ILAPLLDAVM YEQAVPDMAA NIITSMRPQA QTAVTRLLKS IGPGDRRRIA LLKVLGVLKD PASAPLMYEA LQDPDHEIRM TAIRELGKFG REALGPLTDA MHDPDPHVRA AAVESLGDIG LPVLDQLIAA LKDPDGSIRA AALKGISKIG EPGQFMLVQT LDDKDRKVRN AVARLLEESG WKPKYTTDRL SYLFAKEKFD DLIRIGPPSV DTLAKGLHDD DPEIRERSRD ALAVIRDSIQ T
|
| |