Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0522 |
Symbol | |
ID | 5410843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 503555 |
End bp | 506380 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640867740 |
Product | hypothetical protein |
Protein accession | YP_001403683 |
Protein GI | 154150065 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.057808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.329166 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAAGC GATTAACGAT TGCACTTATT GCACTCGTGG CACTGGTGCT TGTTGCAGTA CTGCCGGCTT CGGCAACGTA CTACAATGTC AACAACACCG TTAACGCCCC GGGCGGATCA GTGTATATTG GAGAACAGCA GTTGAACATC CAGCCTCTTG TAAATAGCTA CCCGGCAGGG GGAGCAGTCT CTATCGGCTG GTGGGCCTCC GCTGCGTCTG TCACTTCTAC AGCACCGTCG CAGCAGTATA ACGTAAGCTC ACTTAACCCC TCCTCGTTCT ATGCTTCGCC GAGCGGGTAT GGAAACTACT TGGGTAACTG GTACGCGATC AATGGATCTG GTACCCCGAA CACGACAGCT GCATTCTTTA CTGTCGTGGA CCCAAGCTTG GCTATTGATG TCTGGGATAT CAACACCTCG ACAACCGTAT CCAGCGGTAC CGTGATTCAG GGCGACCTGC TTACCTTCCG TATTAACACG AACCTGCAGT CGGCAATCGA CCCGAACTAC CGTACGTCCT CTGTCACCCC AGCTGTCGGC AGCGTTGATA TTCAGGTAAA GCCTTCTTCC GGAAACACCT ATGCCTCGCT GTTTGTCAAC TCGACCAGCG GAAGCAGCAC CTACAACACT CAGAATATCC GGTTCCAGCC GGTTAACCAG TCCCTTTGGT TCTGGGGTAT CGCTGGCAAC GGTGTGCCCT CGACCACTGC CACTAGTGGC GCTGGCGCTA TCCAGTATGC CTGGGCAACC GGTGCAACTG ATTCAACCGG AAACAATGCG TATCCTGCCG GTGTCTACAC CGTCACCGCC CAGTCTGACA TCAACGGTCA GTACGACAAC TACCTGAATG GTGGCGCGTA CTATACCGGT AAGACGATCA GCCAGCCTGT AACGGTCACG ATCGCGTCCA ACACGGTTTC GATCTCCGCA AACGTGGACT CCGTTGTTCG CAGCAAGCCC TTCTCAGTTA CTATCACGGG CAAGCCCGGA GCAACCTACC ACCTGTGGGT CAAGGGCGTC AGCTCGCTGG ATGGCACCTA TGACAACCAG CCGCCCATCA TCTCCGCCAA CCAGGTTGGA GTTACCTTTG ATAACCAGTC TGACTCGAAT GCATTCGCCT ACTTCCCGAC TTCCTCTGAG TTCCCGCTTA ACGCAAACAC GGCTTACGCT AACGGTAACT ACACCTACCA GAATGGTGTC CAGCTCTGGA ACGATGTTGC CCACGGCACG TATGGATCGA CCGGTCTGAT TCTCGGTAAC GGTACCTTTG AGTACGCCAA CGTGACCCTC AACTCCGCCG GTACCAGGAC CATCCAGTGG ACAACTACCA ACTGGACCAA GGCCCAGCAG TACACCATCC GTGTTGAGCA GAATTTCGGT GGCGCAACCG GTTACAAGTA CGATGAAGTA AAGGTGCAGG TCCAGAAGGG CGCAGTCACC ATCGTCGCAG CTGGTTCCCA GAGTTACTAC CTCGGTGAAG AGGTTCAGTT CTCCGGTACC AACACTGAGT CCCAGACTAC ATACCTGTTC ATCACCGGCC CCAACCTGCC GACCCAGGGT GGAGCACTCT GGAGCACAAA CCCGCGGTAT GCAGGATCTG CCAACCCGGG TATCACCAAT GGAAACGCCT CTGACTTCCA GCAGGTAAGT GTCAACGGTG ACAACACCTG GTCCTGGAAC TGGGGAACCA CAACGGTTGC GCTCGATGCA GGTACCTACA CGGTTTATGC AGTGAGCCAG CCCAATGATG CAAACCACCT GAGCAACGCA GCATACGGCA CGGTTTCCAT CATCATCAAG AAGCCCTTCG TCAGTGCGAC TGCATCCCAG TCCACCGTTG CACAGGGTGA CCCAATCTAC ATTACCGGTA CCGCAGAGGG CCAGCCCAGT CAGGGTGTCC AGCTCTGGAT CCTCGGTAAG AACTACGCAC TCGTAGCAAC TGAAGCCGTA AACTCTGACT CGTCGTTCTC GTACGAGATC AAGGGCGCAA CGACCTCAAC CATGTACTCT GGTCAGTACT TCGTTGTCGT TCAGCACCCG ATGCAGAACG GTGTATTTGA TGTCGCACCC GCCACCGCAA GTGGTACCGT GACATACCCG CAGAACTCAA ACGGCGCTTC AGTGTATGTG TGGAACGACA ACCCGTCCAA CCCCGGTGTC TTCACTGCAA ATGGTGCAAC AAATGACTTC GCTCTCACCG GTGCCGGTAG CTTACAGGGC TCCGATGCCG CAGAGGCACT GGTCCAGGCA CTCAACAGCG CAAATGTTGA TGATACCTAC ACCAAACTCC AGTTCCTCGT AGAGACCCCC GTTATCACCA TCACCCCGGT TGGCGACCAC TCCGTTGGTG ACAAGTTCAC CATCACGGCA ACCACCAACC TCGCAGTAGG TGACAACGTC CTCTTCACAG TGTACTCATC ATCATTCCAG CCGACTGACA AGACCCAGAG CGGTGAGTTC AGCGGCGCAT CAGGCACTGT AGCAGTCACG CAGGGCACCA GCGGCCTTAA CGCTCTCTCG TTCGATGTTG ACGCATCCAC GTTCAAGCCC GACGAGTACC TCGTTACCGC CACCGCAGTC GGTCTCCCGG ACACGAACAA CGTCCCGACC GGCACTGCAC TGTTCAATGT CCTGCAGGCA TCGGCCCAGA CTGCAACCCC CACACCGGTT GCATCTGTGA CCACTGCAGC AGTTCAGACA ACGGTTCCAA CCCCGGTTCC GACCACTGCA ACCCCCACCA AGACCCCGAC ACAGCCCGGG TTTGGTGCAC TGGTTGCACT GATTGGTCTC GGTGCAGTTG CACTCTTTGT CGTGCGCAAG CACTGA
|
Protein sequence | MTKRLTIALI ALVALVLVAV LPASATYYNV NNTVNAPGGS VYIGEQQLNI QPLVNSYPAG GAVSIGWWAS AASVTSTAPS QQYNVSSLNP SSFYASPSGY GNYLGNWYAI NGSGTPNTTA AFFTVVDPSL AIDVWDINTS TTVSSGTVIQ GDLLTFRINT NLQSAIDPNY RTSSVTPAVG SVDIQVKPSS GNTYASLFVN STSGSSTYNT QNIRFQPVNQ SLWFWGIAGN GVPSTTATSG AGAIQYAWAT GATDSTGNNA YPAGVYTVTA QSDINGQYDN YLNGGAYYTG KTISQPVTVT IASNTVSISA NVDSVVRSKP FSVTITGKPG ATYHLWVKGV SSLDGTYDNQ PPIISANQVG VTFDNQSDSN AFAYFPTSSE FPLNANTAYA NGNYTYQNGV QLWNDVAHGT YGSTGLILGN GTFEYANVTL NSAGTRTIQW TTTNWTKAQQ YTIRVEQNFG GATGYKYDEV KVQVQKGAVT IVAAGSQSYY LGEEVQFSGT NTESQTTYLF ITGPNLPTQG GALWSTNPRY AGSANPGITN GNASDFQQVS VNGDNTWSWN WGTTTVALDA GTYTVYAVSQ PNDANHLSNA AYGTVSIIIK KPFVSATASQ STVAQGDPIY ITGTAEGQPS QGVQLWILGK NYALVATEAV NSDSSFSYEI KGATTSTMYS GQYFVVVQHP MQNGVFDVAP ATASGTVTYP QNSNGASVYV WNDNPSNPGV FTANGATNDF ALTGAGSLQG SDAAEALVQA LNSANVDDTY TKLQFLVETP VITITPVGDH SVGDKFTITA TTNLAVGDNV LFTVYSSSFQ PTDKTQSGEF SGASGTVAVT QGTSGLNALS FDVDASTFKP DEYLVTATAV GLPDTNNVPT GTALFNVLQA SAQTATPTPV ASVTTAAVQT TVPTPVPTTA TPTKTPTQPG FGALVALIGL GAVALFVVRK H
|
| |