Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0472 |
Symbol | |
ID | 5411107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 455633 |
End bp | 458581 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640867684 |
Product | phosphoribosylformylglycinamidine synthase |
Protein accession | YP_001403633 |
Protein GI | 154150015 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain |
TIGRFAM ID | [TIGR01736] phosphoribosylformylglycinamidine synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.311423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.43995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTT TTGTCCACCG CATAGAAGTT CACTACTCAA AGGATCCCCG CGAGAAGACC CGGACAGAAA GGATCCGGGC GCTCGGTTTT GGCATCGATG TACTGCAACT CGTGGACGTG TACACGATCG CTACGGTCAA GCGTGATCTC CTGCCGGCGG AACTCGATGA GATTGGCGCC CGGCTCACAA ACCCTGTAGT GCAGGACTTC GTGGTGGACG AAGCTACGCA TGCCCTTTTC GATTACGCAA TCGAGATCGG TTTCCTTCCC GGCGTGACCG ATAACGTGGG CACAACCGCC CGGCAGATGG TCGAGGACTA CCTGGGCATC ACTTTTGGTG AGGGGGAGAT GGTTTCTTTC TCTTCGCTCT ATCTGGTCAG GGGCAGGCTT TCACCTGAGG CCCTTGCCCA ACTCCCGGCC ACACTTGCAA ATCCGCTCAT CAACCGCGTA AGTACCAAGA CCCGGCAGGA GTACGGCGAG ACAGGTATGG ATCGGATCAT CCCGGAGGTA CGGCTCCACG AGATCCCCGC TGCGGGCATC GTAGATCTCT CCCTCCCGGA CGAGGATCTG CTGAGGCTCG GGAAAGAGGG GATTTCTGAT CCGCAGACCG GCCAGCGTCG GGGTCCGCTT GCGCTTGATC TTGCCCAGCT TCACGCGATC CGTAACTATT TCGACCGGGC CGGGCGCAAG CCTACAGATG TGGAGATTGA ATCCCTTGCC CAGACCTGGA GCGAGCACTG CAAGCACACG ATCTTTGCAT CGGCCATGGA CGATGATGTC CCAAAAGGCC TGTATAAGTC CTTCATCCAG GCGGCGACAA ACGAGATCCG GGCAGAGAAG GGAGACCGCG ATATCTGCGT TACGGTTTTT ACCGATAACT CAGGTGCAAT TGTCTTTGAT GATGAGTATC TTGTGACCCA CAAGGTTGAA ACCCATAACT CTCCATCCGC ACTGGACCCG TTTGGCGGGG CGCTTACCGG GATTGTTGGC GTGAACCGGG ACACGATCGG ATTTGGCCTG GGTGCCAAGC CCTGCATTAA TGTGTACGGG TTCTGCGTTG GTGACCCAGC TTCGGAGCCG GCCCTCTACA GGGGCAGGGA CCGGAAAAAT CCCATTATAT CTCCCCGGCA GATTCTCGAT GGCGTTGTGC GGGGAGTCGG GGTGGGAGGC AATTGCTCGG GTATTCCCAC ACCGCAGGGC TTCTGCTGGT TCGATAACCG GTATGCGGGT AAGCCTCTCG TCTTTGCCGG CACGGTCGGC GTGATGCCCC GGGAACACAA TGGCCAAAAC CTCTTCGAGA AAAAGGCAGA ACCCGGCGAT CTCATCGTGA TGATGGGCGG ACGGGTGGGA AAGGATGGTA TCCACGGCGC TACGTTCTCG TCAGAGGCCC TTGACCCGAA CAGCCCGGTA ACCGCAGTGC AGATCGGCGA CCCGATCACG CAGAAAAAGT TCTCTGATGT GCTTGTCAAG GAAGCCCGGG ATCTCGGATT GTACCGGAGC ATTACGGATA ATGGTGCAGG TGGCCTCTCC TGCTCGGTGG CCGAGATGGC AAAGGAATGC AATGGCTGCC ATGTGGAGCT GGACAGGGTT CCCTTAAAGT ACCCCGGCAT GGCCCCCTGG GAGATCTGGA TCTCCGAGTC GCAGGAGCGT ATGACGCTTG CGGTCTCCCC GGAAAAAAGT GCGGAATTGA TCGATCTTTG TGCCCGGAGG GGAGTCGAGG CCACTCTTAT AGGGAAATTT ACCGGTACGG GCCGGTGCGT GGTGGAGTAT GGCGGGAAGA CCGTGATGGA CATTGAGCTT GCGTTCCTCC ATGATGGCCT GCCCAAAAAG CATCTTGTCA CCCGGCAGGC GCCGGTGGCA GAGCCTGCTG TCGGGCCCCT CCCCCCCTGC CCGGATCAGC TCGGTGATAC GCTGGCGGCC ATGATCGGGC GCCGGAATAT CTGCTCTACC GAATTTGTAA CGATCCAGTA CGACCATACA GTGCAGGGTG GCCATGTGCT TGGCCCCGTT CAGGGAAAAG GCCGGGTACA ATCGGTGGCC ACGCTCACCA AAGTCGTGCC CGGTTCGAAG AGGGCAGTCG GGCTCTCGCA GGCACTCTTT CCGTCCTATG CAGAGCTCGA CCCGTACCGT ATGGCCGGTG CTGCCATCGA TCTTGCGATA CGGGGACTGG TCGCAACCGG GATCTCCCCT GAGTCGATTG CGCTCCTCGA CAATTTCTGC TGGTGCTCAT CCGATGAGCC GGAGCGCCTG TGGCAGCTTA AGGAGGCGGC CCGCGGATGC TATGATTATG CCAAAGCCTT CGGGACACCA TTCATCTCCG GCAAGGACAG CATGTTCAAT GATTTCTCGG GATACGATGC GGAGAACAAC CGGGTGAAGA TCTCTGTCCC TCCCACCCTG CTCATCTCCT CCATAGGTAT CCACAGCGAT GTGACCAAAG CGGTCTCGCT TGATGCCAAA ATATCCGGGG ATCTTGTGTA CGTGATTGGT ACTACTGAGG AGGAGCTTGC CGGGTCGGAG TACTTCGCCT TCCTTGGCGC CGGCGGGAGC ACGGTGCCCT GTCTCGATGC AGCGGCGGCA CGGGTACGGT ACCTCAGACT TTCTGATGCT ATTGCCCATG AACTCGTGTC CTCGGCATAT CCCGTTGGCC ACGGGGGGAT TGGCGTGGCC CTTGCAAAGG TGGCTATCGC CGGCCGGCTC GGTATGGACA TTACCCTTCC CGGTGCCCTG CGCCCGGATA TCTTCCTCTT CTCTGAAACG CTTGGCCGTT TCGTGGTCAC GGTGGCACCG GATCACAAGC GTGCGTTCGA GCAGGCGCTC GGTGCCGATG CACTGTTGCT GGGAACAACC GGGGGAAGCC GGCTCCGGGT CACCGGCCGG ACGTTCCTCT TCGAACAGGA GATCGCTACA CTGGAAAAGG CGTACAAGTC ATCATTTGGG GGGTTTTGA
|
Protein sequence | MASFVHRIEV HYSKDPREKT RTERIRALGF GIDVLQLVDV YTIATVKRDL LPAELDEIGA RLTNPVVQDF VVDEATHALF DYAIEIGFLP GVTDNVGTTA RQMVEDYLGI TFGEGEMVSF SSLYLVRGRL SPEALAQLPA TLANPLINRV STKTRQEYGE TGMDRIIPEV RLHEIPAAGI VDLSLPDEDL LRLGKEGISD PQTGQRRGPL ALDLAQLHAI RNYFDRAGRK PTDVEIESLA QTWSEHCKHT IFASAMDDDV PKGLYKSFIQ AATNEIRAEK GDRDICVTVF TDNSGAIVFD DEYLVTHKVE THNSPSALDP FGGALTGIVG VNRDTIGFGL GAKPCINVYG FCVGDPASEP ALYRGRDRKN PIISPRQILD GVVRGVGVGG NCSGIPTPQG FCWFDNRYAG KPLVFAGTVG VMPREHNGQN LFEKKAEPGD LIVMMGGRVG KDGIHGATFS SEALDPNSPV TAVQIGDPIT QKKFSDVLVK EARDLGLYRS ITDNGAGGLS CSVAEMAKEC NGCHVELDRV PLKYPGMAPW EIWISESQER MTLAVSPEKS AELIDLCARR GVEATLIGKF TGTGRCVVEY GGKTVMDIEL AFLHDGLPKK HLVTRQAPVA EPAVGPLPPC PDQLGDTLAA MIGRRNICST EFVTIQYDHT VQGGHVLGPV QGKGRVQSVA TLTKVVPGSK RAVGLSQALF PSYAELDPYR MAGAAIDLAI RGLVATGISP ESIALLDNFC WCSSDEPERL WQLKEAARGC YDYAKAFGTP FISGKDSMFN DFSGYDAENN RVKISVPPTL LISSIGIHSD VTKAVSLDAK ISGDLVYVIG TTEEELAGSE YFAFLGAGGS TVPCLDAAAA RVRYLRLSDA IAHELVSSAY PVGHGGIGVA LAKVAIAGRL GMDITLPGAL RPDIFLFSET LGRFVVTVAP DHKRAFEQAL GADALLLGTT GGSRLRVTGR TFLFEQEIAT LEKAYKSSFG GF
|
| |