Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1201 |
Symbol | |
ID | 5411349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1215556 |
End bp | 1216593 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640868427 |
Product | peptidase M42 family protein |
Protein accession | YP_001404362 |
Protein GI | 154150744 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.214071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0232682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAGG AATTATTACG GAAATTATCC AATGCCCACG GGGTATCGGG AAGTGAAGGC AGTGTCTTTG CCTTAATAAA AAAAGAACTC AAAGGCTGTG TTGATGAGAT CACCGAGGAT CCTATGGGAA ATCTCATTGC AGTCAGGCAT GGTAACAAGT CAAAGGTGAT GCTTGCCGCC CATATGGACG AGATCGGGCT CATGGTCAAG TATATCGATG ACAAGGGCTT TCTCCGGTTC ATCACTCTTG GCGGGTGGTA CGGGCCAACA CTCTATAACC AGCGCGTGAT CGTCCATGGG ACAAAAGGTG ACCGGATCGG TGTCATCGGT GGCAAGCCCC CGCACATGAT GGACGAAGAT GAACGTAAGA AAGGGGTGAA AACCGACGAC ATGTTCATCG ATATCGGGGC AAAAAACAAG GACGATGTTG CGGAGCTGGG TATTGAGGTT GGCACGCCGG TAACGATTGA TCGCGAATTT ACCGAACTCG CGAATAACCG CGTTACCGGT AAGGCATTTG ATAACCGGGC CGGTGTTGCA ATGCTTATTA AGACCATGCA GAAAATGAAG TCCCCGTTCA CGGTTTACGC CGTCTTTACC GTGCAGGAAG AAGTGGGTCT CAAAGGGGCA AAGACCAGCG CCTATACAAT CGATCCCGAC TGGGCCATTG CTACGGATGT TACCATTCCC GGCGATCACC CGGGGATTGA TATGAAGGAT GCAGCAGTAG AAATGGGAAA AGGGCCGGTT ATCACCATTG TCGACAGCAG CGGAAGAGGA CTTATTGCCA GCCGCAAGGT TGTCCAGTGG CTGAAAAATG CTGCTGAAAC AAACGCGATT CCCGTGCAGT ACGAGGTTGG TACCGGGGGG ACAACCGATG CAACCTCTAT TCACCTGTCC CGCGGCGGCG TGCCAAGTAC CACTATCAGC CCGCCCACAC GGTACATCCA CTCCCCGGTG GAAGTCCTGG ATACAGGGGA TATAGAAGCC GGTGTTAACC TGCTTGTTGC AGCGCTGAAG ACAAAGCCTG CGCTTTAA
|
Protein sequence | MVKELLRKLS NAHGVSGSEG SVFALIKKEL KGCVDEITED PMGNLIAVRH GNKSKVMLAA HMDEIGLMVK YIDDKGFLRF ITLGGWYGPT LYNQRVIVHG TKGDRIGVIG GKPPHMMDED ERKKGVKTDD MFIDIGAKNK DDVAELGIEV GTPVTIDREF TELANNRVTG KAFDNRAGVA MLIKTMQKMK SPFTVYAVFT VQEEVGLKGA KTSAYTIDPD WAIATDVTIP GDHPGIDMKD AAVEMGKGPV ITIVDSSGRG LIASRKVVQW LKNAAETNAI PVQYEVGTGG TTDATSIHLS RGGVPSTTIS PPTRYIHSPV EVLDTGDIEA GVNLLVAALK TKPAL
|
| |