Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1000 |
Symbol | |
ID | 5411677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 979638 |
End bp | 981098 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640868226 |
Product | TPR repeat-containing protein |
Protein accession | YP_001404161 |
Protein GI | 154150543 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.326539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.768066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTCA TCGATTCAAT CCTGAAAAGC AGCGGAGTAA ATGCAGAGAC CGAGTTTTGC AAGGCCGAGA CCCTCTGCCG GCAGGGATAC TACACCGATG CGGTAAATAT CCTCGATAAA GTACTTGCCG CGGAGCCCAA TCATCTCAGG GCCTCGCAGC TCAAGGGCTT TGCCCTTTAC CAGATGGGAA CTTTTGAAGA AGCACTCCAG TATTTCGACA AGGCGCTGGG CATTGATGCA AACCTTCCCG ATGCGCTGGT ATACAAGGGA CTCATCTACT CCGGTTTTGG GAAACATGCC CACGCACTTG ATCTCTATGA CCGGGCACTT GCGATCCATC CCGGCTTTAT CCAGGCCTGG TATGCAAAAG GACTCACCCT TGCCATCCTT GAACGGTACG ACGAGGCGAT CCAGTCGTAC GACCGGGTGC TCGTCCTCCA GCCAAAGCAC GTGGATGCCC TGATCGGGAT AAGCGTTGCC CGTAAGAAAA AAGGAGCCGG GCCAAAAGAG AACACAATCC TCCAGCATCC CAAAACCAAC CTCCCGGAAA AATCCCGTCC TGCCCCTGCT GCCCCGATCG CAGCATCACC CGCACAAACC CAGAAACCCC TTGCTCCAAA ACCTGCGCCG GTACTGGTTC AAAAGCCTCC TGAACCGGTC GCGATCCCGG CACACCCCAA ACTATCTTCC CGGCAGGATC CTGCACCGGC GCTGGTACCG GCGGAATCAC GAATAGTTCC AAACTCCTCA CCGGCCGCGG TGCTGAATGT GCGACAGCCC GCTGCGGTAC CAGCCCATAA TGCATCCCGG GCCATGCCAA AAATGCCGCC GGTACCTGCC ACGGTTCCGG CACAGCCACG CACCGCACCG GAAGCACCGG CCACACACCG TGGATTCCCG GAAGAAAACC TTCTTGAACC GGAACCATCA TCACCGGCAT CTCCCCGATG CAGCACCTAC GAGGAGATGA TCCGGGAGAT CGCTGCAAAC CCGGAGAAGG TACCGGGCCC GGACCGCTGG CTTCTTCTGG GTAACCTCTC CATGAAACTC GGGAAGTTCC GGGACGCAGC CGGTATGTTT GAGCATTACC TTGGACTTGT CCAGAATGAT GCCGATGCAT GGCGGGCATT AGGGGATGCA CATAAAAAAT GCGGCCTCTA TGACGAGGCC CGTGAGGCCT ATGACCACGC ACTTGCACTG AACCCGGAGA CAGCGGCCGT CTGGATCAGC CACGCAAAAG TGCTGGTGAT GCTGCGGGAT CATGAGGGTG CTCTTGTCTC CTGCGATCAG GCCATCTCGC AGGACGGGGA ATATATCGAA GCATGGCTGT ATAAGGGTTT CATCCTCAAA AAAATACACC GGAACGACGA TGCAATGGCT GCGTACGATC GCGTACTCAT GCTTAACCCG GGCCATGATC ACGCAGTCCG GGAACTGCGG CGCATGAAGG GCGGGGCGTA A
|
Protein sequence | MGLIDSILKS SGVNAETEFC KAETLCRQGY YTDAVNILDK VLAAEPNHLR ASQLKGFALY QMGTFEEALQ YFDKALGIDA NLPDALVYKG LIYSGFGKHA HALDLYDRAL AIHPGFIQAW YAKGLTLAIL ERYDEAIQSY DRVLVLQPKH VDALIGISVA RKKKGAGPKE NTILQHPKTN LPEKSRPAPA APIAASPAQT QKPLAPKPAP VLVQKPPEPV AIPAHPKLSS RQDPAPALVP AESRIVPNSS PAAVLNVRQP AAVPAHNASR AMPKMPPVPA TVPAQPRTAP EAPATHRGFP EENLLEPEPS SPASPRCSTY EEMIREIAAN PEKVPGPDRW LLLGNLSMKL GKFRDAAGMF EHYLGLVQND ADAWRALGDA HKKCGLYDEA REAYDHALAL NPETAAVWIS HAKVLVMLRD HEGALVSCDQ AISQDGEYIE AWLYKGFILK KIHRNDDAMA AYDRVLMLNP GHDHAVRELR RMKGGA
|
| |