Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0194 |
Symbol | |
ID | 5411519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 185914 |
End bp | 187137 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640867409 |
Product | hypothetical protein |
Protein accession | YP_001403360 |
Protein GI | 154149742 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG GCATTGATGA CACGGACTCT CCCACGGGAA TGTGTACCAC CTACCTCGGC GCAGTGCTTG CCCGGCGGCT GATCCGCGAG CACATGAAGG TCCGGGAGGC CCGTCTGGTC CGACTTAATC CGAACGTGAC CTTCAAGACC CGGGGAAACG CAGCAATCGC TCTCGATGTG GATGGCGATC CGGCACGGGG ATTTGAGCTC GCGTGCGGGA TCGTAGAAGA ACTTGCCGAC TTTTCCTGCG ACAAGACCAA CCCCGGAGTG GTTGTGGCGG GGGAGCGGCT CGATCCGGCA TTCTACCGGA AAGCGGTTAC GGACTTCTGC GAGATCGAAG AGGCCACAGA GCTGCTGGAG CGGTCAGGGG CCCGGTACCG GGGCTGGAAG AACCGGCGAG GTCTAGTGGG TGCAACTGCT GCCGTTGCAA GCGTTCTTCC TGACAAAACC TACGAGATCC TTGCGTACCG TGAGCCGGCC CACTGGGGTA CCCCACGGGA AGTAGATCGC GAGAGCCTCT TTGTCGCAGA GGAAGCAACC TTCCCACACA CATGGGATAC CGTGGATCTC GCAAACCGGG TCGTGGTCTG CGTGCCGCAC ACACCGGACC CGGTCCTTTT TGGGATCCGG GGCGAGAGCC CGGCCTGGGT AATGACGGGC CGCTCGCTCA TCGAATCCGA GAAGCCGGCG CTCGAACAGA TCTGGGTGAC CAACCAGGGG ACCGATTCCC ATCTCGTTCC CGGGACCTGC GGCTCATTGC GCGAGGGGCT CTCGTACCGG GTGCGGGGGG TGGTCACTGG CCGGCCCGTG ACCGGAGAAG GAGGGCACGT CTCGTTTGCA ATCCGGGACA ATGAAGCGGA GATCCGGTGC ATGGCATATG AGCCCACGAA AAATTTCCGG GAAATAATCC GGGCTCTCGT GCCGGGCGAT GAGATCATCG CGGCAGGCAG TTACAAAAAG GGGAGCATCA ACCTGGAAAA GATCTGCGTC CTCTCTCTTG CACGGGATCT GCGCCATAAG GCACCGTTCT GTACCGTGTG CAAAAAGAGA ATGACAAGCG ACGGGAAAGG CAAGGGATAC AAATGCCGCA GGTGCGGTGC ACACGAACTG GAGCCGGAGG AACAGGAAAT CCCCCGGACA ATCAGGACCG GGTGGTACGA GGTGCCTCCG ACTGCACGAA GACACCTGGC AAAACCGCTC TGCCGGGGTA TGCCGGATAG GTGA
|
Protein sequence | MLIGIDDTDS PTGMCTTYLG AVLARRLIRE HMKVREARLV RLNPNVTFKT RGNAAIALDV DGDPARGFEL ACGIVEELAD FSCDKTNPGV VVAGERLDPA FYRKAVTDFC EIEEATELLE RSGARYRGWK NRRGLVGATA AVASVLPDKT YEILAYREPA HWGTPREVDR ESLFVAEEAT FPHTWDTVDL ANRVVVCVPH TPDPVLFGIR GESPAWVMTG RSLIESEKPA LEQIWVTNQG TDSHLVPGTC GSLREGLSYR VRGVVTGRPV TGEGGHVSFA IRDNEAEIRC MAYEPTKNFR EIIRALVPGD EIIAAGSYKK GSINLEKICV LSLARDLRHK APFCTVCKKR MTSDGKGKGY KCRRCGAHEL EPEEQEIPRT IRTGWYEVPP TARRHLAKPL CRGMPDR
|
| |