Gene Information |
Locus tag | Mboo_2159 |
Symbol | |
ID | 5410135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 2230466 |
End bp | 2231599 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640869404 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_001405316 |
Protein GI | 154151698 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
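The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2230466, 2231599)
print(length)                     # 1134
print(protein_length_aa(length))  # 377
```

Both results match the Gene Length (1134 bp) and Protein Length (377 aa) rows of the record.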
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.188732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTT TGTTTGTTGT ATGCGGGGAG GGTCTGGGCC ACACATCCCG GTGTATCCAT CTCGGCCATT ACCTGGAGCA GCAGGGTCAT TCGGTCAGTT TCCTGGCATA CGGGAAATCC TACGACTTTT TCCGGGACCA TGGATGTACC CGGGTTTACC GTGGGGAACG CGAGGTCTGC CTCGAAGGTG AGAATGGCTT TTTCTCCTTA AAAAAGACCC TTTGGTGCTC GCGCTGGATC GTCATAAACA TGGTCCGGTC CGGGCTGCGC GTCAGACGTT TGATCCGGGA ACAGCAGATC GACTGCGTTG TTTGTGACAC GATGTACGCA GGTGTCCTTG CTGCACGGTT TTGCCGGGTA CCGGTGATCT TTATTACCAA CCAGAACCGG TTCAGCGGCC CGGGAGGGGC GAAGAACCCG GTCTGGAGCG TGCTCAACTT CCTGATCCGA CGCTACCTTA AACTCGCCGA TGCCGTTATC ATTCCCGACT ACCCTCCACC GGATTCCGTG AGCGAGTACA ATCTCCTCAT TCCGGAGAAA GAGAAACCGC ACTATCATTT CACCGGCCCG TTTCTGGAGA TCGATCTCAA CCGGTACCAA TTCTCGCAGG AGACGATCTT TACCAGTTTC GGGGGAGAGC CCTACAAACT CCCCCTGTAC CGGTTGCTGC GGACGATCGC GGACAAACGA AAAGACCTGA TGTTCGATGT TTTCTATACA GGTGCAACCC TCCCGGGATC CTCGGATAAT TTCCTTTCCC ATGGGTATGT GCCTAATATC TATGAACATC TCGCCGAGGC CCGGATTGCC ATCGTGCATG GCGGATTGAC CACCCTCCAC GAAGCGTTGC TCTTCAATAA GCCCGTCCTC ATCATCATGG ACCCGGGCCA TCCTGAACAG CAGAACAACG CACAAAAAAT CGTTGACCTG GGAGCAGGGA CGGTAGTGGA TGGCAGGACA GTCACTCTTG AAATTCTCGA ACAAAAGATT GCAGAGACTC TCTCCCTTCC CCTCCGGTCT GGCGGTCGCG ATCTGGCTGC GGTAAATGGC AGAAAAAATG CCGCAGCAGT TATTGAGGCG TCTTTCGGTG CTGCTGCAAA CAGGAACATT AAGGTTTCAT CCTCCAAACC CTAA
|
Protein sequence | MRILFVVCGE GLGHTSRCIH LGHYLEQQGH SVSFLAYGKS YDFFRDHGCT RVYRGEREVC LEGENGFFSL KKTLWCSRWI VINMVRSGLR VRRLIREQQI DCVVCDTMYA GVLAARFCRV PVIFITNQNR FSGPGGAKNP VWSVLNFLIR RYLKLADAVI IPDYPPPDSV SEYNLLIPEK EKPHYHFTGP FLEIDLNRYQ FSQETIFTSF GGEPYKLPLY RLLRTIADKR KDLMFDVFYT GATLPGSSDN FLSHGYVPNI YEHLAEARIA IVHGGLTTLH EALLFNKPVL IIMDPGHPEQ QNNAQKIVDL GAGTVVDGRT VTLEILEQKI AETLSLPLRS GGRDLAAVNG RKNAAAVIEA SFGAAANRNI KVSSSKP
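The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, translate the gene sequence, and compute GC content. It is an illustration under stated assumptions: the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for all non-start codons.

```python
from itertools import product

# Standard genetic code in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCGGATTT TGTTTGTTGT ATGCGGGGAG"
print(translate(first_blocks))  # MRILFVVCGE
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same kind of check against the Protein Length and GC content rows.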