Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_2272 |
Symbol | |
ID | 5411004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 2341032 |
End bp | 2342186 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640869524 |
Product | CBS domain-containing protein |
Protein accession | YP_001405429 |
Protein GI | 154151811 |
COG category | [R] General function prediction only |
COG ID | [COG0517] FOG: CBS domain [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.29511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGAT CTCTTCGAAT CGGGCGGCTT TTTGGGATCC CGATCATGCT CCACTGGACT TTTCTTTTGA TCATTCCGGT TTTTGCCTTC CTTATCGGCA GCCAGATCGG AGCTACCACC GATTTGATCG GGGGGCTCTT TGGCATCGGG ATCGACTCCA CTATCATCTC CGGCGGCTAC ATGCCCTGGA TCCTCGGGAC AATTGTTGCC CTCGGACTCT TTTTCGGAGT GTTTGTCCAC GAGCTCGCCC ATTCGCTTGT GGCCCGGGTA AAAGGAATCC GGATGCAGAG CATCACACTC CTGATGTTCG GGGGCGTTGC CCAGATGGAC GAGGGGGCCC CGGAACCAAG GACTGAGCTG CCCATGGCGC TTGCCGGGCC GCTCACAAGC CTTGTCTTTG GCCTTGCCTG CTGCGGCCTT GTGTACGTCA CCCCGGCACT CACACCTGCA CCCGCAATAC AGGGAGTGCT CATCTTTATT TTCGGGTACG TGGGTGTGCT CAACATCATC CTCTTTGCGT TCAACCTGAT CCCCGCCTTT CCCATGGATG GCGGCCGTGT GCTCCGGGCG GCTCTCGCAA CCCGGATGCC GCTTGATCGG GCAACCCGGA TTGCGGCAAA CGTGGGCAAG GGATTTGCTA TCCTCTTTGG TATCGTCGGG CTCTTTGGTA TCCCCGGGTA CATCGCTCCC TTCGATCCCT TCCTGATCCT GATCGCCCTC TTTGTGTACC TGGGGGCAAG CCTCGAATCA TCGGCTGTGC AGTACAATGT CCTGCTCCGT GATGTGACCG TGGGTGAGAT GATGAGCACC CCGGTGGTCT CGGTTCCCGC GAGCATGCAG CTCGTCAAGG TTGTTGACAT GATGTACGCA AGCAAGCACC TTGGTTTTCC TGTCACCGAG CGCGACACGC TCGTGGGCAT GGTGACTCTT GCGGACGTGA ACCGGACTTC TCCCATCGAC CGCGAGGCAA TGCAGGTAAA AGATGTGATG ACCCGTGAGG TTGTCACCCT GCCCCCCACG GCTTCGGTGA TCGATGCTCT CCGGATCATG TCTGCCCGGA ATATCGGCCG CATCCCGATC CTGCAGGAAG ACCGGATTGT AGGGATCGTG ACCCGGACCG ATATCCTGAA AGTTACCGAA TTAAAAAAGA TCTAA
|
Protein sequence | MDGSLRIGRL FGIPIMLHWT FLLIIPVFAF LIGSQIGATT DLIGGLFGIG IDSTIISGGY MPWILGTIVA LGLFFGVFVH ELAHSLVARV KGIRMQSITL LMFGGVAQMD EGAPEPRTEL PMALAGPLTS LVFGLACCGL VYVTPALTPA PAIQGVLIFI FGYVGVLNII LFAFNLIPAF PMDGGRVLRA ALATRMPLDR ATRIAANVGK GFAILFGIVG LFGIPGYIAP FDPFLILIAL FVYLGASLES SAVQYNVLLR DVTVGEMMST PVVSVPASMQ LVKVVDMMYA SKHLGFPVTE RDTLVGMVTL ADVNRTSPID REAMQVKDVM TREVVTLPPT ASVIDALRIM SARNIGRIPI LQEDRIVGIV TRTDILKVTE LKKI
|
| |