Gene Information |
Locus tag | Mboo_1552 |
Symbol | |
ID | 5411239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 1621848 |
End bp | 1622984 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640868786 |
Product | CBS domain-containing protein |
Protein accession | YP_001404712 |
Protein GI | 154151094 |
COG category | [K] Transcription; [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases; [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.651546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGCT CCTATAAGAT TGGACGGCTC TTTGGCATTC CTGTTCTGAT CCATTTTTCG TTTCTGATTG TTATCCCCCT TTTAGCGTGG ATCATCGGAA TCCAGATCGC ACTTACCACA ACTACGATCC AGTATATTTT CCAGGTTCCC ATAGATACCA CCCTGATCAC GGCGGGATTC ATGCCATGGA TCCTCGGCAC CATCGTCGCA CTCGGGCTTT TTTTGGGAGT TCTGGTCCAC GAAATCGCCC ATTGCATTGT TGCCAGGAAA AAGGGTGTGA GGATCCAGAG TATCACCCTG CTCATGTTCG GTGGTGTCTC CCGGATGGAG GAAGAAGGAG TACCGGATCC GAAAGTTGAG CTCCCCATGG CACTGGTCGG GCCATTCACC AGCCTTCTCT TCGGTCTCGT ATGCGCCGGT CTCGTCTACC TTGTCCCGGG AATGACCCCT TATCCCGCGA TCGCCGGCAT CCTCATTTTT ATCTTCGGGT ACGTTGCCGT GCTTAACATC CTTCTCTTTG CATTTAACCT GATCCCGGCA TTTCCCATGG ATGGCGGCAG GGTACTCCGG GCGGCACTGG CACAACGAAT GCCCGTCCAC AAGGCGACCC GGATTGCTGC CAATATTGGT AAGGGGTTTG CCATTATTTT TGGCATCATC GGCCTGTTAT TCTTCAACCC GTTCCTGATC CTTATCGCAC TTTTTGTCTA CATCGGGGCA GGCTCGGAAG CAACCATGGA CCAGTTCACC TACCTGTTGC ACAACGTTAC GGCAGAGAGC ACGATGAGTT CTCCGGTAAC GTCCGTGACA CCGGCCCTCT CACTTTCAAA GGTGGCAGAG ATGATGCTTT CCACCAAGCA CCTAGGTTTC CCGGTTGTCG AACATGACAA ACTTGTAGGA ATGATCACGC TTGTGGATGT GAACCGTATC TCACCGGCCG ATCGCGAGGC AAAGCAGGTC CGCGACATCA TGACCCGCGA TCCGGTTACC CTTCCGCCAT CTGCACCGGT CATGGACGCC TTAAGAATCA TGTCAGCCCG CAATATCGGG AGGATCCCCA TAGCTCAGGA CGGCAGGATC ATCGGTATTG TCACCCGTTC CGATATCCTG AAAGTGGCCG AGCTCAAAAA GGCATGA
|
Protein sequence | MNGSYKIGRL FGIPVLIHFS FLIVIPLLAW IIGIQIALTT TTIQYIFQVP IDTTLITAGF MPWILGTIVA LGLFLGVLVH EIAHCIVARK KGVRIQSITL LMFGGVSRME EEGVPDPKVE LPMALVGPFT SLLFGLVCAG LVYLVPGMTP YPAIAGILIF IFGYVAVLNI LLFAFNLIPA FPMDGGRVLR AALAQRMPVH KATRIAANIG KGFAIIFGII GLLFFNPFLI LIALFVYIGA GSEATMDQFT YLLHNVTAES TMSSPVTSVT PALSLSKVAE MMLSTKHLGF PVVEHDKLVG MITLVDVNRI SPADREAKQV RDIMTRDPVT LPPSAPVMDA LRIMSARNIG RIPIAQDGRI IGIVTRSDIL KVAELKKA
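
The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only: the function names are hypothetical and not part of any database API, and it assumes 1-based inclusive coordinates, an unspliced CDS ending in a single stop codon, and a sequence pasted in the space-separated 10-base blocks used above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1621848, 1622984)  # Start bp, End bp from the record
    print(length)                  # 1137
    print(protein_length(length))  # 378
```

Passing the Gene sequence string from the record to `gc_percent` gives a value that can be compared with the listed GC content. The result is expected to differ slightly from the table if the database rounds the figure.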