Gene Information |
Locus tag | Mthe_0318 |
Symbol | |
ID | 4463296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 318239 |
End bp | 319351 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639699323 |
Product | CBS domain-containing protein |
Protein accession | YP_842753 |
Protein GI | 116753635 |
COG category | [K] Transcription; [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases; [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
|
|
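The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 318239
end_bp = 319351

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1113 370
```

Under these assumptions the result agrees with the Gene Length (1113 bp) and Protein Length (370 aa) fields.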
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.147887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACT CCATCCAGCT CGGCAGGGTC ATGGGCATAC CGATACGGCT TCATGTGACA TTTCTTCTGA TCATCCCATG GGTTGCATAT CTCTTTGGAA GCGTCAGCGC CACTGTTTTC GGCAAGCTCT ACGGCTTCGG CGCGGTCGAG CCCCCTCTAG TCAGATGGAT TTACTCCCTC CTCTTCGCGG TGTTGCTCTT CATATGCGTG GGTCTTCACG AGCTCGGCCA CTCGTACGTT GCAAAAAGGT ATGGAATAGA GATAAGAAGC ATCACCCTCT ACTTCTTCGG CGGCGTCGCC TCGATGGAGG AGATCCCCAG GAACCCATCG ATGGAGCTCA GGATGGCGAT AGCCGGACCT GCTGTCAGCG CGGCTCTCGG CGTAATGTCG ATACTTCTTT ACACACAATC GGAATCGATT TTGGGAGAAG GCCATCCCTT CTCGATACTC CTCTGGACTC TGGGCATAAT GAATATAATT CTCATGATAT TCAACCTCAT CCCCGCCTTC CCCATGGACG GCGGGCGGGT GCTCAGGGCA TGGTTCTCCA CAAGGATGCC GTATGTGGTT GCAACAAAGA ACGCAGCCGC TCTTGGAAAG ATCTTTGCTG TGTTCCTCAT ATTTCTCGGA CTCTTCACGC TGAACTTTCT CACGCTGATC ATAGGTATAT TCCTATACAT AGCTGCTTCT GAGGAGGACA GGAGCACCAC AATAGAAGAC AGCCTGCGGG GCATAAAGGT GAGGCACATA ATGTCTAAGG ATGTGCGGGT TGTGCCTCCG GAGATGACTC TCGCGGAGCT GATGCGGCTG ATGTTTTATG AGAAACACAG GGGATATCCT GTGATGGTCA ACGATGAGCT TGTGGGAATA GTGACGATCA CAGATCTGCA GCGTGTTCCT GAGCATCTGC GCGAGACAAC CCGTGTCGGA GATGTCATGA CCAGAAACAT ATATGTCATA GGGCCGGATG ATGAGGCGAC CGCGGCCATA AAGATCATGG GCGATAAGAA GATAAGAAGG CTCCCCGTCA TCGAGGATGG CAGGCTGGTG GGTATAATAT CAAGAGAGGA TCTCCTCAGG GCCATCGAGC TGTGCTCGGA TGTGAGGCTG TAA
|
Protein sequence | MENSIQLGRV MGIPIRLHVT FLLIIPWVAY LFGSVSATVF GKLYGFGAVE PPLVRWIYSL LFAVLLFICV GLHELGHSYV AKRYGIEIRS ITLYFFGGVA SMEEIPRNPS MELRMAIAGP AVSAALGVMS ILLYTQSESI LGEGHPFSIL LWTLGIMNII LMIFNLIPAF PMDGGRVLRA WFSTRMPYVV ATKNAAALGK IFAVFLIFLG LFTLNFLTLI IGIFLYIAAS EEDRSTTIED SLRGIKVRHI MSKDVRVVPP EMTLAELMRL MFYEKHRGYP VMVNDELVGI VTITDLQRVP EHLRETTRVG DVMTRNIYVI GPDDEATAAI KIMGDKKIRR LPVIEDGRLV GIISREDLLR AIELCSDVRL
|
| |
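The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the start of the gene. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that NCBI translation table 11 uses the same amino-acid assignments as the standard code, differing only in permitted start codons. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(grouped):
    """Remove the whitespace that splits the sequence into blocks."""
    return "".join(grouped.split()).upper()

def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGGAAAACT CCATCCAGCT CGGCAGGGTC")
print(translate(prefix))  # MENSIQLGRV
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.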