Gene Mthe_0318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0318 
Symbol 
ID4463296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp318239 
End bp319351 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content52% 
IMG OID639699323 
ProductCBS domain-containing protein 
Protein accessionYP_842753 
Protein GI116753635 
COG category[K] Transcription
[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.147887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACT CCATCCAGCT CGGCAGGGTC ATGGGCATAC CGATACGGCT TCATGTGACA 
TTTCTTCTGA TCATCCCATG GGTTGCATAT CTCTTTGGAA GCGTCAGCGC CACTGTTTTC
GGCAAGCTCT ACGGCTTCGG CGCGGTCGAG CCCCCTCTAG TCAGATGGAT TTACTCCCTC
CTCTTCGCGG TGTTGCTCTT CATATGCGTG GGTCTTCACG AGCTCGGCCA CTCGTACGTT
GCAAAAAGGT ATGGAATAGA GATAAGAAGC ATCACCCTCT ACTTCTTCGG CGGCGTCGCC
TCGATGGAGG AGATCCCCAG GAACCCATCG ATGGAGCTCA GGATGGCGAT AGCCGGACCT
GCTGTCAGCG CGGCTCTCGG CGTAATGTCG ATACTTCTTT ACACACAATC GGAATCGATT
TTGGGAGAAG GCCATCCCTT CTCGATACTC CTCTGGACTC TGGGCATAAT GAATATAATT
CTCATGATAT TCAACCTCAT CCCCGCCTTC CCCATGGACG GCGGGCGGGT GCTCAGGGCA
TGGTTCTCCA CAAGGATGCC GTATGTGGTT GCAACAAAGA ACGCAGCCGC TCTTGGAAAG
ATCTTTGCTG TGTTCCTCAT ATTTCTCGGA CTCTTCACGC TGAACTTTCT CACGCTGATC
ATAGGTATAT TCCTATACAT AGCTGCTTCT GAGGAGGACA GGAGCACCAC AATAGAAGAC
AGCCTGCGGG GCATAAAGGT GAGGCACATA ATGTCTAAGG ATGTGCGGGT TGTGCCTCCG
GAGATGACTC TCGCGGAGCT GATGCGGCTG ATGTTTTATG AGAAACACAG GGGATATCCT
GTGATGGTCA ACGATGAGCT TGTGGGAATA GTGACGATCA CAGATCTGCA GCGTGTTCCT
GAGCATCTGC GCGAGACAAC CCGTGTCGGA GATGTCATGA CCAGAAACAT ATATGTCATA
GGGCCGGATG ATGAGGCGAC CGCGGCCATA AAGATCATGG GCGATAAGAA GATAAGAAGG
CTCCCCGTCA TCGAGGATGG CAGGCTGGTG GGTATAATAT CAAGAGAGGA TCTCCTCAGG
GCCATCGAGC TGTGCTCGGA TGTGAGGCTG TAA
 
Protein sequence
MENSIQLGRV MGIPIRLHVT FLLIIPWVAY LFGSVSATVF GKLYGFGAVE PPLVRWIYSL 
LFAVLLFICV GLHELGHSYV AKRYGIEIRS ITLYFFGGVA SMEEIPRNPS MELRMAIAGP
AVSAALGVMS ILLYTQSESI LGEGHPFSIL LWTLGIMNII LMIFNLIPAF PMDGGRVLRA
WFSTRMPYVV ATKNAAALGK IFAVFLIFLG LFTLNFLTLI IGIFLYIAAS EEDRSTTIED
SLRGIKVRHI MSKDVRVVPP EMTLAELMRL MFYEKHRGYP VMVNDELVGI VTITDLQRVP
EHLRETTRVG DVMTRNIYVI GPDDEATAAI KIMGDKKIRR LPVIEDGRLV GIISREDLLR
AIELCSDVRL