Gene Mthe_1113 details


Gene Information

Locus tag: Mthe_1113
Symbol:
ID: 4463388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1200479
End bp: 1201777
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 58%
IMG OID: 639700130
Product: dihydroorotase
Protein accession: YP_843536
Protein GI: 116754418
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
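
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 1200479
END_BP = 1201777


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1299 432
```

Under these assumptions the computed values agree with the reported Gene Length (1299 bp) and Protein Length (432 aa).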


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGACC TTCTGGTAAA AGACGGCAGG GTTTACACTG GTGGCAGGCT GCTGAACACG 
GACATATGGA TCAAAAACGG AAGGATCGCA GCACTCGGTG GATACAACAC AGCCGCAGAG
AGAATCGACG CCAGCGGCAT GATCATCATA CCTGGGGTTA TCGACATGCA CGTCCACTTC
AGGGATCCTG GGTACACGCA CAAGGAGGAC TGGGAGAGCG GCTCGATATC CGCGGCTGCT
GGCGGTGTGA CGACAGTGGT CGACCAGCCC AACACAGATC CTCCGGTCAT GGACGCGGAG
TCGTATAAAG AGAAGCTGAA TCTGGCGAAG CGAAGGTCGA TCGTGGACTT CTGCCTCAAC
GGCGGTCCGG GCGATATCGA ATCACTCCTC AGAGAGGGGG CGGCTGCGAT CGGCGAGATC
TTCATGTACG AGATGAGCGA GGAACGTCTA GCCAGAATTT TAAAGGAGGT CGAGCGGCTC
GACGTGCTCG CGACTGTGCA CGCAGAGGAT GGGGAGGTGA TACGGAGATA CTCGGAGCCG
CTTGGAGGGA TCTGCGATCC CGATGTCCAT TCAAGAGCGA GACCGCCGAT AGCTGAGGTC
TCTGCGATCG ACCGGGCTCT GAGCGTATCC AGATGCAGGA TTCATATCTG TCACATCTCG
ACGGCTGATG GACTGGAGCT CGTAAAGAGG AGAAGGAACA GGAAGGTGAG CTGCGAGGTC
GCTCCGCACC ACCTTTTCCT GAGCAGGAGG GATTACAGGA GGCTCGGCAC ATTCCTCAAG
ACGAACCCGC CTCTGCGCAA CACAGCTGAC TGCGATGCCC TCTGGGACGG CCTGAGGCGG
AGGGATATAG ATGTCATCGC ATCAGACCAC GCACCCCATC TCCCTGAGGA GAAGAGGGAT
GATATCTGGC ATGCGCCCCC CGGCGTCCCG GGCGTGGAGA CGATGCTCCC CCTCATGCTA
TACGCTGTGA AGAGCAACAT GATAACCCTG GAGAGGGTTG TTGACGCGCT CTCAGCAAGG
CCAGCATCCA TACTTGGATT GAGATCCAAG GGGGAGATAG CCATCGGAAA AGATGCGGAT
CTGGTTATCT TCGATCCGAA AAGGCAGGAA CGAATCGATG TTCAGCGGCT CCACAGCAGG
GCGGACTGGA CACCGTATGA AAGGAAGAAG GCGATCTTTC CGGTGATGAC CCTTGTGAGG
GGCAGCGTCG TGTTCGATGG CGATATCGAG GTGAGCCCCG GCTACGGCAG GAATATCGAG
ATGCGCCAGG AGACGCGCAC GGAGGCGATC TCTGATTAG
 
Protein sequence
MMDLLVKDGR VYTGGRLLNT DIWIKNGRIA ALGGYNTAAE RIDASGMIII PGVIDMHVHF 
RDPGYTHKED WESGSISAAA GGVTTVVDQP NTDPPVMDAE SYKEKLNLAK RRSIVDFCLN
GGPGDIESLL REGAAAIGEI FMYEMSEERL ARILKEVERL DVLATVHAED GEVIRRYSEP
LGGICDPDVH SRARPPIAEV SAIDRALSVS RCRIHICHIS TADGLELVKR RRNRKVSCEV
APHHLFLSRR DYRRLGTFLK TNPPLRNTAD CDALWDGLRR RDIDVIASDH APHLPEEKRD
DIWHAPPGVP GVETMLPLML YAVKSNMITL ERVVDALSAR PASILGLRSK GEIAIGKDAD
LVIFDPKRQE RIDVQRLHSR ADWTPYERKK AIFPVMTLVR GSVVFDGDIE VSPGYGRNIE
MRQETRTEAI SD
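
The relationship between the gene sequence, the GC content field and the protein sequence can be checked with a short script. The following is a minimal sketch, not the method used by the source database: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to elongation codons as the standard genetic code, so a standard codon table is used. Only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

# Standard codon table built in TCAG order; assumed adequate for
# translation table 11 when translating internal (elongation) codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGATGGACC TTCTGGTAAA AGACGGCAGG")
print(translate(first_blocks))  # MMDLLVKDGR
```

The translated fragment matches the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.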