Gene Mthe_0901 details


Gene Information

Locus tag: Mthe_0901
Symbol:
ID: 4462365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 978636
End bp: 979847
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 56%
IMG OID: 639699920
Product: nucleotidyl transferase
Protein accession: YP_843329
Protein GI: 116754211
COG category: [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
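
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(978636, 979847)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.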


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCA TGCAGGCGAT AATACTTGCT GCTGGCGAGG GCTCCAGGAT GCGGCCCCTC 
ACCGCGAGCA GGCCGAAGGT CATGCTCCCT GTAGGCGGAG CCCCGCTGCT CGAGGAGCTC
GTACTGAGAT GCAGAGAGGC GGGGATAAAC AGGTTTGTGT TTGTTGTTGG CTATCGCAGA
GATGTGGTAA CATCCTATTT CAAGGATGGC AGCGATTTCG ATGTGGATAT CAGCTACGCG
GTGCAGGAGA AACAGCTGGG CACAGGACAT GCACTAATGA CCGCGAGAGA CCTCTCAGAT
GATCGATTCT TTGTCATAAA CGGAGATGTG CTTCCAGACG TCCAGGCGCT CAGACGTATG
ATCTCAATGG AGGATCTAAG TGTTGCAACG CACAGGGTAG TGGAGGCGAG CCGTTACGGC
GTGTTTCTGC TCAGAGATGG GCTTGTGGAG GGGGTCGTGG AGAAGAGCCC GTCGCCGCCG
TCTGACATGG CAAACGCTGG AATATATCTG CTTGACAGGG AGATCTTCGA GCTCATGGAG
GAGGTGCCTG TCTCAATCAG GGGAGAATAC GAGCTCACCG ATGGAATTAA TGCACTTGCG
TCCGCTGGCA GAAAAATCTG GGCCATTGAG CTCAGCGAGT GGGTTGAGGT TGGCGTTCCC
TGGGATATAC TCACGGCCTC GAATGCTGTG CTCTCGAGAA AGGTCCCTGT CATGGATGGG
GATGTGGAGA GCGGCGCCAC GCTCAAGGGA AACGTATCAA TCGGCAGCGG CACACTGGTG
AGAAATGGCG CCTACATCGA GGGCCCGGTG TGGATCGGGA GGAACTGCGA CATAGGGCCG
AACTGCTACA TTCGCGCAGG ATCATGCATA GGGAACAGCG TGAGGGTCGG AAATGCGGTC
GAGATAAAGA ACTCGACCAT CATGGACGAC ACCAAGATCG GCCATCTATC CTACGTGGGG
GATAGCGTCA TCGGGTATGG CTGCAATCTC GGGGCCGGCA CCATCGTATC GAATCTCAGG
CATGACAACA GAAACATCCG CTCTTACGTC AAGGGCGTGC TTGTGGACAC AGGCAGGAGA
AAGCTTGGTG TTATAATGGG TGATGGCGTT AAGACGGGAG TGCATACCTG CATCTATCCG
GGAACCGTGA TAGAGCCCGG CTATCTCTCG AGGCCGGGCG AGGCCCTCAG GGGATACGTG
AAATCCATAT AA
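
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be compared with the fields of the record. A minimal sketch, assuming the block has been copied into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence in this record.
first_line = "ATGAACAGCA TGCAGGCGAT AATACTTGCT GCTGGCGAGG GCTCCAGGAT GCGGCCCCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.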
 
Protein sequence
MNSMQAIILA AGEGSRMRPL TASRPKVMLP VGGAPLLEEL VLRCREAGIN RFVFVVGYRR 
DVVTSYFKDG SDFDVDISYA VQEKQLGTGH ALMTARDLSD DRFFVINGDV LPDVQALRRM
ISMEDLSVAT HRVVEASRYG VFLLRDGLVE GVVEKSPSPP SDMANAGIYL LDREIFELME
EVPVSIRGEY ELTDGINALA SAGRKIWAIE LSEWVEVGVP WDILTASNAV LSRKVPVMDG
DVESGATLKG NVSIGSGTLV RNGAYIEGPV WIGRNCDIGP NCYIRAGSCI GNSVRVGNAV
EIKNSTIMDD TKIGHLSYVG DSVIGYGCNL GAGTIVSNLR HDNRNIRSYV KGVLVDTGRR
KLGVIMGDGV KTGVHTCIYP GTVIEPGYLS RPGEALRGYV KSI
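
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative rather than a reference implementation. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the alternative start codons it permits, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino acids of the standard genetic code ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence in this record.
print(translate("ATGAACAGCA TGCAGGCGAT AATACTTGCT GCTGGCGAGG GCTCCAGGAT GCGGCCCCTC"))
# MNSMQAIILAAGEGSRMRPL
```

The output matches the first 20 residues of the protein sequence listed above.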