Gene Mthe_1678 details

Gene Information

Locus tag: Mthe_1678
Symbol:
ID: 4463350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1823786
End bp: 1824907
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 55%
IMG OID: 639700696
Product: glycosyl transferase, group 1
Protein accession: YP_844084
Protein GI: 116754966
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
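
The start, end, gene length, and protein length fields above can be checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1823786, 1824907)
print(length, protein_length_aa(length))  # prints "1122 373"
```

Under those assumptions the record is internally consistent: 1824907 - 1823786 + 1 = 1122 bp, and 1122 / 3 - 1 = 373 aa.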


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAC TGGTCATACC GACAACTGAC TGGATAAGAC ATCCGTTTCC GAACAGGTTG 
AACTTCATAT TCGATATAAT AGCGGAGAGG CATGAGGTGC AGGTTTTGCA CTTCGAGCTC
TCGAAGTTCA GAGATAACAG CCCGAGATGG ACGAGATGCT CTCTTTTGAA GGCAGGATCG
TCCAAAGCTG AGGATCCGTC TGTCTACTAC ATCACAAGCG CTTTATCCCA TCTCCGCGTG
ATCCGCGATG CTGCCAGGGA CTCTGATGTG ATTCTATCAG CGAACATCCT CCCGTCCTTC
ATGGCGAACC TCACAGATAC GCCCGTGGTC TTCGACTATC TCGACCATCT GGAGGAATCC
GCATCCATAT ACTACCCGGG CTCGCTCTTC GGCAGGGCTG TCAAGCTCGG CGTCAGGGCG
ATCACCAGGT ACAACCTCAG GCACGCCAGG GCTGTGATAA CCGTGACCCA GGAGCTCAAA
GAGTACCTCA GAAACATCGG CGTTCGTGAT GTGGAGATCA TTCCGAACGG CGTGGACACG
AGCCTTCTGA AACCTATTGA TGCTGGAGAG GCAAAGATCG CTCTCGGTCT TGAGGGGGAT
GTGATCGGTT ACGTCGGATC GCTGGAGTAC TGGGTCGATC TCGAGACCGT TGTGAGCGCT
CTGCCAGATC TCGATGTCAC ACTCCTCGTT GTGGGCCCGA GCCTGTTCAC GGATTACGGC
GAGCGCATAA AGGATATGGC TGAGCGGCTC GGCGTTGGAG AGAGGGTGAT CTTCACGGGA
GCTGTGCCGT ACGCGGAGCT CGGCAGGTAC ATATCTGCGA TGGACATAGG CCTCAACCCC
CTGAGAATGA TGAAGAAGAA CGAGTATGCT GCTGGAGGGA AGATCTTCAA CTACCTCGCA
TGCGGCAGGC CTGTTCTCAC CACAAGAATG CTCTCGCTCG AGCGGCTTCT CGGGGACAGC
CTGTACTACT ATGATGACAG GGAGAGCTTC ATATCGCAGG TGAAGCGTAT CCTGGAGAGC
CCGCAGGATC AGAGAAGATA CAGGGAGATC GCTGAGAGGT ATGACTGGCG CGCTCTGGCA
GCCAGGTACG AGAGCGTTCT GAGGAGGGCT GCAGAAGATT GA
 
Protein sequence
MKILVIPTTD WIRHPFPNRL NFIFDIIAER HEVQVLHFEL SKFRDNSPRW TRCSLLKAGS 
SKAEDPSVYY ITSALSHLRV IRDAARDSDV ILSANILPSF MANLTDTPVV FDYLDHLEES
ASIYYPGSLF GRAVKLGVRA ITRYNLRHAR AVITVTQELK EYLRNIGVRD VEIIPNGVDT
SLLKPIDAGE AKIALGLEGD VIGYVGSLEY WVDLETVVSA LPDLDVTLLV VGPSLFTDYG
ERIKDMAERL GVGERVIFTG AVPYAELGRY ISAMDIGLNP LRMMKKNEYA AGGKIFNYLA
CGRPVLTTRM LSLERLLGDS LYYYDDRESF ISQVKRILES PQDQRRYREI AERYDWRALA
ARYESVLRRA AED
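
The gene and protein sequences above can be cross-checked by translating the DNA, and the GC content can be recomputed from the same string. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns codons to amino acids exactly as the standard code does (the two differ only in which start codons are permitted).

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAAGATAC TGGTCATACC GACAACTGAC TGGATAAGAC ATCCGTTTCC GAACAGGTTG"
print(translate(first_line))  # prints "MKILVIPTTDWIRHPFPNRL"
```

Passing the full gene sequence to translate() and comparing the result with the protein sequence (after removing its spaces) gives an end-to-end check; gc_percent() on the same input can be compared with the GC content field.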