Gene Moth_1831 details


Gene Information

Locus tag: Moth_1831
Symbol:
ID: 3832800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1889192
End bp: 1890613
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 57%
IMG OID: 637829761
Product: glycosyltransferase
Protein accession: YP_430674
Protein GI: 83590665
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
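
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 1889192
end_bp = 1890613

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1422 473
```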


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000115115
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTGGG AAACGATAAG TCCAGCGCAA ATCCTTTATT TAATTACCGT GGGGTTCTAC 
CTGCTTTTTT TCGGCCTCTT CCTGAGATAT TTCTATTGGA AATGGTATGC CGTCAAGTAT
CACTGGCGTA AACGCCTGCC CCTTGATGCT GAAAAGGTAA AGGCCCTGGC CGCCGCGAAG
GGCCTGGAGA TCCCTTTCTT TACCATTATG GTACCGGCCC GCAACGAGTC CGAGGTTATA
GCCAACACCA TCGAACACCT GGCCTCCTTA AACTACCCCA ATGATCGTTA TGAGATCCTG
GTAATCACCG ATGAAAAGGA AGCCCTGGCC AAGGCCGAAG GCCAGGGCGA GGGGCCTACC
ACCATGGAAG TGGTCGAGGC CAAGATCCGT GAGTTTGCAG CGCGCCCGGG TATGCCCCAG
CTGAAGCATT GTACCGTTCC CTACGATTTT GACGGCCGCT TCCGGGGTTC ACGGCGGGGG
CACAGCATCC CTTCCACCAA GGGCCGGGCC CTGAACTACG GCCTGGAGTT TGTCGACCCG
CGGACGACTA TTTGTGGTTT CTATGACGCC GAGAGCCATC CAGAGGCTGA TGTTCTCCTT
TACATAGCCT GGTCCTGGCT CCATGACCCG CGGGAGCGTA TCTGGCAGGG TCCCGTCTTC
CAGGTGCGTA ATTTCTACCA GCTGGGTATT ATTACAAAGA TCGCCGCCAT CTACCAGGCC
ATCTCCCATG AGATCTACCT GCCCATACTA ATGAAGAAGC TGCCCTTCGT AGGGGGCACC
AATCTCTTCG TCGGCCGGCG CCTCCTGGAG CGTATCGGGG GTTATGATCA CCGCGCCCTG
ACGGAAGACC TCGAGCTGGG GGTGCGGGCC TTCCTGGAGA CAGGGGTGTG GGCCGAGTAT
TTCCCTTATT TCAGCACCGA ACAAACGCCG GCCACCCTGT ACGCCTTTTT CCGGCAGCGC
TTGCGCTGGG GTAGCGGTCA CCTCCAGGTC TGTGATAAAT TCCGTTATGC CTACCAGTAT
TCCTGGGATA AGAGGGGCCC ACTACTCCAC AACCTCTTCT GGAAGGGCCA GGGCGAGTGG
CTCCTCTATC AGGGCGCGGT ACTGGTGCCT TTATCCATTG TCATCCTGGG GCTGAACGGC
GGGCTTGATC CCTCGATCGT CCCTTTTAAA ATCCGCGTGG TCCTCCATTA CCTGGTTTTC
ATCTACTTTG CCTTTACCTT TTATGCTTAC GGCCACTTCC ACCGCTTGAT GGCGCCGGTT
AACTGGTGGC AGCAGTTTAT CGGGTTCCTG CAGCTCCTGG CCCTGCCCTT TGCCAGTTTC
TTTTTGCCCC TGCCATATAC GGCGGCTTCC ATCATGAAGG CCCTGAACCG CCAGCCCCAG
ACGTGGGTTA AAACTCCACG GACCAAAGAG GCGACCCGCT AG
 
Protein sequence
MDWETISPAQ ILYLITVGFY LLFFGLFLRY FYWKWYAVKY HWRKRLPLDA EKVKALAAAK 
GLEIPFFTIM VPARNESEVI ANTIEHLASL NYPNDRYEIL VITDEKEALA KAEGQGEGPT
TMEVVEAKIR EFAARPGMPQ LKHCTVPYDF DGRFRGSRRG HSIPSTKGRA LNYGLEFVDP
RTTICGFYDA ESHPEADVLL YIAWSWLHDP RERIWQGPVF QVRNFYQLGI ITKIAAIYQA
ISHEIYLPIL MKKLPFVGGT NLFVGRRLLE RIGGYDHRAL TEDLELGVRA FLETGVWAEY
FPYFSTEQTP ATLYAFFRQR LRWGSGHLQV CDKFRYAYQY SWDKRGPLLH NLFWKGQGEW
LLYQGAVLVP LSIVILGLNG GLDPSIVPFK IRVVLHYLVF IYFAFTFYAY GHFHRLMAPV
NWWQQFIGFL QLLALPFASF FLPLPYTAAS IMKALNRQPQ TWVKTPRTKE ATR
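
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a listing could be checked against the record, the code below strips the whitespace, computes GC content, and translates the DNA. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits. Only the first three blocks and the last twelve bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to match translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
dna_start = clean("ATGGATTGGG AAACGATAAG TCCAGCGCAA")
print(translate(dna_start))          # MDWETISPAQ

# Last twelve bases of the gene sequence, ending in the TAG stop codon.
print(translate("GCGACCCGCTAG"))     # ATR
```

Running `gc_percent(clean(...))` over the full gene sequence would allow a comparison with the 57% GC content reported in the record; that value is not recomputed here.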