Gene Moth_1853 details

Gene Information

Locus tag: Moth_1853
Symbol:
ID: 3831714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1913326
End bp: 1914387
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 63%
IMG OID: 637829785
Product: glycosyl transferase, group 1
Protein accession: YP_430696
Protein GI: 83590687
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
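
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS length includes the stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon, if
    included in the length, does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(1913326, 1914387)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Under these assumptions, 1914387 - 1913326 + 1 gives 1062 bp, and 1062 / 3 = 354 codons, one of which is the stop codon, leaving 353 residues.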


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATGTCC ACGTCCTGAC CCGGGCCCCA GGCGACCGGA GGGTTTCAAC CGTCCAGGAG 
GGTGTTTACC TGCATTATGT GCCCACCTAC CAGCAGCCCG ACAAAGAAAT TAATTTCCTC
TCCTGGGTGT TGCAGTTTAA CCTGGCCCTG GCCGATGCCG GCCAAGAGGT GCTGGCCGCC
TATCCCGGCC GGAATTGGAT CCTCCATGCC CACGACTGGC TGGTGGCCTA CGGCGCCCGG
GAACTCCAGG AGTCGGCCAG GGTACCCCTG GTAGCCACCA TCCACGCCAC CGAAGCCGGC
CGTAACCACG GTCTCCATAA CCGCATCCAG CAGGCCATCC ACCATATTGA GTCGGGGCTG
GTGAACGGTG CAGACCGGTT AATCTGCTGC AGCCGCTATA TGGAAGAAGA AATCAGACGC
CTGTTTCAGC CCCGGTCGGA GATAACCGTG ATTCCCAATG GTGTCCGGCC CATACCCCCG
GTTCCCCCCT GCCGGGACAG CCAGACCATT CTTTTTGTGG GCCGCCTGGT GGTGGAGAAG
GGGGTCCAGG TCCTCCTGGC CGCCCTGGCG CGCCTTAAGC GACTTTACCC GGGAGCCAGG
TTAATCGTCG CCGGGGCCGG TCCCTACGCT GGAGAACTGC AAACCATGGC TAACAACCTG
GGTCTGGCCG ACAGGGTTGA GTTTACCGGT TTTGTTTCCG AGGAGGTTCG TAATCGGCTC
CTGGCCCGGT CCCGGGTGGC CGTGTTCCCC AGCCTCTATG AGCCCTTTGG TATTGTCGCC
CTGGAGGCCA TGGCTGCCGG GATACCGGTG ATCGTATCCC GGACGGGCGG CCTGGCCGAG
GTGGTGGAAG ACAACCGTAC CGGCTTGACC TTCAACCCCG GAGATGTCGC CGACCTGGAG
CGTCGCCTGG TAACAATTTT CCAGAACCCC GACCTGGCCG CCGAGCTGGG CCGTAGCGGC
CAGGCCCGGG TTTACCGGGA TTATACCTGG GAGGCCGTGG CCCGGCAAAC CCTGGCCCTT
TACCGGGGCG TTCTCCGGGA GAACTCTTTG ATCGCCAGTT AG
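
A GC content figure like the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as shown here, with blocks of ten bases separated by spaces and line breaks; `gc_content` is a hypothetical helper, not a database function.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and newlines used to block the sequence)
    is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence above: 4 G + 2 C out of 10 bases.
print(gc_content("GTGGATGTCC"))  # 0.6
```

Passing the full gene sequence instead of the first block gives the gene-wide value, which can then be compared with the rounded percentage in the Gene Information section.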
 
Protein sequence
MDVHVLTRAP GDRRVSTVQE GVYLHYVPTY QQPDKEINFL SWVLQFNLAL ADAGQEVLAA 
YPGRNWILHA HDWLVAYGAR ELQESARVPL VATIHATEAG RNHGLHNRIQ QAIHHIESGL
VNGADRLICC SRYMEEEIRR LFQPRSEITV IPNGVRPIPP VPPCRDSQTI LFVGRLVVEK
GVQVLLAALA RLKRLYPGAR LIVAGAGPYA GELQTMANNL GLADRVEFTG FVSEEVRNRL
LARSRVAVFP SLYEPFGIVA LEAMAAGIPV IVSRTGGLAE VVEDNRTGLT FNPGDVADLE
RRLVTIFQNP DLAAELGRSG QARVYRDYTW EAVARQTLAL YRGVLRENSL IAS
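
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG can serve as an alternative start codon that is read as methionine. The sketch below illustrates this on the first and last few codons of the record. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G) and lists only a subset of the table 11 start codons; `translate` is an illustrative helper, not the tool used to produce the record.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and
    stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("GTGGATGTCC ACGTCCTGAC C"))  # MDVHVLT
# Last three codons, including the TAG stop.
print(translate("GCCAGTTAG"))                # AS
```

The two outputs match the beginning (MDVHVLT) and the end (AS) of the protein sequence listed above.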