Gene Moth_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1812 
Symbol 
ID3830730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1871292 
End bp1872515 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content56% 
IMG OID637829739 
Productglycosyl transferase, group 1 
Protein accessionYP_430655 
Protein GI83590646 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0100815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTC AGTTAAGCCT TAAAGATTAT GAAGGCGTAG CTGGAAGCGC CCTGATTGAC 
GAAATCCGAT CCCTGGGAGA AAGTTTACAA GGCTACAACG TCCGGCATAT CAACTCCACT
ATTATCGGCG GCGGGGTGGC AGAAATCTTA AGCTCCCTGG TCCCCCTGAT GGAAGACGTG
GGACTGACCG TCAACTGGGA GGTCCTAGCC GGCACCACGG AATTTTTCCA TACCACCAAG
CTATTTCACA ACGGTATGCA TGGCCAGCCG GTAAATATCA CCGGCGAGAT GCTGGAGAAT
TATCTGGCCA TAGCCCAAAA AAATCAGAAC CTCCTGGATG GAGATGCCGA CCTGGTGGTA
ATCCATGACC AGCAACCCCT GGGGCTAACC GCCTTCCGCG GCAGGACCAG GGGCCGGTGG
CTCTGGTACT GTCACGTCGA CCCGCGTTAT GCCGTACCAG AAGTATGGTA TTTTCTGGCG
CCAATGGTGG CCACCTGCGA TGCAGCCGTA TTTCACCTGC CAGAATACGC CCGCGACCTG
CCCGTCCTCC AGTACTTCAT GCCACCGGCT ATCGACCCCC TGTCTGACAA AAACAAGGAG
GTGTCTCCCG CTGATTACGA AGCGGTGCTC GAAAAGCTGG GCGTAGATCC GGAAGGTCCG
CCGGTAATCC TCCAGGTGTC CCGTTTCGAC CGGCTTAAAG ATCCTGCCGG TGTAATCGAG
GCCTTTAAAC TGGTCAGGAA AAATATAGCC TGCCGCCTAA TCCTGGCCGG CGGGAGCGCC
GATGACGACC CGGAAGGGGC TACTATCCTG GAAGAGGTGC GGGCATTGGC CGAGGGCGAC
CCGGACATTA CCGTACTTTC GTTAAATCCC GACGCAAACC TGGAAATAAA CGTTTTGCAG
CGGCGTGCCG ATGTGATAGT GCAAAAGTCA CTACGCGAGG GTTTTGGTTT AACGGCCACC
GAGGCCCTGT GGAAGGGTAA ACCCCTGGTA GCTACCCCCA CCGGTGGCCT GGCCTACCAG
GTGCTCGATG AGGAAACCGG ACTGACTGCC CGCACGGTGG AAGAGGTGGC CTTACAGGTA
GAACGCCTGC TGGTCAACCC CACCCTCGGG AGACGTCTGG GCGCTGCCGG CCGGGAACAT
GTCCGGCAGC GTTTTATCCT GCCGGTATAT TTGTATAACT GGTTAAAACT AATAAACTTG
CTCAACGGAC GACGCTGCGG CTAA
 
Protein sequence
MIRQLSLKDY EGVAGSALID EIRSLGESLQ GYNVRHINST IIGGGVAEIL SSLVPLMEDV 
GLTVNWEVLA GTTEFFHTTK LFHNGMHGQP VNITGEMLEN YLAIAQKNQN LLDGDADLVV
IHDQQPLGLT AFRGRTRGRW LWYCHVDPRY AVPEVWYFLA PMVATCDAAV FHLPEYARDL
PVLQYFMPPA IDPLSDKNKE VSPADYEAVL EKLGVDPEGP PVILQVSRFD RLKDPAGVIE
AFKLVRKNIA CRLILAGGSA DDDPEGATIL EEVRALAEGD PDITVLSLNP DANLEINVLQ
RRADVIVQKS LREGFGLTAT EALWKGKPLV ATPTGGLAYQ VLDEETGLTA RTVEEVALQV
ERLLVNPTLG RRLGAAGREH VRQRFILPVY LYNWLKLINL LNGRRCG