Gene Moth_1934 details

Gene Information

Locus tag: Moth_1934
Symbol:
ID: 3832426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2008257
End bp: 2009597
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 63%
IMG OID: 637829865
Product: glycosyl transferase, group 1
Protein accession: YP_430775
Protein GI: 83590766
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
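
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the reported 1341 bp length implies) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2008257, 2009597)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Both values agree with the Gene Length and Protein Length fields reported for Moth_1934.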


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00109377
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000195577
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGATCT TGATGCTTTC CTGGGAATAC CCGCCCCAGA GTGTCGGCGG CTTGGCCCGC 
CATGTGGAGG ATCTGGCTAT CTCCCTGGCG GCCCGCCATG ATGTTCACGT CCTGACTATT
GGCCGACCCG GAGAAGCTTT CGAGAGCCGG GAGAACGGGT TGACCGTCCA CCGGGTGGAA
GCCTACCCCG TTCATCCCCC TGATTTTCTC GTCTGGGTGC TGCAACTGAA TGCCCGCTTT
ATGGAAGAGG CCATGATCCT CATGCGCCGG TACGGCCCCT TCCAGATTAT CCACGCCCAC
GATTGGCTGG TGGCCTTTAC CGGCCGGGCT TTGAAGCACG CTTATCATTT ACCCCTCATC
GCCACCATCC ACGCCACCGA GGCGGGCCGC AACCGCGGCC TCCACAACGA CATGCAGCGC
TACATTAACA GCGTCGAATG GTGGCTGACC TACGAAGCCT GGCGGGTCAT TGTCTGCAGC
CGGCATATGC GCCAGGAGGT CCAGGGGTTA TTCCAGCTGC CGGCTGACAA GATTACCATT
ATACCCAACG GAGTGTATAG CAAAAAGTTC CGGGCCGGGA CAGTCGACCC GGAGGTCCGG
CGGCGTTACG CCGCGCCTAA CGAGAAAATC CTCTTCTTTG TCGGCCGCCT GGTGATCGAA
AAGGGAGTCC AGGTGCTCCT GGAGGCCATG CCTCGCATCC TCTCCTCTTG CCCGGAGGCC
AAACTGGTGG TTGCCGGCCG GGGACCCATG GAAGGCCAGC TCCAGAACCG GGCCCGGGAA
CTGGGAATCG GCCACAAGGT CTGTTTTGCC GGCTATATTG ACGACCGGAC CCGCAACCAG
CTCTACCGGG CCGCCAGGGT GGCTGTCTTC CCCAGCCTTT ACGAGCCCTT CGGTATCGTC
GCCCTGGAGG CCATGGCCGC CGGGACGCCG GTGGTGGCCA GCGAAACAGG CGGCCTGGCG
GAGATAATCA CTCACGGCGT TGACGGCATG CGCGCCTATC CGGGCAACGC CAATTCCCTG
GCCGACAACA TCCTGGCGGT CCTGCAGGAT GACGCTCTGG TTGCGAAACT CAGCGCCAAC
GGCCGTCGCC TGGTAGCAGA GGTTTACGAC TGGGAAAATA TCGCCCGGCG CACGGCTGAC
GTCTACCAGG AGGTTTACAA CCAGTATCGT CGCACCCCCT GGCCGGAACG GACCCCGGTA
ATAGCCCGCC TGTGGCGCTT CGTCCCTTAC GTAGCCGGGG ACCAGGACAG AGAACAACCA
CTGCCCCTGG GGGGGCGCTA TGACCTGGCC CGGTACCGGG CTACTCTGGT AAACCAGCAC
CGGGGCAGGA GCGAGGGGTA G
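
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure such as the 63% reported in this record could be recomputed, the helpers below (hypothetical names, not taken from any database tool) strip the whitespace and count G and C. Only the first row of the sequence is used here for brevity; the full gene would need to be pasted in to reproduce the record-level value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGCGGATCT TGATGCTTTC CTGGGAATAC CCGCCCCAGA GTGTCGGCGG CTTGGCCCGC"
seq = clean_sequence(first_row)
print(len(seq), "bases")
print(f"GC content of this row: {gc_content(seq):.1f}%")
```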
 
Protein sequence
MRILMLSWEY PPQSVGGLAR HVEDLAISLA ARHDVHVLTI GRPGEAFESR ENGLTVHRVE 
AYPVHPPDFL VWVLQLNARF MEEAMILMRR YGPFQIIHAH DWLVAFTGRA LKHAYHLPLI
ATIHATEAGR NRGLHNDMQR YINSVEWWLT YEAWRVIVCS RHMRQEVQGL FQLPADKITI
IPNGVYSKKF RAGTVDPEVR RRYAAPNEKI LFFVGRLVIE KGVQVLLEAM PRILSSCPEA
KLVVAGRGPM EGQLQNRARE LGIGHKVCFA GYIDDRTRNQ LYRAARVAVF PSLYEPFGIV
ALEAMAAGTP VVASETGGLA EIITHGVDGM RAYPGNANSL ADNILAVLQD DALVAKLSAN
GRRLVAEVYD WENIARRTAD VYQEVYNQYR RTPWPERTPV IARLWRFVPY VAGDQDREQP
LPLGGRYDLA RYRATLVNQH RGRSEG
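
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, assuming that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the start codons it allows. The compact table string follows the usual T, C, A, G base ordering, and the function name is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCGGATCT TGATGCTTTC CTGGGAATAC"))  # MRILMLSWEY
```

The output matches the first ten residues of the protein sequence listed above.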