Gene Moth_1365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1365 
Symbol 
ID3832287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1408612 
End bp1409802 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content59% 
IMG OID637829301 
Productglycosyl transferase, group 1 
Protein accessionYP_430221 
Protein GI83590212 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0275215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTTG CCATGTTACA TTGGGCTTTC CCGCCGATCA TCGGGGGAGT GGAATCCCAC 
CTAGCCCTCC TGTGCCCTTA CCTCGTCCGG CAGGGGCACC AGGTAAGCCT CCTAACGGCC
ACAGCCCCCG GTACCCCGGT GGAGGAAAGC TGGCAGGGAG TTGTAATTAA GCGCTCGCCC
TTGCTGGATC TAAATTCCCT TACTCCAGCA GTTATTGAGG CCAGGGCCGG GGAGATCAAG
GAACTCTTGG AAAACTTTTT ACTGGCAGTG CGGCCGGATG TGGTCCACGC CCATAACTTT
CACTATTTCA GCTATGTACA CGCGGCCAGT CTCCAGGAAA TTTGCCGCCG TCACGGCTGG
CCCCTGGTCC TCACGGCCCA TAATGTGTGG GATGACGAAC TGTGGACCAG GATGAATAGC
CTGGCCAGGG GCTGGGACCT GGTTATTGCC GTCAGCCACT ACATACGCCA GGAATTGGTG
GTTAATGGCT ATCCGCCGGA GCGGGTGACA GTTGTCTACC ATGGCACAGA TACTAATACC
TTCCGGCCGC CCTCCCCGGA GGACAGGCAG GCCCTTTATA CCTCCTATCC GGAATGGCGG
GGACGGCGGA TTATCTTCCA CCCGGCCAGG ATGAGCCGGG CCAAGGGCTG TGACGTCAGC
ATTCGCGCCC TGGATCTCAT CCGCCGGGAA ATCCCCGACG TTCTCCTGGT ACTGGCCGGT
ACCACCAACA CCGTTGACTG GGGCCAAAAA CAGCCGGCGG AAGTAGCCTC TCTCCAGGAT
CTTATCGCCA GCTTGGGCCT GGAGGAAAAT GTCTTCATCC GTTTCTTCCC GTGGCAGGAG
ATGCCTGCTG TTTACCAGGG GGCCGAGGTC TGCCTCTACC CATCGGCCTT CCAGGAGCCG
TTTGGTTTGG TCATGCTGGA AGCCATGGCC ACGGCCAGGC CTATCATCGT CAGCCGCGCC
GGCGGCATGC CGGAGATCAT TCGTCCCGGA TATAACGGCT TTTTGGTCTC TATGGGGGAT
CACGAGGAAC TCGCCCGCTA TACCACTTTC CTTCTCCGTA ATCCGGAGGT GGCCAGGACC
ATGGGCCAGG ACGGCCGCAG GCTGGTAGAA GAAAACTTTA CTACCGCCGT GATGGCCCGA
AATACCCTGG AGGCATATAA CCAGTTGTTG GCCCTGCCCC GGGCCAGTTA G
 
Protein sequence
MRVAMLHWAF PPIIGGVESH LALLCPYLVR QGHQVSLLTA TAPGTPVEES WQGVVIKRSP 
LLDLNSLTPA VIEARAGEIK ELLENFLLAV RPDVVHAHNF HYFSYVHAAS LQEICRRHGW
PLVLTAHNVW DDELWTRMNS LARGWDLVIA VSHYIRQELV VNGYPPERVT VVYHGTDTNT
FRPPSPEDRQ ALYTSYPEWR GRRIIFHPAR MSRAKGCDVS IRALDLIRRE IPDVLLVLAG
TTNTVDWGQK QPAEVASLQD LIASLGLEEN VFIRFFPWQE MPAVYQGAEV CLYPSAFQEP
FGLVMLEAMA TARPIIVSRA GGMPEIIRPG YNGFLVSMGD HEELARYTTF LLRNPEVART
MGQDGRRLVE ENFTTAVMAR NTLEAYNQLL ALPRAS