Gene Moth_1133 details


Gene Information

Locus tag: Moth_1133
Symbol:
ID: 3833231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1161728
End bp: 1162861
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 54%
IMG OID: 637829063
Product: 1,2-diacylglycerol 3-glucosyltransferase
Protein accession: YP_429990
Protein GI: 83589981
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
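
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1161728
end_bp = 1162861
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1134 0 377
```

Under these assumptions the computed span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.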


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCATATTG GTATATTTAC CGATAGCTAC CTGCCCTATA CGAGTGGTGT CGTCAGATCG 
GTAGTTACCT TCAGCAGGGA ACTCCGGGCC CTGGGACACA GGGTGGTAAT CTTTGCCCCA
GCTTATGGTC ACCATGATCC TGAAAGGGAT ATCTACCGTT TTCGTTCCTT CAGGGCGCCG
ACCTTTAAAG AGTTTGCCCT AGCCATCCCG GTAGCGCCGG GCCTTACCAA TACCTTACGA
CAGCTGGGAA TCGATTTGAT CCACGTACAT TCCCCCTTTT TGATGGGCCA GTTGGGGGTC
AGAATGGCCC GCCGCTTGGG TCTGCCCCTG GTAGCTACTT ATCACACCCT TTATGAGGAA
TATATCCATT ACTTTCCCCT GGCTCCCGGG CTCCTGCGCC GGGTTGTCCG GAATTATACT
CTATCCTTTT ACAACGGCTG CCGGCTGGTA ATTACCCCTA CCGATACTAT AGCACGTTAC
CTGCAGGAAA ATGGGCTCAA AGTACCAGTT GTTAGCATTC CCACAGGAAT AGAGCTGGAA
CGTTTTCAGG ATGTTGACAC TGGCTGGTTG CGCCGTCACC TGCAGCTTCC AAGGGAAGAG
ATCATCCTTC TCCATGTAGG CCGTTTGGGC AAAGAAAAAA ATATCTCTTT TGTCCTCCAG
GCCTTTGCTA AAATCCATGG CGAGGTACCG GCGACCCGTC TGGTCCTGGT AGGTAGTGGC
CCCTTAAAGG GGGAGTTAGA GCACCAGGCC CATTCCCTGG GAATAGCCCA AGCGGTTACC
TTTGCCGGTT CCTTTTCTTT TGAACAAATG CCAGCCGTCT ATGCCGGCGC TGATTTATTT
GTCTTTGCCT CCGTTACCGA GACCCAGGGC CTGGTAGTGG GGGAGGCTAA AGCTGCCGGT
TTACCGGTAG TTGCCGTACG GGCCCGGGGA GTGCAGGAAA TGGTAGAAGA CGGCCGGGAT
GGTTTCTTAG TCCCTTTAGA TATTGAGACC TTCAGTGCCC GTATAAGACA ACTGGTCCTT
GATGCCGGCC TCCGTAAGGA AATGGGTCGG CAGGGACGCC TTAATGCTAG TTCCCTTGCG
GCGGCGACTA TGGCCCGCCG CCTGGCAGAC CAATACCAGG AGTTACTTGG ATAG
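
The record reports a GC content of 54% for this gene. A minimal sketch of how such a figure could be recomputed from the gene sequence follows; the function name `gc_content` is hypothetical, and the input is assumed to contain only A, C, G, T and whitespace, as in the blocks of ten bases shown above.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) is removed first, so the sequence can be
    pasted in the ten-base block layout used in this record.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not taken from the record: 4 of 8 bases are G or C.
print(gc_content("GGCC AATT"))  # prints: 50.0
```

Passing the full gene sequence to the function and rounding the result gives a value to compare against the reported percentage.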
 
Protein sequence
MHIGIFTDSY LPYTSGVVRS VVTFSRELRA LGHRVVIFAP AYGHHDPERD IYRFRSFRAP 
TFKEFALAIP VAPGLTNTLR QLGIDLIHVH SPFLMGQLGV RMARRLGLPL VATYHTLYEE
YIHYFPLAPG LLRRVVRNYT LSFYNGCRLV ITPTDTIARY LQENGLKVPV VSIPTGIELE
RFQDVDTGWL RRHLQLPREE IILLHVGRLG KEKNISFVLQ AFAKIHGEVP ATRLVLVGSG
PLKGELEHQA HSLGIAQAVT FAGSFSFEQM PAVYAGADLF VFASVTETQG LVVGEAKAAG
LPVVAVRARG VQEMVEDGRD GFLVPLDIET FSARIRQLVL DAGLRKEMGR QGRLNASSLA
AATMARRLAD QYQELLG
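
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the alternative initiation codons. The sketch below is a simplified, hand-written translator rather than the database's own method: the names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the function assumes the input starts at the first base of the CDS and contains only A, C, G, T and whitespace.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G at each position).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCATATTG GTATATTTAC CGATAGCTAC"))  # prints: MHIGIFTDSY
```

The output matches the first ten residues of the protein sequence above. A maintained library routine would be the safer choice for anything beyond this kind of spot check.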