Gene Moth_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0668 
Symbol 
ID3832155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp700280 
End bp701416 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content51% 
IMG OID637828606 
Productglycosyl transferase, group 1 
Protein accessionYP_429536 
Protein GI83589527 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000385091 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACTAACA CAAAAGGCAG AGTTCTGTAT TTAGCCACCG TTTACACCCA CCTGGCCGCT 
TTTCACCTGC CTTTCATGCA GCTTTTGCAC GGTAAAGGAT ATGAGGTGCA CGCGGCAGCC
TCCTCTGATC AGGGGAGAAA GAAAGAGGTG GAGGCTATAG GTGTGAGATG CTGGGAGATC
CCTTTTTCCC GCTCGCCCTA TAGCCTTAAA AATTGGCGCG CTTTCAGGGA ATTAGGCCGG
CTGTTGGAAT ACTATCGGTT TGATCTCATC CATGTTCACA CCCCGGTAGC CAGTTTTTTG
GGGAGATATC TAGCTAGGGC TACAGGCCAG AGGCCTGTGC TCTATACCGC CCACGGCTTC
CACTTTTACC AGGGTGCGTC TTTACGGAAC TGGCTCCTTT ACTACCCTGC CGAGCGGCTG
GCTGCCCGCT GGACGGACGG GCTGATAGTG ATGAATAGGG AAGATTACGA TAGTGCCATT
AAAATGGGTT TCAGGCCGGG GGAGAACCTA TTTTATGTAC ACGGTGTAGG AGTAGATGTT
GACAAATTCA ACAGCATAGT TTCCTCTAGG AGTTTACGTG ATAAATTGGG TTTAAAAGCG
GACGATATTG TTATAACTTG TATAGCGGAA TTGATACCAA GAAAAAATCA CCTCTTTCTG
CTAAAGGCCT GGTCAAGACT TACTAAGAAG ATATGTTCCG CTCATCTTTT ATTGGTAGGT
GATGGTGTTC TCCGCCAGGC ATTGGAGTGC TGGGTAAATG GAAAAAGTGT AAGTAGGGTT
CATTTTCTTG GTTTTCGTCG GGATATACCT CAGCTTGTCC AGGAGGCAAA TATCGTCGTC
CTTGTTTCCC GGCATGAAGG CCTCCCCAGA TCCCTCATGG AAGCAATGGC TGCGGGGAAG
CCGGTGGTGG CGAGCAACGT GCGCGGCAAC CGAGATCTGG TAGACCACGG CCGGACGGGC
TTCCTGGTGG AGCTGGGCGA TGTAGAGGGG CTGGCTGGCT ATTTGGAACT GCTGGCCCGG
GATGAAAACC TGCGCCTGGC CTTAGGCAGA GCGGGCCGGG AGAAGATTGG CGATTATTCC
CTTGACAAAG TGCTCGCCGA GATGGATGCT GTTTACAGCC GCTATTTACC AAGCTAG
 
Protein sequence
MTNTKGRVLY LATVYTHLAA FHLPFMQLLH GKGYEVHAAA SSDQGRKKEV EAIGVRCWEI 
PFSRSPYSLK NWRAFRELGR LLEYYRFDLI HVHTPVASFL GRYLARATGQ RPVLYTAHGF
HFYQGASLRN WLLYYPAERL AARWTDGLIV MNREDYDSAI KMGFRPGENL FYVHGVGVDV
DKFNSIVSSR SLRDKLGLKA DDIVITCIAE LIPRKNHLFL LKAWSRLTKK ICSAHLLLVG
DGVLRQALEC WVNGKSVSRV HFLGFRRDIP QLVQEANIVV LVSRHEGLPR SLMEAMAAGK
PVVASNVRGN RDLVDHGRTG FLVELGDVEG LAGYLELLAR DENLRLALGR AGREKIGDYS
LDKVLAEMDA VYSRYLPS