Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0668 |
Symbol | |
ID | 3832155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 700280 |
End bp | 701416 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637828606 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_429536 |
Protein GI | 83589527 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000385091 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGACTAACA CAAAAGGCAG AGTTCTGTAT TTAGCCACCG TTTACACCCA CCTGGCCGCT TTTCACCTGC CTTTCATGCA GCTTTTGCAC GGTAAAGGAT ATGAGGTGCA CGCGGCAGCC TCCTCTGATC AGGGGAGAAA GAAAGAGGTG GAGGCTATAG GTGTGAGATG CTGGGAGATC CCTTTTTCCC GCTCGCCCTA TAGCCTTAAA AATTGGCGCG CTTTCAGGGA ATTAGGCCGG CTGTTGGAAT ACTATCGGTT TGATCTCATC CATGTTCACA CCCCGGTAGC CAGTTTTTTG GGGAGATATC TAGCTAGGGC TACAGGCCAG AGGCCTGTGC TCTATACCGC CCACGGCTTC CACTTTTACC AGGGTGCGTC TTTACGGAAC TGGCTCCTTT ACTACCCTGC CGAGCGGCTG GCTGCCCGCT GGACGGACGG GCTGATAGTG ATGAATAGGG AAGATTACGA TAGTGCCATT AAAATGGGTT TCAGGCCGGG GGAGAACCTA TTTTATGTAC ACGGTGTAGG AGTAGATGTT GACAAATTCA ACAGCATAGT TTCCTCTAGG AGTTTACGTG ATAAATTGGG TTTAAAAGCG GACGATATTG TTATAACTTG TATAGCGGAA TTGATACCAA GAAAAAATCA CCTCTTTCTG CTAAAGGCCT GGTCAAGACT TACTAAGAAG ATATGTTCCG CTCATCTTTT ATTGGTAGGT GATGGTGTTC TCCGCCAGGC ATTGGAGTGC TGGGTAAATG GAAAAAGTGT AAGTAGGGTT CATTTTCTTG GTTTTCGTCG GGATATACCT CAGCTTGTCC AGGAGGCAAA TATCGTCGTC CTTGTTTCCC GGCATGAAGG CCTCCCCAGA TCCCTCATGG AAGCAATGGC TGCGGGGAAG CCGGTGGTGG CGAGCAACGT GCGCGGCAAC CGAGATCTGG TAGACCACGG CCGGACGGGC TTCCTGGTGG AGCTGGGCGA TGTAGAGGGG CTGGCTGGCT ATTTGGAACT GCTGGCCCGG GATGAAAACC TGCGCCTGGC CTTAGGCAGA GCGGGCCGGG AGAAGATTGG CGATTATTCC CTTGACAAAG TGCTCGCCGA GATGGATGCT GTTTACAGCC GCTATTTACC AAGCTAG
|
Protein sequence | MTNTKGRVLY LATVYTHLAA FHLPFMQLLH GKGYEVHAAA SSDQGRKKEV EAIGVRCWEI PFSRSPYSLK NWRAFRELGR LLEYYRFDLI HVHTPVASFL GRYLARATGQ RPVLYTAHGF HFYQGASLRN WLLYYPAERL AARWTDGLIV MNREDYDSAI KMGFRPGENL FYVHGVGVDV DKFNSIVSSR SLRDKLGLKA DDIVITCIAE LIPRKNHLFL LKAWSRLTKK ICSAHLLLVG DGVLRQALEC WVNGKSVSRV HFLGFRRDIP QLVQEANIVV LVSRHEGLPR SLMEAMAAGK PVVASNVRGN RDLVDHGRTG FLVELGDVEG LAGYLELLAR DENLRLALGR AGREKIGDYS LDKVLAEMDA VYSRYLPS
|
| |