Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1934 |
Symbol | |
ID | 3832426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2008257 |
End bp | 2009597 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829865 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_430775 |
Protein GI | 83590766 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00109377 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000195577 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGATCT TGATGCTTTC CTGGGAATAC CCGCCCCAGA GTGTCGGCGG CTTGGCCCGC CATGTGGAGG ATCTGGCTAT CTCCCTGGCG GCCCGCCATG ATGTTCACGT CCTGACTATT GGCCGACCCG GAGAAGCTTT CGAGAGCCGG GAGAACGGGT TGACCGTCCA CCGGGTGGAA GCCTACCCCG TTCATCCCCC TGATTTTCTC GTCTGGGTGC TGCAACTGAA TGCCCGCTTT ATGGAAGAGG CCATGATCCT CATGCGCCGG TACGGCCCCT TCCAGATTAT CCACGCCCAC GATTGGCTGG TGGCCTTTAC CGGCCGGGCT TTGAAGCACG CTTATCATTT ACCCCTCATC GCCACCATCC ACGCCACCGA GGCGGGCCGC AACCGCGGCC TCCACAACGA CATGCAGCGC TACATTAACA GCGTCGAATG GTGGCTGACC TACGAAGCCT GGCGGGTCAT TGTCTGCAGC CGGCATATGC GCCAGGAGGT CCAGGGGTTA TTCCAGCTGC CGGCTGACAA GATTACCATT ATACCCAACG GAGTGTATAG CAAAAAGTTC CGGGCCGGGA CAGTCGACCC GGAGGTCCGG CGGCGTTACG CCGCGCCTAA CGAGAAAATC CTCTTCTTTG TCGGCCGCCT GGTGATCGAA AAGGGAGTCC AGGTGCTCCT GGAGGCCATG CCTCGCATCC TCTCCTCTTG CCCGGAGGCC AAACTGGTGG TTGCCGGCCG GGGACCCATG GAAGGCCAGC TCCAGAACCG GGCCCGGGAA CTGGGAATCG GCCACAAGGT CTGTTTTGCC GGCTATATTG ACGACCGGAC CCGCAACCAG CTCTACCGGG CCGCCAGGGT GGCTGTCTTC CCCAGCCTTT ACGAGCCCTT CGGTATCGTC GCCCTGGAGG CCATGGCCGC CGGGACGCCG GTGGTGGCCA GCGAAACAGG CGGCCTGGCG GAGATAATCA CTCACGGCGT TGACGGCATG CGCGCCTATC CGGGCAACGC CAATTCCCTG GCCGACAACA TCCTGGCGGT CCTGCAGGAT GACGCTCTGG TTGCGAAACT CAGCGCCAAC GGCCGTCGCC TGGTAGCAGA GGTTTACGAC TGGGAAAATA TCGCCCGGCG CACGGCTGAC GTCTACCAGG AGGTTTACAA CCAGTATCGT CGCACCCCCT GGCCGGAACG GACCCCGGTA ATAGCCCGCC TGTGGCGCTT CGTCCCTTAC GTAGCCGGGG ACCAGGACAG AGAACAACCA CTGCCCCTGG GGGGGCGCTA TGACCTGGCC CGGTACCGGG CTACTCTGGT AAACCAGCAC CGGGGCAGGA GCGAGGGGTA G
|
Protein sequence | MRILMLSWEY PPQSVGGLAR HVEDLAISLA ARHDVHVLTI GRPGEAFESR ENGLTVHRVE AYPVHPPDFL VWVLQLNARF MEEAMILMRR YGPFQIIHAH DWLVAFTGRA LKHAYHLPLI ATIHATEAGR NRGLHNDMQR YINSVEWWLT YEAWRVIVCS RHMRQEVQGL FQLPADKITI IPNGVYSKKF RAGTVDPEVR RRYAAPNEKI LFFVGRLVIE KGVQVLLEAM PRILSSCPEA KLVVAGRGPM EGQLQNRARE LGIGHKVCFA GYIDDRTRNQ LYRAARVAVF PSLYEPFGIV ALEAMAAGTP VVASETGGLA EIITHGVDGM RAYPGNANSL ADNILAVLQD DALVAKLSAN GRRLVAEVYD WENIARRTAD VYQEVYNQYR RTPWPERTPV IARLWRFVPY VAGDQDREQP LPLGGRYDLA RYRATLVNQH RGRSEG
|
| |