Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1010 |
Symbol | |
ID | 3833313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1037862 |
End bp | 1039067 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828938 |
Product | acyltransferase 3 |
Protein accession | YP_429867 |
Protein GI | 83589858 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3936] Protein involved in polysaccharide intercellular adhesin (PIA) synthesis/biofilm formation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.114811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000776761 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGAAAC CCCATATTGG CGAGGTCGAT TACCTGCGAG TTTTCGGCCT GATAACCATT GTTTTGATCC ATGCCTGGGG ATTTTACCTG TTGATGCCGG TGGCCAGTCC TTACAGCCGT ATCGCCCAGG AATTGGGCGT TAACCTGCTT CGCTTTGGCC GCCAGATTTT CATGTTCATT ACCGGCCTGG TTCTTTTTTA TAACTACAGC GGTCGTAAAC TGAATCTCAA TCGCTTCTTC AGCCGCCGGC TAAAAAACCT GGCCATCCCC TATGTTATCT GGACAGCTTT CTACCTGATC CTCAAACGTT CTTCCGGAAT GCTCAACTGG ACCGGCTTTG GCGGTTTTCT AACTTTATGG TGGCAAAATG TCCTGAACGG CAATGGTTAC AGCCATCTCT ATTACATCCT GGTAGCTATT CAATTTTATT TGTTTTTCCC TTGGCTCGTC GCCCTTTTTA AGCCCCGCCG CCCGGGAATG ACGGCGGAGA TTATTATCGG GTTGGGCCTG GCCCTGTATG CCCTGTACTT TTACCTCTTT GAGGTCCGGC AGGACTTGGT AGGGGCGGCG GTTGCCGGTA CCCCCCTGGC TGGTATTACC GGCTGGTTGT TCCTCTATAA AGATCGCCTG CTAGTCTCCT ATTTTCCTTA CTACCTCCTG GGAGCCCTGG CCGGCCTGCA CCTGGATTCA TGGCGGCGAT GGCTGCAGGA CCACCTGGAA GTAGCTGTGG CCTTTCTAGT AATTGCGGCC AGCCTGGTTA TCAGTGAGTA TTTTTATTAC TACCGGCGCC AGGGCCAGCC CTGGGCTCTG ACCATCAGCG TTTTTAAACC CAGTATTTAC CTGTACAGCC TGGCAATCAT CACCGTTTTT TTTCAATTGT CTTTTTACCT TGAGCGCCGG GGTTTCCTGC GCTGGCTGGT TTCCCCCCTG GCAGCCAATT CTCTGGGAAT TTACTTGCTT CACCCAGCGA TATTATTTAT TTTTAACAGC TATTTCTGGA ATTACGTGCA CCTGCCCGGT TTCTTGCTGG CTATCCTGGA ACCTGCGGCA GCAATCGTTA TCTCGGGAGC CATCAGCACC CTGCTGGGCA GTAACCGCTA CACCCGCTTT ATTGTTGGTG AAGCCGGGAA TCTAAGGAAT AGTTTCCCGT GGGGTAAGTG GCGGGAGCAC AGGGCCGTTC TCCACGGTGT AGGCCGGGAA AGCTGA
|
Protein sequence | MGKPHIGEVD YLRVFGLITI VLIHAWGFYL LMPVASPYSR IAQELGVNLL RFGRQIFMFI TGLVLFYNYS GRKLNLNRFF SRRLKNLAIP YVIWTAFYLI LKRSSGMLNW TGFGGFLTLW WQNVLNGNGY SHLYYILVAI QFYLFFPWLV ALFKPRRPGM TAEIIIGLGL ALYALYFYLF EVRQDLVGAA VAGTPLAGIT GWLFLYKDRL LVSYFPYYLL GALAGLHLDS WRRWLQDHLE VAVAFLVIAA SLVISEYFYY YRRQGQPWAL TISVFKPSIY LYSLAIITVF FQLSFYLERR GFLRWLVSPL AANSLGIYLL HPAILFIFNS YFWNYVHLPG FLLAILEPAA AIVISGAIST LLGSNRYTRF IVGEAGNLRN SFPWGKWREH RAVLHGVGRE S
|
| |