Gene Information |
Locus tag | Moth_0063 |
Symbol | |
ID | 3830813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 61613 |
End bp | 62551 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637827995 |
Product | glycoside hydrolase family protein |
Protein accession | YP_428945 |
Protein GI | 83588936 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
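The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the reported figures but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 61613
end_bp = 62551
reported_gene_length = 939      # bp
reported_protein_length = 312   # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 939 bp and 312 aa, and the gene length is an exact multiple of three.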
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00787456 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | TTGGCCCTGC AGGAACGACA ACTGGAAAAC AATTGGTACT ATGTCCCCGA CCCGTCGGGG CTGGCCTCCT TACGCAGCTA CGGTGGCCTG CTACGATATG TCCTGCCCTT CTGGTTTGGC GTAACTGAGA ACGGTGGCCT GGTGGACCAG GCCGACACCG GGGGACTGGC AGCCATCCGC CGCTACAATC TGCCGGTGCT GGCCATTGTC CACAATTACT CCAGCCCCCA GTACGGTCCC CTCATCCATC GCCTGCTGAC GACAGAAGGG TTGCGCCAGG CCCTGGTCCA GAATATCCTG AACCTGATGT ATCGCTGGGA CTTTGCCGGG GTGAATATTG ATTTTGAGTT CGTCCCGCCG GAGGATCGTC CCTACCTGAC CAGTTTTTTG AACCAGCTGG GGCAGACCCT GAAAGGGGCC GGCTTTTTAA CAACTATTTC CGTACCGGCG GAACTCCGGG ACAATCCCCG CCACCCCTTC TCGGGGGCTT TCAGCTATCC GAATCTGGCC GCCGCCAGCG ACCAGGTTTA TATCCTGGCT TACGATGAAC ATTTCGCCAC CCCGGGTCCC ATAGCCTCTA CCAGTTTTAT CCGCCAGGTC CTGGATTATG CCGTGACGGT AATTCCCCGG GAAAAGATTC GCCTGGGCAT GGCTGTCTAC GGTTATGACT GGGCCGAAGG GGCCAGGGTG CCGGTGACCC TATCCCACAG CCAGGTCCTT GACCTGGCCC GTCGTGTCGG GGCCAGCATC TACTATGACC CCAATGCCCA GGAGTCGACC TTTACTTATG TTGAGGATAA TACGCCCCAT GTGGTGTGGT TTGAAGACGT GCGCAGCTTT AGCGTTCGCC TGGGGCTGGT CAGGGAGTAC AACCTGCCTG GTATTGCTGT CTGGCGCCTG GGCCTGGAAG ACCAGCGCAT CTGGGAACTC CGGGGGTAG
Protein sequence | MALQERQLEN NWYYVPDPSG LASLRSYGGL LRYVLPFWFG VTENGGLVDQ ADTGGLAAIR RYNLPVLAIV HNYSSPQYGP LIHRLLTTEG LRQALVQNIL NLMYRWDFAG VNIDFEFVPP EDRPYLTSFL NQLGQTLKGA GFLTTISVPA ELRDNPRHPF SGAFSYPNLA AASDQVYILA YDEHFATPGP IASTSFIRQV LDYAVTVIPR EKIRLGMAVY GYDWAEGARV PVTLSHSQVL DLARRVGASI YYDPNAQEST FTYVEDNTPH VVWFEDVRSF SVRLGLVREY NLPGIAVWRL GLEDQRIWEL RG
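The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 assigns the same amino acids to codons as the standard code, and that the first codon (here TTG, which would otherwise encode leucine) acts as an initiator and is therefore written as M. The function name `translate_cds` and the table-building approach are hypothetical helpers for illustration, not part of the source record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these amino-acid assignments with the
# standard code and differs only in which codons may start translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    The first codon is written as M on the assumption that it is a valid
    initiator codon.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("TTGGCCCTGC AGGAACGACA ACTGGAAAAC"))  # MALQERQLEN
```

The output matches the first ten residues of the protein sequence listed above.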