Gene Information |
Locus tag | Moth_0714 |
Symbol | |
ID | 3830990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 745569 |
End bp | 746468 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828645 |
Product | heat shock protein HtpX |
Protein accession | YP_429575 |
Protein GI | 83589566 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
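The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 745569
end_bp = 746468
gene_length_bp = 900
protein_length_aa = 299


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("span length matches Gene Length:", computed_length == gene_length_bp)
print("codon count matches Protein Length:", computed_protein == protein_length_aa)
```

Under these assumptions, 746468 - 745569 + 1 gives 900 bp, and 900 / 3 - 1 gives 299 aa, in agreement with the record.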
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000323195 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0549384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGACGGC AACTCGGTAG TGATGCCGGT TTAACGGCAC GAATGTTCCT GACCATGTTC CTCCTGGCAG CCCTGTATCT CTTTTTCCTG GCCGTTTTAT GGCAGGCCGG CGTCAGCTAT ACGGGGATCA TTGTCTTTGT AGCTATTATG CTGGGGGTTC AGTACTACTT TTCCGACCGG ATGGTCCTCT GGTCCATGGG TGCTAAGGAA GTATCGCCCC GGGAGGCTCC GGAACTCCAC GCTCTGGTGG AGAGGCTGGC CGCCCTGGCG GATTTGCCCA AGCCAAGGGT AGCCATTGTC CCCACCCCTA TGCCCAACGC CTTTGCCACC GGCCGCAACC CGGCCAACGC CGTAGTGGCC GTGACAACGG GGCTCATGGA GCGTCTAACC CCATCTGAGC TGGAGGCAGT CCTTGGTCAC GAGCTGACCC ACGTTAAAAA CCGGGACATG ACCGTGCTGA CCCTGGCCAG CTTTTTCGCC ACCGTAGCTT CCTTCATCGT CCAGAACTTC TTCTACTGGG GCGGAGCCTT TGGGGGCGGT CGCGATCGAG ACGAGCGTAA CAATATAATG CTGGTTTACC TGGCGTCCCT GGTAGTATGG CTGGTAAGCT ATTTCCTGAT CCGTGCCCTG TCCCGTTACC GGGAGTTTGC AGCCGACCGG GGTTCGGCCA TTCTTACCGG GTCACCGGGA CAATTGGCCT CCGCCCTGGT GAAAATTAGC GGCAGCATGG CTCGCATTCC CACCCGGGAC CTGCGCCAGG CCGAGGCTTT TAACGCTTTC TTTATCATCC CGGCCCTGAA CGGCAACAGC ATCATGGAAC TCTTTTCCAC GCATCCGTCC CTGGAGCGGC GCCTGGCCTA CCTGCGGCGG CTGGAGCAAG AAATGGAGGA ACGGCGGTGA
Protein sequence | MRRQLGSDAG LTARMFLTMF LLAALYLFFL AVLWQAGVSY TGIIVFVAIM LGVQYYFSDR MVLWSMGAKE VSPREAPELH ALVERLAALA DLPKPRVAIV PTPMPNAFAT GRNPANAVVA VTTGLMERLT PSELEAVLGH ELTHVKNRDM TVLTLASFFA TVASFIVQNF FYWGGAFGGG RDRDERNNIM LVYLASLVVW LVSYFLIRAL SRYREFAADR GSAILTGSPG QLASALVKIS GSMARIPTRD LRQAEAFNAF FIIPALNGNS IMELFSTHPS LERRLAYLRR LEQEMEERR
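The sequences above are printed in space-separated blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch, not the database's own method, of how the Gene sequence field could be cleaned and checked against the GC content and Translation table fields. The helper names are hypothetical. The stop codons listed are those of translation table 11 (TAA, TAG, TGA).

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_in_frame_with_stop(raw: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop."""
    seq = clean_sequence(raw)
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Small demonstration on a toy sequence, not on the record itself.
toy = "ATGAAAGGCC CCTGA"
print(clean_sequence(toy))
print(round(gc_percent(toy), 1))
print(ends_in_frame_with_stop(toy))
```

To apply this to the record, paste the full Gene sequence value into a string and pass it to `gc_percent` and `ends_in_frame_with_stop`. The result can then be compared with the reported GC content of 59% and with the final block of the sequence, which ends in TGA.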