Gene Information |
Locus tag | Moth_1118 |
Symbol | |
ID | 3833250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1146101 |
End bp | 1147096 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829046 |
Product | AAA ATPase |
Protein accession | YP_429975 |
Protein GI | 83589966 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | [TIGR02881] stage V sporulation protein K |
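The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon (the record lists the gene as not spliced and not a pseudogene).

```python
# Values copied from the record above.
start_bp = 1146101
end_bp = 1147096


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 996, matching "Gene Length"
print(protein_length(length_bp))  # 331, matching "Protein Length"
```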
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000000607625 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGGG TGAAATTTGG TCCCCAAAAT AATAAGAATA ATTCCTGGTC CCCGGTTCGT CCCCTCCAGA ACAACAGGGC TGGCGGGGAA CCGGAGCAGG CCGGTAGGCG GCGGGTAGAG GCGGTTATGG CCGAGCTGGA TAAGATGGTG GGATTGGAGG CAGTAAAGAA TTTGATTAAA GAGTTGCGGG CTTTTGTTGA GATCCAGCAA AGGCGCCGGG GTGAAGGTCT GGCTGCTTCT TCCACAGTAA TGCATATGAT TTTTAAGGGA AATCCCGGTA CCGGCAAGAG TACTGTTGCC AGGCTCATGG GCCGCCTCTT CAAAGAACTG GGTGTTTTAA CCCAGGGCCA GCTCATAGAA GTCGAGCGGG CTGACCTGGT GGGGGAGTAT ATTGGTCATA CGGCCCATAA AGCCCGCGAA CAGCTTAAGA AAGCCATAGG AGGTATTCTT TTTATCGACG AGGCTTATTC CCTGGCCCGG GGTGGTGAAA AGGACTTTGG TCGGGAGGCC ATCGACGTCC TGGTCAAGGG AATGGAGGAT TACCGGGATA ACCTGATTCT CATCCTGGCC GGGTACCGGG AAGAGATGGA GTACTTCCTG GAAATTAATC CCGGCCTGCG TTCCCGTTTC CCCATTCAAT TGGAATTTCC CGATTATACC GTTCCGGAAT TACTGGCCAT TGCCCGGGTG ATGCTGGCGG AAAGGCAGTA CGTCCTCGCA CCGGAGGCTG CCGCCGAGCT GGAGAAGATT CTCCGGCGGG AGGTTCTCTT TGGACACCGC TATAACGGCA ATGCCCGGAT GGTACGTAAT ATCATTGAAA GGGCCATCCG GCGCCAGGCC CTGCGCCTAG TTAATAAAAA TAGAAACCTC AGCCGCCGGG AATTGATGTA TATCGAAAAG GAGGACCTTT TACGGCCGGA TCTGCCTGCG GGGGTTAAAG ATGAGGGGAG TGAGCTTCCG GGAGGCCCAT CAGTCTGCTA TAATAACACC CGGTAA
|
Protein sequence | MLRVKFGPQN NKNNSWSPVR PLQNNRAGGE PEQAGRRRVE AVMAELDKMV GLEAVKNLIK ELRAFVEIQQ RRRGEGLAAS STVMHMIFKG NPGTGKSTVA RLMGRLFKEL GVLTQGQLIE VERADLVGEY IGHTAHKARE QLKKAIGGIL FIDEAYSLAR GGEKDFGREA IDVLVKGMED YRDNLILILA GYREEMEYFL EINPGLRSRF PIQLEFPDYT VPELLAIARV MLAERQYVLA PEAAAELEKI LRREVLFGHR YNGNARMVRN IIERAIRRQA LRLVNKNRNL SRRELMYIEK EDLLRPDLPA GVKDEGSELP GGPSVCYNNT R
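The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and the codon table holds the standard amino acid assignments, which translation table 11 shares. It does not handle table 11's alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, copied with the record's spacing.
head = "ATGCTGCGGG TGAAATTTGG T"
print(translate(head))  # MLRVKFG, the first seven residues of the protein
```

Passing the full "Gene sequence" string to `gc_content` and `translate` should allow the "GC content" and "Protein sequence" fields to be compared against the record.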