Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1746 |
Symbol | |
ID | 3832891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1798329 |
End bp | 1799450 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637829670 |
Product | hypothetical protein |
Protein accession | YP_430590 |
Protein GI | 83590581 |
COG category | [R] General function prediction only |
COG ID | [COG1672] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000207576 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.634791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAG GACTATTCCC GCTTGGCGGT CCTGTTGCTA AGGAAGACCT GGTCGGCCGG GAACAATTCA TCGTTTCTTT AGTCAATCGC CTTTCAGAAG GACAAAGTGT GATGTTGGCA GGACCTCGTC GCATTGGCAA GACATCTCTA GCCCACGAAG TCCTGCGTAG GATGAAAAAT AAGGGAGCTT ATACCGCCGC CGTGGACTTC TTCCGTTTTT CCGGCAAGCG GGATTTTGCT GCCAGTTTGA TTGACGCATG TCTGGAAAAC AGGACGGGAA TTAATAAAAC CTTGAACGTC TTACGGGATC GGGCTAAAGC CATGGCTGGT GGAGCGCAGT TTGCTATTAA ACTTAAAGAC CTGGAGATTT CCTTCGGGTT TCCCGATAAA AAATCGGACG ACGAACTCCT TGACTATGCC TTGAAGCTTC CGGGTATCCT GGCCGACAGG GATGAAAAGG TAATGGTGGT CATGTTTGAT GAGTTTCAGG AGGCCTCCAG GGTAACGGAC CCCGAAATCT TTAAACGGAT GCGTGCCCAT TTCCAGACCC AAAAAGGCGT GGCCTACCTC TTTCTAGGGT CGAAGGAAGG CATGATGCAG ACCCTTTTTG GTGGGAGAAA GGAAGCATTC TACCGCTTTG CCATCATGCT TCCCATTCCT ATTATAGCTG AAGACGATTG GATACCATAT ATTACTCAAA AATTTGCCTC CAGAAACATT CAAACCGATG CGGAGGTGGT AAAAGAGATT ATTCAATTTT CCGGTGGTCA CCCGCAAGAT ACAATGTTCC TTTGTTCGGA AGTTTATTAT ACCCTTTTGG AAACTGGTAA TAATGTCCTT ACGCGCGACT ACGTCCGGTT AGGATATAAC CGGGCAATGC TGGCCCTTGC CCCAATCTTT GATGAAATGC TGGATGATTT AAGCACTCGT CCTCAAGTCC GGAGGGTGCT ATACCAGCTT GCGGCAGGCG AAAATGTATA TCAAGAGGGT ATCCACCCAA ATAAAATCAA ACGGGCTGTC GACCATCTAA TCAGCAGGGC AGTTATTGAA AAAACGGCCC GTGGGAGCTA TGTTTTTGTT GAGCCAATGT TCCAGCAGTA TATTTTGCAG CAGTTTCAGT GA
|
Protein sequence | MTKGLFPLGG PVAKEDLVGR EQFIVSLVNR LSEGQSVMLA GPRRIGKTSL AHEVLRRMKN KGAYTAAVDF FRFSGKRDFA ASLIDACLEN RTGINKTLNV LRDRAKAMAG GAQFAIKLKD LEISFGFPDK KSDDELLDYA LKLPGILADR DEKVMVVMFD EFQEASRVTD PEIFKRMRAH FQTQKGVAYL FLGSKEGMMQ TLFGGRKEAF YRFAIMLPIP IIAEDDWIPY ITQKFASRNI QTDAEVVKEI IQFSGGHPQD TMFLCSEVYY TLLETGNNVL TRDYVRLGYN RAMLALAPIF DEMLDDLSTR PQVRRVLYQL AAGENVYQEG IHPNKIKRAV DHLISRAVIE KTARGSYVFV EPMFQQYILQ QFQ
|
| |