Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0833 |
Symbol | |
ID | 3831530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 863966 |
End bp | 865033 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828763 |
Product | hypothetical protein |
Protein accession | YP_429693 |
Protein GI | 83589684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.637029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTAC CCGGCAACTT CCAGGCGGCA GGCATCGGCA GCCTGCCCTA CCTCGAGCCT GGACCGGCCC TGGACCTTAT CTTTAAAACC ACCCCTGTCA TCCCCCACTG GCCTCAACTA CCCAAACGGG GCCACCAGGA ACACTTTGTT TACCAGAGCC TGGCCCCCCT GGTGCGCCTT GGCTTGATTA AAGAAAATCC CGGTGGGATG CCAGCCTTTA CCGATGCTGA CGCCGGGTGG ACCGACAGGT TAACGGATTT CTATAGCCTA TACCTGGAGG CCGAGGCCGG CGATGGGGAG GCCCTGGCAG CCTTTGCCAT TCCGCGGGAA GCCGGGATTG GTTTTTATGC CCTGCTGGAA TACCTGGAGC AAAAGGGCCC CGGCGAAGCC CGCTTCTTAA AGGGGCAGGT TGCCGGACCT ATTACCGCCG GCCTATATTT GACCGATAGT GCTGGCAGGA GTTCCTTTTA CGACCCCCAG CTGCGGGACC TCATTGTCAA AACGACGGCC ATGCAGGCCT GCTGGCAGGC CCGTGAATTG GGTCGCTTTA ACCTTCCAGT CCTGGTGTTT GTCGACGACC CTGCCCTGGC GGCCTATGGT ACCTCCACCC ATGTAGCCCT AAAACGGGAT GACCTCCTGG CGGCCCTGGC GGGTGTCGTA GCCGGTATTG AAGCCGGGGG CGGACTGCCC GGGGCCCATT CCTGCAGCGG GGTGGAGTGG CCCGTCTTTT TTGAAGCAGG TTACCGGATC TTAAGTTTTG ACGCCTATAA TTATTTTACT TCCCTCCAGG TTTTCGCCTC TGATGTGGCT GCCTTCATAG CGCAGGGCGG GGTCCTGGCC TGGGGGATTG TGCCCACCTA TGAACAGGCC TGGCAGGAGA CTCCTGCCAC CCTGGCTGCG AAACTCCAGG AGCAGGTCGG AGAACTGGCC CGGCGGGGTG TGGACCGGGA GCGCCTCTGC CGCCAGGCCC TGGTCACCCC CTCCTGCGGC ACCGGCGTCC TGGAAGAAGA CCTGGCAGAA CATATCTACG GCTTGATGGC AGCTGTCGCC GAAATAATGG GCAGGTGA
|
Protein sequence | MFLPGNFQAA GIGSLPYLEP GPALDLIFKT TPVIPHWPQL PKRGHQEHFV YQSLAPLVRL GLIKENPGGM PAFTDADAGW TDRLTDFYSL YLEAEAGDGE ALAAFAIPRE AGIGFYALLE YLEQKGPGEA RFLKGQVAGP ITAGLYLTDS AGRSSFYDPQ LRDLIVKTTA MQACWQAREL GRFNLPVLVF VDDPALAAYG TSTHVALKRD DLLAALAGVV AGIEAGGGLP GAHSCSGVEW PVFFEAGYRI LSFDAYNYFT SLQVFASDVA AFIAQGGVLA WGIVPTYEQA WQETPATLAA KLQEQVGELA RRGVDRERLC RQALVTPSCG TGVLEEDLAE HIYGLMAAVA EIMGR
|
| |