Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0123 |
Symbol | |
ID | 3830780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 120040 |
End bp | 121470 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828057 |
Product | putative aminopeptidase 1 |
Protein accession | YP_429005 |
Protein GI | 83588996 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1362] Aspartyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.870895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAA GAGAAGAAAG CAGGGGTAAG AAACTACAAA AAGAACTGGG TCTGCCCCGT AAGAGCGCCT GGGAGCGCCT GGAGCCAGGT ACCAGGGAAA AGGTCTTTGC CTTTGCCGAG GGGTATAAGG GTTTCTTAAG CCGGGCCAAG ACCGAGCGAG AGGCCATTCG GGCCGCTCTG GAGGCTGCCC GGGGTCGCGG CTTTATCTCC CTGAACGATA TTCCCTCCGG CCGGGGGTTG GATCCCGGCA GCCGCTTCTT TTTAACCTGG CGAGATAAGG TCATGCTCCT GGGGATAGCC GGTACTGCTC CCCTGGAAGA AGGGATCAGG CTGGTCGGGG CCCATGTCGA CGCCCCGCGC CTGGACCTGA AACCCATGCC CCTGTATGAA AAGGACGGTC TGGCCATGTT TAAAACCCAC TACTACGGGG GGATCAAGAA GTACCAGTGG ACCGCCCTGC CCCTGGCCCT CCATGGCGTG ATCATGCTCC GCGACGGTCG CCGGGTGGAA GTGGTAATCG GCGAAGACGC CGGCGATCCC ATCTTTAGCG TCAGCGACCT CCTGCCCCAC CTGGCCAAGG AGCAGATGAA AAAGAATATG GAAGAAGCCT TAAGCGGCGA CGACCTGAAC CTGGTGGTTG GCAGCTTGCC CTTTGAGGGT GAAGAAGAGC TCAAGGATAA GATCCGCCTG GCTATCCTCC AGCTCCTGAA CCAGCGTTAC GGCCTGGTGG AAGAAGACTT CATCACCGCC GAACTGGAAC TGGTACCGGC CGGTCCGGCG CGGGACCTGG GTCTGGATCG TTCCCTGGTA GGGGCCTACG GCCAGGACGA CCGGGTGTGC GCTTATACCG CCCTGCAGGC CGTCCTGGAA CTGGAGAACC CCGGCCATAC AGCTCTGGTA CTCCTGGTCG ATAAAGAAGA GATCGGCAGT ACCGGCAACA CCGGCGCCCA CTCCCGCTTC CTGGAGTATG CCCTGACGGA GCTGGCGGCC CGTATGGGTT CGGCCAGTAT CGTCAATGTC GGCCGGATAA TGGCCAATTC CCAGGCTATA TCCGCCGACG TAACCGCCGG GGTGGATCCT ACCTACGAAA ATTACTTTGA TATGTATAAC GCCTCTTTCC TGGGATACGG CGTAGTCCTC AATAAATACT CAGGCTCCCG GGGTAAATAC GATGCCAATG ATGCTAGCGC CGAGTTCATG GGCCGGTTAA GGGATATCTT TAACCACAAC CGCGTCATCT GGCAGAGTGG CGAACTGGGT AAAGTAGACG CTGGCGGCGG GGGCACCATT GCCAAGTTCC TGGCCTATTT CGGCCTGGAT GTCGCCGATT GCGGTCCGGC CCTCCTCTCC ATGCACGCCC CCCTGGAGAT CGCCAGCAAA GTAGATATCT ATATGGCTTA CCGGGCCTAT GGCGCCTTCC TTGCCAGTTA A
|
Protein sequence | MAEREESRGK KLQKELGLPR KSAWERLEPG TREKVFAFAE GYKGFLSRAK TEREAIRAAL EAARGRGFIS LNDIPSGRGL DPGSRFFLTW RDKVMLLGIA GTAPLEEGIR LVGAHVDAPR LDLKPMPLYE KDGLAMFKTH YYGGIKKYQW TALPLALHGV IMLRDGRRVE VVIGEDAGDP IFSVSDLLPH LAKEQMKKNM EEALSGDDLN LVVGSLPFEG EEELKDKIRL AILQLLNQRY GLVEEDFITA ELELVPAGPA RDLGLDRSLV GAYGQDDRVC AYTALQAVLE LENPGHTALV LLVDKEEIGS TGNTGAHSRF LEYALTELAA RMGSASIVNV GRIMANSQAI SADVTAGVDP TYENYFDMYN ASFLGYGVVL NKYSGSRGKY DANDASAEFM GRLRDIFNHN RVIWQSGELG KVDAGGGGTI AKFLAYFGLD VADCGPALLS MHAPLEIASK VDIYMAYRAY GAFLAS
|
| |