Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1813 |
Symbol | |
ID | 3830731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1872523 |
End bp | 1873680 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829740 |
Product | peptidase M24 |
Protein accession | YP_430656 |
Protein GI | 83590647 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000026363 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGC TGGAATACAT AGAATATAAG AATCGCCTGA GGCGTTTCCA GGAGTCCCTG CAAGCTCTGG ACCTGGACGG GGCCCTGGTC TACCAGGCCG CCGACCTGTA CTACCTGACA GGAACGGCCC AGAGCTGCCA CCTTTTCGTT CCGGCTGCCG GGGAGCCCCT CCTCCTGGCC TACCGGGATT TTGAGCGGGC ACGGGAGGAA TCCGCCTGGC AGGTCAGGCC CCTGGGTAGT TTTAAGGATA TCCCAGGTCT CCTGGCGGAG GCCGGGTTGA CCGGGCTGCG GCGCCTGGGA CTGGAGCTGG ACGTCATTCC CTTAAGCCTT TTCCGGCGCT ACGAGGCCCT CTTGCCCGGC GTTCAGTGGG CCGACATCGG CCAGGTCCTG CGACGGCAAC GAATGGTCAA ATCGCCGGCC GAACTGGAGG CCCTGCGGTG GTCCGCCGCC AAACACGCAG AGGTCTTCCG TTACATAACT GCCAGGATCC GGCCTGGTAT GACAGAGCTG GAGATTGCCG CCGAGTTTGA AAGCTATGCC CGCCGCCTGG GCCACCAGGG CGCCAAGCGC TTCCGGGGTC AGGAGCAGGG CATGATTCCG GGCCTGGTTG CCGCCGGGGC CAACTCCGCC CGGACCTCCT GCTTCAACCT GCCTCTGGCC GGCCTGGGCC TCTCACCCCT TTACCCTATG GGAGCCAGCC AGCACGTCTG GGAGGAAGGT GAACCCCTCC TTATCGACTA CGCCGGGGTT TACGGCGACT ACACCGTCGA CCAGACCAGG ATCTACCTTG GTAAGGGGGT ACCGGAAGAC TTACGGCAGG CCCAGGAAGT GGCTATGGAG ATTGCCAGCC GGGTGGCGGA AGAGGCCCGG CCCGGAGTAA CGGCCGGCGC CCTCTACGAC CTGGCCGTGG CCATGGCGGC CCGCGCTGGT TTGCAGGAGC ACTTTATGGG CTACGGCCGG CAGGTGACTT ACATCGGTCA CGGCGTCGGC CTGGACCTGA ACGAGTGGCC GGTGATAGCC AGGGGGGACA AGACTGTCCT GGCCGCGGGC ATGGTCTTCG CCCTGGAGCC CAAGTTTGTC TTCCCGGGGA TGGGCAGCGC CGGGGTGGAG GATACGTATG TGGTTACTGA TAGGGGAGCG GAAAAGTTGA CATATTAG
|
Protein sequence | MAKLEYIEYK NRLRRFQESL QALDLDGALV YQAADLYYLT GTAQSCHLFV PAAGEPLLLA YRDFERAREE SAWQVRPLGS FKDIPGLLAE AGLTGLRRLG LELDVIPLSL FRRYEALLPG VQWADIGQVL RRQRMVKSPA ELEALRWSAA KHAEVFRYIT ARIRPGMTEL EIAAEFESYA RRLGHQGAKR FRGQEQGMIP GLVAAGANSA RTSCFNLPLA GLGLSPLYPM GASQHVWEEG EPLLIDYAGV YGDYTVDQTR IYLGKGVPED LRQAQEVAME IASRVAEEAR PGVTAGALYD LAVAMAARAG LQEHFMGYGR QVTYIGHGVG LDLNEWPVIA RGDKTVLAAG MVFALEPKFV FPGMGSAGVE DTYVVTDRGA EKLTY
|
| |