Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2088 |
Symbol | |
ID | 3831838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2180515 |
End bp | 2181921 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830014 |
Product | hypothetical protein |
Protein accession | YP_430924 |
Protein GI | 83590915 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT GGTTTCAGGC GTTGATACTG GCTGTGGGCC TGATCCTGGC CCTGGCGCCG GCCGCCCTGG CCGGGACGGT CACCGGCGAC CAGCTCCGCC AGGTATTTCC CCAGGCTCCG GCGCCGGGGA AGACCATAAC CCGGGGTGAG TTTGCCGCCC TGCTGGCCCG GGCGGCGGGG ATGCAGGTGA AGGGTAACCA GGCTGATATC CAGGGGGATG CCTGGTACAG CCCGGCAGTA ATGGCTTTGA AGGAGAAGGG CGTCATCAGG GGCTACCCCG ACGGCGGCCT GCACACCGAC CAGCCGGTCA GCCTGCTGGA AGCGGCGGTG ATGGTTTCCC GGGTCCTGGG GTTGCCGGAC GGTGTCGCCG CGCCGGAGGT AAAGGGGTCC CTGGGGCGGG AGAGCTGGGG TTATACCCCC TATGCCTGGC TGGTACGGGC CGGCCTGCTG CAGCCCGGCC AGGACGCCGG GGGATTCCTT ACCGTGGACG AAGGCATTGC TTTCCTGGCC GGGGTTTTTG GCAGCGATCC GGAGGCGGAA AAGATTGCTC AGGCCGCCCA GCAGGCCCAG GCTAAGGTCA AAGACCTGAA ATTCGCCGGC AGCATGGCTA TAAGCGTGCG CCTGCGGCCG GGGGTGGCCG GGGAAGTGCC GGCAGTTTTT TCCATGCAGG GCAATATCAT GCAGGGCAAT ATCGAGAGCG AGTTCAGCTA TCCCCTGAGC CTGCACCAGA AGGTGGACAT GACCCTTCGC TTGCCGGTAG AGAAACTGCC CGGTAAAGAC CTGTCAACGG GCGGTAAGAT GCAGATGACC ATGGAACAGT ACCTGGTGGA CGGGACGATG TACCAGAAGG TAGAGGCTCC CGGTATGGAA AAACCCCAGT GGATGAAGCT GCCCAAAGGA GCCCTGCCGG ACCTGGAAGC CTTGGTGGAA CAGAGCAGGA ACTCGGCAGG GTTACCGCCG GGGCTAAAGG ACAGCTTCCA TTTCCAGTAC TTGGGTGAGG GTATAGAGAA CGGGCATAAG GTTCACCGTA TCGCCTACTA CGGCCGGATT GACGACTGGC AGGCCCTGAT AAAGGCCCTG CCCGGAGGGT TGACCACGGA GATGGAGCAG GCCCTGAACC AGGCCGGCGG CGTCTTGAAG TCCATTTCCT TCTGGGGTGT GGAAGCCATC GGCGTGGAGG ACAATCTTAC TTATGCCTCG GAAATGACCA GCCTGGTCGC TTTTGCGGAT AAATACCAGG AAGAAATTGT GCCCCTGGAA ACAATGACCA TCAACGTGAA GGTTACGGAT TTTCAGTATA ACAGTGGCGT AAAGATCCAG GTGCCTGCCG AGGCCCTGAC GGCACCGGAA GTACCCCTGA CACCCTCACA ACCGGATGCA AAATCATCCG GGAGCCAGCA GATGTAA
|
Protein sequence | MKKWFQALIL AVGLILALAP AALAGTVTGD QLRQVFPQAP APGKTITRGE FAALLARAAG MQVKGNQADI QGDAWYSPAV MALKEKGVIR GYPDGGLHTD QPVSLLEAAV MVSRVLGLPD GVAAPEVKGS LGRESWGYTP YAWLVRAGLL QPGQDAGGFL TVDEGIAFLA GVFGSDPEAE KIAQAAQQAQ AKVKDLKFAG SMAISVRLRP GVAGEVPAVF SMQGNIMQGN IESEFSYPLS LHQKVDMTLR LPVEKLPGKD LSTGGKMQMT MEQYLVDGTM YQKVEAPGME KPQWMKLPKG ALPDLEALVE QSRNSAGLPP GLKDSFHFQY LGEGIENGHK VHRIAYYGRI DDWQALIKAL PGGLTTEMEQ ALNQAGGVLK SISFWGVEAI GVEDNLTYAS EMTSLVAFAD KYQEEIVPLE TMTINVKVTD FQYNSGVKIQ VPAEALTAPE VPLTPSQPDA KSSGSQQM
|
| |