Gene Information |
Locus tag | Moth_0065 |
Symbol | |
ID | 3830815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 66390 |
End bp | 67670 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637827997 |
Product | hypothetical protein |
Protein accession | YP_428947 |
Protein GI | 83588938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
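The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 66390
end_bp = 67670
reported_gene_length = 1281      # bp
reported_protein_length = 426    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the derived values agree with the Gene Length and Protein Length fields of the record.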
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00237194 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.154252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGGC GTTTGCTTGC CTGCTTTTTA ATTCTACTTT CTTTAAGCCT CGCCGGGTGC CGGGCGGAAA AACAGCCGGT TACTGCGACA GCCAGTTTTA AGGAGTTCAA AAGCAACATT GTAGGGCTTG AATTTTATGT AAACCAACCC TCAGGCGACC CCAAGCAGGT CATCAACCTT AACGACAAGC AACTTGCCCG GCGGTTCCTA GATTTTTTGG GCCAGTTACC AGTTACCAAC CCGCCGCCGG ACTCCTGGAC GGGGGCTCGC GACTTCCTGG CCTTTAAATT TACCCATAAT GGGGAGATTC TGGCCAGCAA GCAGTACCCC TATTACCACC AGGACAACGG CCCCGGTTAC CTTGAGCTGG AAGACGGCTG GCACCAGGTG CCGGCTGCTT TCAATTCCCG GCTGGCCACC CTGGCCCAGT ACCCTGAGGC CACCAGCAAT GTGGACCCGG CCGATGTGGC CTTTCTCAAG CAATACGGCT GGACCATCTT TTATAAAATC AAGACTTACA GCGGCCGCCT GCCGGATAAG CTCCTCCACG AATCAGGAGA ATACCCCGTA GCCCTTTACT ATGCCTACAA CAACGAACTT AGCAAGGACG TCGGCCTGGA CCTGACCCCC TACCTGGGAA AGAATGTCCA GATTAACCTT TACAAACTGG AAGAGCCCCT GCCGGCTTTT ATGAAACCGC GCCAGGACGC CGGCCGGGCC GTAATAGTCA GGGACGGCGA TAAAATTGTC GGTGCCTGGC TGGACGCCGG TGGTCCCGAT GCTTTTGCCT GTTCCTTAAA GGGCCGGCGG TTGGAAGAAA TCACCGGAAA ATCCTGGGGT GAATGGGTCG ATCAGTATAT CGATCACCGC AACGAGCAGG AGAAACTCAT CAGCCAGATG AACCCGGAAA AGGTTATTGA GACTTACTAT GAAGCCATCG ACCATAAAGA CCCGCGGACG GCCCATGCCT GTGAAACCCG CCGCCGCCTG GTCACTTACC TGTTCCGCAA CATGGACAAT AACCGCCTCT ATAATTACTC CTACGCCACC AATGATGCCG ATGAGATCAA TAATATCACC CGGGCCAGGG TCATCCGCAT CCAGCCTTAC CAGGGTCCCG GGCCGGAGCA ACCAGATGTA AAGAAGTATA TGGTCGAGGT TGATATAAAT GTTCGGCGGG TTATTACTTA CGACAGCGGT CGCCAGGTAC GCTTTTTCAC CCTGCGCCGG GAAACACCCA CTACCGGCTG GCGAATCGAC GATATCAGTA CCGGGCCATA A
|
Protein sequence | MFRRLLACFL ILLSLSLAGC RAEKQPVTAT ASFKEFKSNI VGLEFYVNQP SGDPKQVINL NDKQLARRFL DFLGQLPVTN PPPDSWTGAR DFLAFKFTHN GEILASKQYP YYHQDNGPGY LELEDGWHQV PAAFNSRLAT LAQYPEATSN VDPADVAFLK QYGWTIFYKI KTYSGRLPDK LLHESGEYPV ALYYAYNNEL SKDVGLDLTP YLGKNVQINL YKLEEPLPAF MKPRQDAGRA VIVRDGDKIV GAWLDAGGPD AFACSLKGRR LEEITGKSWG EWVDQYIDHR NEQEKLISQM NPEKVIETYY EAIDHKDPRT AHACETRRRL VTYLFRNMDN NRLYNYSYAT NDADEINNIT RARVIRIQPY QGPGPEQPDV KKYMVEVDIN VRRVITYDSG RQVRFFTLRR ETPTTGWRID DISTGP
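The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few blocks by hand. The sketch below is illustrative only: the helper names `clean` and `translate` are hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in which start codons are permitted). It translates the first three 10-base blocks of the gene sequence and the final 21 bases, which are in frame if the sequence has the listed length of 1281 bp.

```python
# Build a codon table from the compact NCBI-style layout (base order T, C, A, G).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the Gene sequence field above.
first_blocks = "ATGTTCCGGC GTTTGCTTGC CTGCTTTTTA"
last_blocks = "GATATCAGTA CCGGGCCATA A"

print(translate(first_blocks))  # MFRRLLACFL
print(translate(last_blocks))   # DISTGP
```

The two results match the first block and the last six residues of the protein sequence in the record. Passing the full gene sequence to `translate` would be the natural next check; a maintained library such as Biopython would be a more robust choice for anything beyond a spot check.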