Gene Information |
Locus tag | Moth_0957 |
Symbol | |
ID | 3832842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 989346 |
End bp | 990344 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637828886 |
Product | hypothetical protein |
Protein accession | YP_429815 |
Protein GI | 83589806 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
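The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 989346
END_BP = 990344
GENE_LENGTH_BP = 999
PROTEIN_LENGTH_AA = 332


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates vs. Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # Gene Length vs. Protein Length
```

Under these assumptions both comparisons agree with the record: 990344 - 989346 + 1 = 999 bp, and 999 / 3 - 1 = 332 aa.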
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0898433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCAC AGGGAAAATC CATAAGGCAA TGGCGGGTAG GTTCCTTTTC CACGGCAGTG GCGCTGATTT TGTTTGGCGT GGCTACCCTG GTGTACCGGC ACGATCCGGG CTATGTTCCC CAATTAATAA ATAACTGGTG GCCGGTGGTG CTCATATTGC TGGGTTTGGA AATTTTTCTG GCGGGTTATT TATGCCGCGA TGATGTACCC CGGCTAAAAT ATGATTTTTT GGCCTTAATT CTTGTCCTCG TGATAGTAAT TATCAGTCTG GTAATTTATG TTTTCAACAG CAGCGGTTTG GCTGCCAGGT TGACCCAGGC CCTTGGCGCG GCAAGCTATA CGGTTGATCT GCCGGAAAAA AGGCTGGCTG TGGCGGAAGG AGTGAAGAAA ATTGCAGTCC AGGGGCCGGG ATGCGAGTGG CATAACCTGC AGCTACGTTC GGGTAGTTCT GACGATCAAA TAGTATTTTT TGGCCAGGCG ATGGTTACAG CTACATCTGA AGCGGAAGCC AGGGAGATGG CTCGCAATGC TGGCTTATTT TCCCACCAGG TTGGCGATAC CCTCTTTTTG GAATTGAGGG AGGTACCTTT ATACCAGGGA TTTGGGCTTT CTGCCAGGAT ATCAGGCAGC ACCTTGATTC TCCCTGCTGG TTTGGCAGTA GAGGTAGCTG GCGATAAGAA TGGAGCAGGA CTGGAGATGG TTTTGGATTA CCTGGAAGCA GACTGGTCTA TTACTAACTC GGGGCCTATC CGGGTGACGT TAAATCGGGG TATAAACGTT CAAATTGAAG CTTGTTCCCT GGCTGCCGAA ATGTTTCAAG GAAATATTAA CTGGAAGATT CAAAACAGGT TAACTTCTAC AGCCTCCTCG GATCAGAAAT TACCTGTGCC TACGTACAAT CAATCGGAAC CAGAAGTGAT GGCCCGGGCT ACTGTTGGCG AAGGCGGGCC GCTTTTGAAA CTGTTTTCCC GGGATGCGAT TATGATAAAT ACCCGGTAA
|
Protein sequence | MQSQGKSIRQ WRVGSFSTAV ALILFGVATL VYRHDPGYVP QLINNWWPVV LILLGLEIFL AGYLCRDDVP RLKYDFLALI LVLVIVIISL VIYVFNSSGL AARLTQALGA ASYTVDLPEK RLAVAEGVKK IAVQGPGCEW HNLQLRSGSS DDQIVFFGQA MVTATSEAEA REMARNAGLF SHQVGDTLFL ELREVPLYQG FGLSARISGS TLILPAGLAV EVAGDKNGAG LEMVLDYLEA DWSITNSGPI RVTLNRGINV QIEACSLAAE MFQGNINWKI QNRLTSTASS DQKLPVPTYN QSEPEVMARA TVGEGGPLLK LFSRDAIMIN TR
|
| |
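The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, translate a nucleotide fragment, and compute a GC fraction. It is an illustration rather than the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits), and it includes only the first three blocks of the gene sequence for brevity. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_fragment = clean("ATGCAATCAC AGGGAAAATC CATAAGGCAA")
protein_fragment = clean("MQSQGKSIRQ")

print(translate(gene_fragment) == protein_fragment)
print(f"GC fraction of fragment: {gc_fraction(gene_fragment):.2f}")
```

Applying the same functions to the full gene sequence would allow the translation to be compared against the full protein sequence and the GC fraction against the GC content field.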