Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1842 |
Symbol | |
ID | 3831702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1900313 |
End bp | 1901323 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829773 |
Product | hypothetical protein |
Protein accession | YP_430685 |
Protein GI | 83590676 |
COG category | [S] Function unknown |
COG ID | [COG5660] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTGCA GTCAGTGCCG GGAACTCATA TCACCATACC TGGATGGAGT CTTGAGCGAA ACAATACAAC GGGCCCTGGA GAACCACCTT AACTCCTGCC CGGCCTGCCG GGAAGAACTG GAGGCTATGG GGCAGACAAT CGAGATTATC CGTGCCTGGT CCGAAGAAGA ACTCGACCTG CCACCCGGTT TTGAGGAACG CCTGCGCTCA CGCCTGGAGG AGTGCCGGCA GCCGTGGTAC CGACGCCTCT CCCGGAACTG GCTTTCCCTG GCGGCAGCGG CGGCCACTAT CATGGTGGTA GCAATTACGG CCCGGGCGGA TTACCTCCAC CTGGGTTCCT CCAGGCAAAT CGCTGTCCCC CATGAGAAAC AGGTGCAGGA ATTGGCCATG ACCCGGGGAG ACCAGCAGGT GACTCCCCTC AAGGCTCTAC CACCGGTTAC CTCAACGGAT GCCCCGCAGC AATCAGCACC GAAAGTAAAG GTAAAAGCAG CTACTACCTC GGTACGATCT GCAGTGAGGA ATCTGGAGAG TTCCCACCCC GATCCGGAAC AACAGCAAAG GAAAATAGTC CCTGGAGGGA CCTTCAACCT CAATTCCCGG GGCAGAGCAG AGCGAGCAGC TCCGGAGCAG CAGACCGGGG GACAATCAGG AAAGGGTCAG CCGGACCAGG ATAAAGATAA GGGAAAGGAG AAGGGGCCGG GTCAATCCCG TACTGTACTG GAAGCAGGGA AGAAAGAAGT TACCCCGAGG GCTGGCGAGG GGGTGGCAGG GGGAACGTCA ACCATTGCTG GCGATGGGCC GGGAACCGTC AAGACCCCGG CCGGTGATGG GAAAGAGGTC CCACCTTTAC CACCGGCCGG TGGGAAGGCA ACGCTCCAGG ACCTGACGCC AGGGGTTGGG CGGCAAAACT CGGCAGCGTC TCCGGATAGC GACCTGCAAA ACCGGACTCT TACCCAGCCA CCCCCCGCAC CTGTTGCCCC GGCAACTATC CCTAAACCGC CCTCGCCTTG A
|
Protein sequence | MNCSQCRELI SPYLDGVLSE TIQRALENHL NSCPACREEL EAMGQTIEII RAWSEEELDL PPGFEERLRS RLEECRQPWY RRLSRNWLSL AAAAATIMVV AITARADYLH LGSSRQIAVP HEKQVQELAM TRGDQQVTPL KALPPVTSTD APQQSAPKVK VKAATTSVRS AVRNLESSHP DPEQQQRKIV PGGTFNLNSR GRAERAAPEQ QTGGQSGKGQ PDQDKDKGKE KGPGQSRTVL EAGKKEVTPR AGEGVAGGTS TIAGDGPGTV KTPAGDGKEV PPLPPAGGKA TLQDLTPGVG RQNSAASPDS DLQNRTLTQP PPAPVAPATI PKPPSP
|
| |