Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0136 |
Symbol | |
ID | 3830793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 130880 |
End bp | 131968 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828070 |
Product | hypothetical protein |
Protein accession | YP_429018 |
Protein GI | 83589009 |
COG category | [S] Function unknown |
COG ID | [COG1415] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000501513 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.643444 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACCG GTACCGCCAG CCTGCCCCTC CACGGCGGCC ATTGCCCGCC CTGGCTCTTC GAGCGCATGC AGCGCCTGGG GCCGGCCATC CTGGAGGTAA TTGTGCAGGA ATACGGCCCC CAGGAGGTTT TAAGACGCTT AAGCGATCCC CACTGGTTCC AGGCCTTCGG CTGTGTCCTG GGTTTTGACT GGCACTCCTC GGGCCTGACC ACCACCCTTT GCGGCGCCCT CAAAGAAGGT TTGCGCGGCC GGGAAAAGGA TCTGGGCTTG GTCATAGCCG GCGGCAAGGG CCGTACCTCC CGCCAGACGC CCCATGAGAT CGAAACGGCG GTCGACAGAT TGGCCCTGAC TTCCCTCGAG CCTGAAGATC TGGTTTATGC CAGCCGCATG GCGGCCAAGG TCGATAACAC CGCCCTCCAG GACGGCTACC AGCTCTACCA CCACGTCTTT ATCTTCACCT TTGACGGCCA GTGGGCCGTC GTCCAGCAGG GGATGAATGA AACCAGCCGC CTGGCCCGGC GCTACCACTG GCTGGGGGAA GGGATGCAGG ACTTCGCCTG CGAGCCCCAC GCCGCCGTCT GCTGTGACGC CAGGGAAACG GCCCTGAACA TGGTAGCCAG GGAAAGCGAG GCTTCCCGCC AGGTGGTAAC CGAACTGGTA CGCCAGCAAC CGGCGAAGGT GGTAGCCGAG TTTAGCCGCA TCCTGGAAAA GGACCTCCCC AACCTGGCCC TGCCCTGGCG CCACGACGTG CCCCGGGCGG GTTACCTGAA TAAAGCCCTG TTAAAGGTTT ACGACGTCCA GCCCCGGGAC TTCGCCGGTG TCCTGGGGAT CGAAGGAGTG GGTCCCAAGA CCATCCGCGC CCTGGCCATG GTGGCCGAAG TGGCCTATGG CGCGCCGGCC AGCTTCCGGG ACCCCGTTCG CTACAGTTTT AGCCACGGCG GCAAGGACGG CCATCCCTAC CCCGTCGACC GCCAGGTATA CGACCGCACC ATTAACGTCC TGGAACAGGC CCTGGCGGCC GCTAAAATCG GTCGGACCGA TAAAATACAG GCTTTAAAAA GGCTGAGCAG ATTGGCTAAT GGAAGTTAA
|
Protein sequence | MRTGTASLPL HGGHCPPWLF ERMQRLGPAI LEVIVQEYGP QEVLRRLSDP HWFQAFGCVL GFDWHSSGLT TTLCGALKEG LRGREKDLGL VIAGGKGRTS RQTPHEIETA VDRLALTSLE PEDLVYASRM AAKVDNTALQ DGYQLYHHVF IFTFDGQWAV VQQGMNETSR LARRYHWLGE GMQDFACEPH AAVCCDARET ALNMVARESE ASRQVVTELV RQQPAKVVAE FSRILEKDLP NLALPWRHDV PRAGYLNKAL LKVYDVQPRD FAGVLGIEGV GPKTIRALAM VAEVAYGAPA SFRDPVRYSF SHGGKDGHPY PVDRQVYDRT INVLEQALAA AKIGRTDKIQ ALKRLSRLAN GS
|
| |