Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0381 |
Symbol | |
ID | 3832625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 385892 |
End bp | 387322 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637828318 |
Product | carboxylyase-like protein |
Protein accession | YP_429258 |
Protein GI | 83589249 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTATG CTGATCTGCG TCAATTTCTA GAGCGCTTGG AAGCTGAAGG ACAACTCGTC AGGGTAAAAG AACCGGTTCA GGCTGAACCA GACTTAAGCT CTATTGGAAG GGCAGCAGCT AATTTAGAAC AGGCACCAGC AGTACTAGTG GAAAACATTA AAGGTTATTA CCAGAAAGTT GCTTTGAACG TGCATGGTTC CTGGGCCAAC CATGCCCTCA TGTTCGAGCT ACCCAAAGAT ATGCCGGTAA AAGAACAGTT TATGGAAATT AGCCGCCGAT GGGATAATTT TCCTGTTCCT GTAAGACGGG TGTCCAGCTC CCCGGTTAAG GAGATAAAGC ATCACGAACA GATAAACCTG TTTGAACTTA TGCCTCTATT CAGGGTAAAC AAGTTCGATG GGGGCTTCTA CCTCTCCAAA GCGGCAGTGG TTAGCCGCGA TCCCGAGGAT CAAGATAACT TTCACAAGCA GAATGTTGGC ATTTACCGTA TCCAGGTAAA AGGTAAAGAT CTTCTCGGAA TTCAAGTACT CCCCTTCCAC GATATTGCTA TCCACTTACG CAAGGCAGAG GAGCGCAATC AGCCATTGCC AGTTGCTATT GCTGTAGGCA ATGATCCGGT ACTGACTTTC GTTGCCAGCA CGCCAATGGC CTATGAAGAG TCAGAATACG AAATGGCCGG GGCCTTGCGC GGTGAACCAT TTGAAGTTAT TAAGGCAGAA ACGTCTAACC TTGATGTCCC GGCAGGAGCA GAACTTATCC TGGAAGGCGA AATTCTTCCC CGCCGTCGTA GTCCGGAAGG ACCGTTTGGC GAATTTCCTG GAAGCTACTC GGGCGTTAGA ATGCAAGCGG AAGTAAAGAT TCATACGGTT ACCCACCGGG TGGACCCGAT TTTCGAGAAC CTGTATCTTG GTGTACCTTG GACGGAAATA GATTATCTTC AGGCACTGAA CACCAGCATT CCTCTATATA AACAAATTAA AGCCTCAATG CCTGAGGTAG TAGCAGTAAA TGCTATGTAT ACGCACGGTA TAGGGGTGAT TATTTCTACC AACTGCCGTT TTGGCGGGTA CGGTAAAGCA GTAGCAATGC GGCTACTGTC CACCCCCCAC GGCATGCCCT ATTCCAAAAT TATCATTGTG GTAGATGACT TTGTCGATCC CTTTAATTTA GAGCAGGTCA TGTGGGCATT GACAACCAAG GTGCGCCCTG ATAAAGACGT AATACTGATT CATAATGCTC CTGGTATGCC GCTGGATCCT TCATCCGATC CACCAGGAAT GCATACAAAA TTGATTATTG ATGCTACTAC TCCGGTACCG CCAGACGTTG TGAGCAGGGA GGTGGAGTTA GTAGATACCC CTGCTAAGAC GGCTTTGTGG GAAAAGATAC TTAAAGAACT CCATCGAAAT AAAAAAGGAG GGAGCATGTA A
|
Protein sequence | MPYADLRQFL ERLEAEGQLV RVKEPVQAEP DLSSIGRAAA NLEQAPAVLV ENIKGYYQKV ALNVHGSWAN HALMFELPKD MPVKEQFMEI SRRWDNFPVP VRRVSSSPVK EIKHHEQINL FELMPLFRVN KFDGGFYLSK AAVVSRDPED QDNFHKQNVG IYRIQVKGKD LLGIQVLPFH DIAIHLRKAE ERNQPLPVAI AVGNDPVLTF VASTPMAYEE SEYEMAGALR GEPFEVIKAE TSNLDVPAGA ELILEGEILP RRRSPEGPFG EFPGSYSGVR MQAEVKIHTV THRVDPIFEN LYLGVPWTEI DYLQALNTSI PLYKQIKASM PEVVAVNAMY THGIGVIIST NCRFGGYGKA VAMRLLSTPH GMPYSKIIIV VDDFVDPFNL EQVMWALTTK VRPDKDVILI HNAPGMPLDP SSDPPGMHTK LIIDATTPVP PDVVSREVEL VDTPAKTALW EKILKELHRN KKGGSM
|
| |