Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1261 |
Symbol | |
ID | 3833056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1304850 |
End bp | 1306490 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637829197 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_430118 |
Protein GI | 83590109 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0532703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000308481 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGTTT CGGGTTATGG TAAATGGTTC GAGTTGGCTG AAAAGGATTT AAAAACACGC AAGTTCAATG GAGTTGAGTA CCGGTATTAC GACCACGGTA CGACTAATTT GTGGGAAGAT TTTTCCCGTT CTGTCAGCAG GCAACCTGAT AAAACGGCGC TACGTGCAGG AAATAGTTCT TTAAGTTATA GGGAAATGCA GGAAGCTTCA CGGCGACTGG CATCCGGCTT GTGGAATAAG TATCAGGTTA AAAAAGGTGA TGTGGTTGCC CTCTTGCTGG TGAATAGTAT CGACTTCTGC CTTAGCTTTT ATGCAGCAAT GTATCTGGGA GCCATAGCTT TACCCCTGAG TACCAAACTT AAAGCTACTG AACTTAATTT TATGCTCAAG GATTCGGGGG CTAGGATCTT AATAACTAAC CCGGAGTGGC TACCCAACGT CTTGCCTTTT ATCAAAGAAA CAAGTATTGA ACAAATAATT GTTACCGAAC CGATTACCGA TAAAATTAAT ATCAATTTTG GTAACGCTTC CATAATAACC TTAAAGAATG TTTTTCGCGA AACGGAAATT CCACCGGCGC CTGTCGACGA ACAAGATGGC GCGGTAATCA TGTATACCTC GGGAACTACC GGTAAACCCA AAGGCGCTTA TCTTACCCAT TTTAATCTCC TCCAAAGTGT TATCAGCTAT GAGCGCACCC TGCAGTTAAC GGCAGCAGAT AGTACCCTTA TTGCAGTTCC AATTTTCCAT ATAACAGGTT TAGCTGCTCT CTTTTTGCTT TTCATGCATA TTGGCGGTAC AGTATATCTG TTACCCTTTT TCAACACCCA AGAAGTCCTC AATATTTTAA CATGTTATTC TATTACTTTC TTCCATGCCG CTCCCACAGT CTATATCATG CTCCTCGAAC AAGGTTACAG GCATTATCAA TTACCTGATT TACGTAAGGC AGCCTGTGGC GGGGGGGCAA TCCCGATAGA AACGATAAAA AAAATTAAGA CATGGATACC CCAACTGGAG TTTCATACTG TTTACGGCCT AACGGAAACC AGTTCCCCGG CAACCTTATT CCCGGGTGAC GTAGCCACAA GTCCAAGGAT AGGCACTTCC GGGATACCAA TTCCAGTAGT CGATTGTAAA GTAATTGACG CTGAAGGGCG GGATATTACC GGTAAAGGGG TTGGCGAGCT TTGTATCCGG GGACCCGTTG TGACCCAACA ATACTGGAAT AATGATGAAG CTACCACCAG GGCTTTTCAA GGAGGGTGGT TCAGGACAGG GGATGTAGCC CGGATAGATG GGGATGGTTA TGTTTATATC ATGGATAGGT TAAAGGACAT GATTAATCGC GGCGGTGAAA AAATTTATTC CCTGGAAGTT GAAAATGTCA TCTATTCCCA CCCGGGTGTA AAAGAAGTTG CGGTAATTGG TTCCGTGGAT CCTATTTACG GGGAAGTAGC CAGGGCGGTA GTTGTTCCCA ATAATCATGG TAGTAGCATT ACAGGGAGAG AGATTCAAGA CTGGGTAAGG GCGAGACTAG CTAAATATAA AGTACCGCAA TATGTCAATT TTGTTAACGA GTTGCCGAAG AATGCCAATG GCAAAATTGA TAAAAAGCTT CTCCGGCAAC AGTTTCAATA A
|
Protein sequence | MAVSGYGKWF ELAEKDLKTR KFNGVEYRYY DHGTTNLWED FSRSVSRQPD KTALRAGNSS LSYREMQEAS RRLASGLWNK YQVKKGDVVA LLLVNSIDFC LSFYAAMYLG AIALPLSTKL KATELNFMLK DSGARILITN PEWLPNVLPF IKETSIEQII VTEPITDKIN INFGNASIIT LKNVFRETEI PPAPVDEQDG AVIMYTSGTT GKPKGAYLTH FNLLQSVISY ERTLQLTAAD STLIAVPIFH ITGLAALFLL FMHIGGTVYL LPFFNTQEVL NILTCYSITF FHAAPTVYIM LLEQGYRHYQ LPDLRKAACG GGAIPIETIK KIKTWIPQLE FHTVYGLTET SSPATLFPGD VATSPRIGTS GIPIPVVDCK VIDAEGRDIT GKGVGELCIR GPVVTQQYWN NDEATTRAFQ GGWFRTGDVA RIDGDGYVYI MDRLKDMINR GGEKIYSLEV ENVIYSHPGV KEVAVIGSVD PIYGEVARAV VVPNNHGSSI TGREIQDWVR ARLAKYKVPQ YVNFVNELPK NANGKIDKKL LRQQFQ
|
| |