Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1980 |
Symbol | |
ID | 3831162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2063304 |
End bp | 2064902 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829911 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_430821 |
Protein GI | 83590812 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0106234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCG CCTCGGTAAA AGAGCTGGTA ATAACCAGGG GCACCCAGTA CCGCGGGAGG ATATTTCTTT CCTCACCGGA AGATGGCGTG GATTTAACTT ATGACGCCTA CCTGCTGGCC GTCAGGAGAC TGGAAAAAGC ACTGCTGGCG TTAGGTATGC GCAAGGGAGA AAGGGTGGCC CTTCTCATGG CGAACGGCCT GAATTACGCC GTCACCTTTA CCGGGGTGAT GGCCTCCGGA GGCGTAGTCG TACCCATCAA CCCGCATTTA AAACCGGCAG AGGTGACCCG GCTCCTGGGA GATGCCGGGA CCAGCCTGGT TTTAACTGAC GACGGATGGT ACAGAGTATT TTACCCCCTC CTGAAGGGGT TACCTGTTCG CCGCTTGGAC CTGGGGGTGC AGGGCGGCAG GTTGCTGGCC CTGGAGCTGG CATCCGGGAG TAAGGGGGAT GACAGAGCAG TTGAGGCGTC CCCTCTTGGC AGGAACGACT TAGCCCTCCT CCTGTACACC TCCGGTACTA CCGGAAAGCC TAAAGGGGTG ATGCTAACCC ACGGTAATTT GCTGGCCGAG GCGAGGTATA TCCAGAAAGG ACACCGGTTA ACGCCGGAAG ATACTGCCCT GTGTATCCTG CCCCTGTATC ATATAAATGG GGAAGTTGTG ACCCTGATCA CCCCCATCTT TTCCGGCGGG CGGGTAGTGA TGCCCCATAA ATTCAGGGCC AGCAGGTTCT GGGACTGGGT CCGGAACTAC CGGGTTACAT GGTTCAGCGC CGTTCCCACC ATCCTGTCCA TCCTTCTTTC CCATCCTCTG CCGGATAGAT CGGCCCTCTC TTCTTTGCGT TTTGCGCGCT CCGCCTCGGC ACCTTTACCG GTAGCCGTCC TGCGGGAATT TGAAGCCCGG TTCGCCGTCC CTGTTATCGA GGCTTACGGC CTGTCGGAAA CCGCCAGCCA GGTAACCACC AATCCCCTGC CCCCGGCGGT GAGAAAGCCG GGTTCCGTGG GGCTGCCTGT TGGCAATCAG GTACGGGTGG TGAACGAAAA TGGAGAGACC GTACCTGCCG GTGTCACCGG CGAAGTCGTA GTTCGCGGGG AAAATGTCTG CCGGGGTTAC TTTCATAATG AAGAGGCTAC TGCCGCTTCT TTCAAAGGAG GCTGGTTTTA TACCGGCGAC CTTGGCTACC TTGATGCCGA TGGGTACCTG TTCCTTACCG GACGGCGCAA AGAACTTATC AACCGGGGTG GGGAGAAGTT TTCTCCCCGG GAGATCGACG AGATCTTATA CCGTTTACCC GAAGTAGAAT TAGCGGCAGC AGTAGGTGTC CCCGATCCCC TCTACGGTGA AGAGGTGGTA GCCTTCATCC AACTGCGCCC GGGAAAAAGC CTGGCGGAAG ATCGGGTAAT ATCCTTCTTA AGAGATTACC TGGCGGATTT TAAGGTCCCC CGGGAGGTCA TCTTTATCCG GGATTTTCCC CGGGGGCCGA GCGGAAAGAT TCAGCGCCTG AAGCTGGTGG ACCTGTATCT TAAAAAATTC CAGGGAGCCG CCCATGGGGC TGGGGCTGGC ACCCGCCCCA TAAATGGTGA GGAGGTTGCT AAAAGATGA
|
Protein sequence | MEFASVKELV ITRGTQYRGR IFLSSPEDGV DLTYDAYLLA VRRLEKALLA LGMRKGERVA LLMANGLNYA VTFTGVMASG GVVVPINPHL KPAEVTRLLG DAGTSLVLTD DGWYRVFYPL LKGLPVRRLD LGVQGGRLLA LELASGSKGD DRAVEASPLG RNDLALLLYT SGTTGKPKGV MLTHGNLLAE ARYIQKGHRL TPEDTALCIL PLYHINGEVV TLITPIFSGG RVVMPHKFRA SRFWDWVRNY RVTWFSAVPT ILSILLSHPL PDRSALSSLR FARSASAPLP VAVLREFEAR FAVPVIEAYG LSETASQVTT NPLPPAVRKP GSVGLPVGNQ VRVVNENGET VPAGVTGEVV VRGENVCRGY FHNEEATAAS FKGGWFYTGD LGYLDADGYL FLTGRRKELI NRGGEKFSPR EIDEILYRLP EVELAAAVGV PDPLYGEEVV AFIQLRPGKS LAEDRVISFL RDYLADFKVP REVIFIRDFP RGPSGKIQRL KLVDLYLKKF QGAAHGAGAG TRPINGEEVA KR
|
| |