Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3365 |
Symbol | |
ID | 7873856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3675179 |
End bp | 3677164 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700302 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_002890336 |
Protein GI | 237654022 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.865924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACAC GCACGAGCGC GTCCGAGCCC GGACAGGGCG GAGCAGGGCC GGATACCTTT CCGCGCTGGC TGATGCACCA CGCCAAGGCG CGCCCCCAGC AGCCGGCGAT GCGCGAGAAG GAATACGGCA TCTGGCAGAC CTATACCTGG GCGCAAGTGG CCGAGAACGT GCGCGCGATC GCCTGCGGGC TGGCGCAGCT CGGCTTCAAG CGCGGCGACC GCCTCGCGGT GGTGGGCGAC AACCGCCCGC GGTTGTACTG GTCGGTGGCG GCCTGCCAGT GCCTGGGCGG CATCCCGGTG ATGATGTACC AGGACGCGGT GGCCCAGGAG ATGGCCTACG TGCTGCAGGA CGCCGAGATC AAGTTCGCCG TGGTCGAGGA CCAGGAGCAG GTGGACAAGA TGGTCGAGAT CCAGCCCGAT GCGCCGCTGC TCGCGCATGT GATCTACGAC GACCCCCGTG GCATGCGCCA TTACACCCAG ACCCTGCTGA TGGGCCTGGA CGAGCTGCAG GAGATGGGGC GCATCCACGA CCGCAACCAG CCTGATTTCC TCGACGGCGA GATCGACAAG GGCGCCTCCG ACGACATCTC GATCATGCTG TACACCTCGG GCACCACCGG CAAGCCCAAG GGCGTGTGCC AGACCCATGG CGCGTTCATC GCCGCGGCGC GCGGCGCGAT CCAGTTCGAC AAGCTCACCG ACAAGGAGGA CATCCTGTCC TACCTGCCGA TGGCCTGGGT GGGCGACCAC CTGTTCTCGT TCGCGCAGGC CACGGTGGCG GGCTTCACGA TCAACTGCCC GGAGTCGGGC GAGACGGTGA TGAACGATCT GCGCGAGATC GGACCCACCT ATTACTTCGC CCCGCCGCGC GTGTTCGAGA ACCTGCTGAC CCAGGTGATG ATCCGCATGG AAGACGCCGG CAGCCTCAAG CGCAAGATCT TCCATCACTT CATGGACGTG GCGCGGCGCT GCGGCGCCGA CATCCTGGAT GGCAAGCCGG TGTCGGGTGG CGATCGCTTC CAGTACTGGC TGGGCAACCT GCTGGTGTAT GGGCCGCTGA AGAACGTGCT CGGCATGAGC CGCATCCGCG TGGCCTATAC GGCCGGCGCG GCGATCGGGC CCGACCTGTT CCGCTTCTAC CGCTCGATCG GCATCAACCT CAAGCAGCTC TACGGCCAGA CCGAGACCTG CGCCTACGTG TGCCTGCAGC CCGACGGCGA GATCAAGCTG GATTCGGTCG GCAAGCCGGC ACCCTTCGTC GAGGTCAAGC TCGCCGACAA CGGCGAGATC CTGGTCAAGG GGCCGATGCT GCTCAAGGCC TACTACAAGC GCCCCGACGC GACCGCCGAG TCGATCAACG CCGATGGCTA CTTCATGACC GGCGACGCCG GCTTCTTCGA CGCCGACGGC CACCTGAAGA TCATCGACCG TGCCAAGGAC GTCGGCAAGA TGGCCGACGG CACGATGTTC GCGCCCAACT ACATCGAGAA CAAGCTCAAG TTCTTCCAGC ACATCAAGGA GGCGGTGACC TTCGGTGCCG GCAAGGAATT CGCTACCGCC TTCATCAACA TCGACCTCGA GGCGGTGGGC AACTGGGCCG AGAAGAAGGG CATGGCCTAC TCCGGCTACA CCGACCTCGC CCAGCAGGCT GCGGTGTACG AGCTGATCCG CGACTGCGTC GAGAAGGTCA ACGCCGACCT CGCCGCCGAC CCCAACATGA GCGGCTCGCA GATCAAGCGC TTCCTGATCC TGCACAAGGA GCTCGACGCC GACGACGGCG AGCTCACGCG CACGCGCAAG GTACGGCGCA ATTTCATCGC CGAGAAGTAC GGCGTGCTGA TCGAGGCGAT GTTCGAAGGC CGCAAGACCC AGTTCATCGA GACCCAGGTC AAGTACGAGG ACGGCCGCAC CGGCAAGGTC TCGGCCGACC TGCGCATCGA GGAGGTGAAG ACCTTCGCGC CGCAGGCCGG CAAGCGCGCG GCCTGA
|
Protein sequence | MDTRTSASEP GQGGAGPDTF PRWLMHHAKA RPQQPAMREK EYGIWQTYTW AQVAENVRAI ACGLAQLGFK RGDRLAVVGD NRPRLYWSVA ACQCLGGIPV MMYQDAVAQE MAYVLQDAEI KFAVVEDQEQ VDKMVEIQPD APLLAHVIYD DPRGMRHYTQ TLLMGLDELQ EMGRIHDRNQ PDFLDGEIDK GASDDISIML YTSGTTGKPK GVCQTHGAFI AAARGAIQFD KLTDKEDILS YLPMAWVGDH LFSFAQATVA GFTINCPESG ETVMNDLREI GPTYYFAPPR VFENLLTQVM IRMEDAGSLK RKIFHHFMDV ARRCGADILD GKPVSGGDRF QYWLGNLLVY GPLKNVLGMS RIRVAYTAGA AIGPDLFRFY RSIGINLKQL YGQTETCAYV CLQPDGEIKL DSVGKPAPFV EVKLADNGEI LVKGPMLLKA YYKRPDATAE SINADGYFMT GDAGFFDADG HLKIIDRAKD VGKMADGTMF APNYIENKLK FFQHIKEAVT FGAGKEFATA FINIDLEAVG NWAEKKGMAY SGYTDLAQQA AVYELIRDCV EKVNADLAAD PNMSGSQIKR FLILHKELDA DDGELTRTRK VRRNFIAEKY GVLIEAMFEG RKTQFIETQV KYEDGRTGKV SADLRIEEVK TFAPQAGKRA A
|
| |