Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3439 |
Symbol | |
ID | 8334792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3809505 |
End bp | 3812342 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644956583 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003114186 |
Protein GI | 256392622 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.571319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.459804 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGAC AGAACCAGAC CCCTTACGAC CTCGTCCGCG CCGTGGCGCG CGCCGACGGC GACCGCGCCG CGCTTGTCGA CCCGGCCGGC GCCGGGGCGC AGCTGTCGTA CGCGGAGCTG ATCGCCCGCA CCGAGACGCT GGCGCGCCGG CTGCGGGAGC ACGGCGTGGG CGCCGAGCAG CCGGTCGCCG TCGTGCTGGA GCGCGGGGCG GACACTGTGG TGGCGATGCT GGCGGTGCTC GCCGCCGGCG GCGTCTACTG CCCGCTGGAC GTCTCGGCGC CGGACGCGCG GCTGACCGCG GTCGTGGAAC TGCTCGGGGC GAAGGTCGCG CTGACCGACG CGGCGCACGC CGGCCGGCTG CCCGCCGGGG TGGACGCGCT GCGGCTGGAG GCCGTATCAG CAGCCTCAGT TACTGCCTCA GGCGCCGCCT CAGACTCAGG CACCGCCTTG GACTCAGCCT CGGGCTCAAC CCCAGCCTCA GCCGCGGACT CAGGCACCGC CTCGGACTCG AACTCAGCCT CAGCCTCGGA CTCAGGCACC GGCTCAGCCT CAGCCTCGAA CTCGGCCTCG GACACCGGAG CCCTCGACCC CGACTCAGCG TTCGAGCCCG CCGCACCCAC CCCGGACTCC CTCGCCTACG TCCTCTACAC CTCCGGCTCC ACCGGCGTCC CCAAGGGCGT GGCGATGACC CACCGCGGTC TGTCCCGCCT CATCAGCTGG CAGACCGCCT CCGGCGCTCC GGGACTGCGC ACGCTCCAGT TCACCGCGAC CTCCTTCGAC GTCACCTTCC AGGAAGTGCT CTCCACCCTG GCCACCGGCG GCTGCCTCGT CGTCGCCGGC GAGCAGGTCC GGCGCGATCC GGCGCTGCTG CTGGAGACGA TCGTCAAGGA GCGGATCCAG CGCCTCTTCC TGCCCTACGT CGCCCTGCAG CTCATGGCGG TCACCGCCGC GCGGCTGGCG ATCGTCCCGG AGAGCCTGGA GCACGTCGTC ACCGCCGGCG AGCGCCTGGT CGTCACCCCC GCGATCCGCG ACCTGTTCAC CACGCTGCCG CACTGCCGTC TGGACAACCA CTACGGACCC ACCGAAGCGC ACCTGGTCAC CAGCCAGACC CTGCCGCAGG ACCGCGCCGC CTGGCCGGAC GTCCCCGGCA TCGGCGCCCC GGTCGCCGGC GTCGCCTGCC ATGTCCTCGA CGAGCGGCTG CACGCAGTGC CCGATGGCGA GGTCGGCGAA CTCTACGTCT CAGGACCCTG TCTGGCGCGC GGCTACCTCG CCGACCCGGC ACGCACCGCT GAGCGCTTCG TTGCCGACCC CGGCGCCGAC GCACCGGGGG AGCGTTGGTA CCGGACCGGT GACCTGGTGC GCCGCATCGC CGATGGGACC TATGAATTCC TCGGCCGCGC CGACGGTCAG CTGAAGGTGC GCGGGTTCCG CGTCGAGCCC GGCGAGGTCG AGACCGCGCT GACGAGCCAC CCGCGGGTCC AGGCCGCCGC CGTCGGACTG CGGCAGATCG AGGACGGGAT CTCGATCCTG GTCGGATACC TCCAGACCGA CGGCGCCGTC TCGCAGCGGG AGATCGGCGA TCATGCCAGG GCACTGCTAC CCGCCTACAT GGTGCCCTCG CGCTACCTGA CCGTGCCCGC ACTGCCGCGC ACCGGAACCG GCAAGGTCGA CACCCGGGCG CTCGCCGAGA TCGCGCTGCC CGACGCCGCC GATTCCTCCG ACGCCGCCGA CGCTGCCGAC CCCGACCAGC TCCCCTTGTC CGACCTCATC ACCGCTCTAT GGATCCGCGT CCTCGGCCAC GACGAATTCG ACCCCGACGA CGACTTCTTC GACGTCGGCG GCGACTCCCT GCTCGCCACC TGGGTAGCCG CCGAGCTGGG GCAGATGCTC GGCCGCCCGG TCGAGTTGTC GCTGTTCCTG GAATACAGCA CCGTCGAAGA CCTCGCCGAA GCTCTCGGCT CGCAGGGCTC TGCGACCGCG ACAGGAGCGG CACTGATCGC GGGATCCTCA CGCAGCTCAG CTTCGCAGAT CGTCACCCTG CGCCCCGGAC CGTCCGGCCG CAGTCTGTAC CTGTTCCACC CGCTCGGCGG CGAGCTGATC TGCTACCGCG AGCTGGCCCG CGCCAGCCGT GCCCCGGTCC GCGTCCTCGG CGTCGGCTGG AGTGGCGCGC CGCCGGAGTA CGGCGCCACG CTGGAGGACA TCGCCCGCGT GCACGTCGAG CAGCTGCTGG TCATCCAGCC CGACGCGCCG TTTCTATTGG CCGGGTGGTC CTTCGGCGGC GTGCTCGCCT TCGAAGTCGC CCGGCAGCTC ACCGCGGCCG GCGCGAGCGT GGACTTCCTC GGGCTGATCG ACGCCAACCC GGTGATCGAC CCCATCACCG GGCTGCCGCT GGCGGACACG CCGTTCCTGG GCGTGCTGGA CGAGGTGGTG ACGCTGCTCG ACGCACCCGG GACCACCTCC GCCGATCTCA CGGCTCTGAC ATCCGGCGAC ACCTGGCTCC AGCTCATGGG CGCGCCGATC GCCCCCGGCG CCTCAAGTAC GTATCTGCGG ACCGCGCTGG ACACCGCCCG AGCGTGTATG TGGGCTGCGA TGCGCTACCA GGCGCGCCGC CACGACGGCC CGATCGACGT GTTCCAGGCC TCCGGGTCCG GGGCGGATCG GCAGGAAGCG CTGGCCGGGG CGATCCGCAG CCTGGCCGGC GGCGCGTTCC GGACGGTCGC CGTCCCCGGC GGCCACTGGG CGTGCATCAG GGCGGAGGAC GGGGCCGAGA CGGCCAGGGC ACTGGATGCC GCGCTCGAGC GCGTCGGCGC GGCGGGGAGT GGGACGCATG GATCTTGA
|
Protein sequence | MVRQNQTPYD LVRAVARADG DRAALVDPAG AGAQLSYAEL IARTETLARR LREHGVGAEQ PVAVVLERGA DTVVAMLAVL AAGGVYCPLD VSAPDARLTA VVELLGAKVA LTDAAHAGRL PAGVDALRLE AVSAASVTAS GAASDSGTAL DSASGSTPAS AADSGTASDS NSASASDSGT GSASASNSAS DTGALDPDSA FEPAAPTPDS LAYVLYTSGS TGVPKGVAMT HRGLSRLISW QTASGAPGLR TLQFTATSFD VTFQEVLSTL ATGGCLVVAG EQVRRDPALL LETIVKERIQ RLFLPYVALQ LMAVTAARLA IVPESLEHVV TAGERLVVTP AIRDLFTTLP HCRLDNHYGP TEAHLVTSQT LPQDRAAWPD VPGIGAPVAG VACHVLDERL HAVPDGEVGE LYVSGPCLAR GYLADPARTA ERFVADPGAD APGERWYRTG DLVRRIADGT YEFLGRADGQ LKVRGFRVEP GEVETALTSH PRVQAAAVGL RQIEDGISIL VGYLQTDGAV SQREIGDHAR ALLPAYMVPS RYLTVPALPR TGTGKVDTRA LAEIALPDAA DSSDAADAAD PDQLPLSDLI TALWIRVLGH DEFDPDDDFF DVGGDSLLAT WVAAELGQML GRPVELSLFL EYSTVEDLAE ALGSQGSATA TGAALIAGSS RSSASQIVTL RPGPSGRSLY LFHPLGGELI CYRELARASR APVRVLGVGW SGAPPEYGAT LEDIARVHVE QLLVIQPDAP FLLAGWSFGG VLAFEVARQL TAAGASVDFL GLIDANPVID PITGLPLADT PFLGVLDEVV TLLDAPGTTS ADLTALTSGD TWLQLMGAPI APGASSTYLR TALDTARACM WAAMRYQARR HDGPIDVFQA SGSGADRQEA LAGAIRSLAG GAFRTVAVPG GHWACIRAED GAETARALDA ALERVGAAGS GTHGS
|
| |