Gene Information |
Locus tag | Sare_1423 |
Symbol | |
ID | 5704812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1645074 |
End bp | 1647803 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641270933 |
Product | CoA-binding domain-containing protein |
Protein accession | YP_001536314 |
Protein GI | 159037061 |
COG category | [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
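The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1645074, 1647803)
print(length, protein_length_aa(length))  # prints: 2730 909
```

Under these assumptions the result agrees with the listed Gene Length (2730 bp) and Protein Length (909 aa).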
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.382585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000471677 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGCCCGTGT CGAGTCACCG CCCGGTCGCG CTGACCTTCG ACGAGACAGG CCCGAACACG GCCGGGGGAG GGACACTCGT CGTGACCACA GGTGTTCAGC CGGTGGATGT GTTGCTCAGC GACGGCACCA CCGTCGGATT GCGGCCGATC CAGCCCACGG ACGCGCCGGG CATCGTCGCC ATGCACTCGC GCTTCTCCGA GCGCACCCGC TACCTGCGTT ACTTCTCGCC GTACCCCCGT ATTCCAGAGC GAGACCTGCG GCGTTTCGTG AACGTCGACC ACCACGACCG GGAGGCGTTC GTGGTGCTGG TCGGCGACCA GATCGTCGCG GTCGGCCGAT ACGAGCGGTT GGGCCCGGCC TCCCCCGAGG CCGAGGTGGC CTTCGTCGTC GAGGACGCCT ACCAGGGCCG GGGCATCGGG TCGGTGCTGT TGGAACACCT CGCCGACGCG GCCCGGCGAG TTGGCATCCC GACCTTCGTG GCGGAGGTGC TGCCGGCCAA CGGTGCGATG CTCCGGGTCT TCGCCGACTT CGGATACCAG GTGCAGCGCC AGTTCGCCGA CGGCGTCGTG CATCTGAGCT TCCCGATCGC GCCGACCGAG GCGACCCTCG AGGTGCAGCG GGGCCGCGAG CACCGTACCG AGGCGCGGTC GGTCGCGCGG CTGCTCGCGC CGCGGGGGGT CGCCTTCTAC GGGGCCAGCG CCACCGGGCA GGGCGTCGGG GCGGCGGTGC TCGGGCACCT GCGCGACTAC GGGTTCACCG GCGCGGTGGT GCCGGTGCAC CCGAGCGCCC GGACGGTGGC CGGGCTGCCC GCGTATCCAT CCGCGGCCGA GGCGGGCCTG CCGGTCGACC TGGCGGTGGT GGCGGTGCCG CCGGCGGCCG TGGAGGCGGT CGTGGCGGAC GCGGCCAGCG CCGGGGCGCA CGGCCTGGTC GTCATCAGCG CGGGCTTCGC CGAGGCCGGG GCCGACGGCG CGGTCGCGCA GCGCCGGCTG GTTCGGGCGG CCCATGCGGC GGGCATGCGG ATCATCGGCC CGAACTGCCT GGGGGTGGCG AACACCGGCA CCGAGGTACG GCTGAACGCC ACGCTGGCCC CACGGCTGCC GGTCCCCGGC CGGGTTGGTC TGTTCAGCCA GTCCGGCGCG TTCGGGGTGG CGCTGTTGGC CGAGGTGGAT CGGCGGGGGC TGGGGCTGTC CAGCTTGGTG TCTGCCGGGA ACCGGGCCGA CGTCTCCGGT AATGACCTGT TGCAGTACTG GCAGGACGAC CCCGACACCG ACGTGATCCT GCTGTACCTG GAAACGTTCG GTAACCCGCG CAAGTTCGCC CGGCTGGCCC GGAGAATCGG GCGGGAGAAG CCGATCGTCG CGCTGGCACC GCCGGCCCGC CTGCCCGGTC TCGGCCCGTC GGCCGGTCCG AACGGGGGCG CGGCCAATCC GTATGGGGGC GTGGCCGGTC CGAATGCGGC CGGTCCGGAC GGGGGCGCGG CTGGCCTGGT TGCGGCCGGT CCGGATGAGG TCGCGGTCAG TGCGCTGTTC GCCCATTCCG GGGTGATCCG GGTGGACACT GTCGCCGAGC TGCTCGACGT CGGCGTGCTG CTGGCCAACC AGCCGCTGCC CGCCGGCGAC CGGGTGGGCG TCGTGGGTAA CTCCTCGGCG CTGACCGGGC TGGCCGCCAC CGCCGCCGCA GCAGCAGGGC TCACCGTCGC CGACGGCTAC CCCCGCGACG TCGGGCCACA CGCCGGGGCG GCGGATTTCG CGACCGCTCT CGCCGCCGCC GTGGCCGACG ACGGTGTGGA CGCGTTGGTG 
GCCGTGTTCG CCCCGCCGCT GCCAGGCCAA CTGCCCGACG CCGAGGCGGA CTTCACCTCG GCGCTGCCCG CGGCACTCGC CGGCGGCAAG CCCACCGTGG CGACGTTCCT GGCCGGACGA GCCCCCTCCG GCGTGCCCGC GTACCCGAGC GTGGAGGAGG CGGTGCGGGC CCTGGGCCGG GTGACCGCGT ACGCCGGATG GCTGCGCCGA CCCGCCGGCA CGGTCCCCGA GCTGTCCGAC GTGGACCGGG ACGCGGCTCA GGCGGCACTG CGGCCAGAAA CGTTCGATCC AACGGGTCTG CTCGCCGCGT ACGGGATCGA CGTGGTCGAG TCGGTGCTGG CGGCGTCCGA GCAGGAGGCC GCCGCGGCGG CGCGACGCCT GGGGTACCCG GTGGCGATGA AGGCCGCCGC CGCCGGCCTG CGGCACCGGC TGGACCTTGG CGCGGTCCGC CTGGACCTGC CCGACGAGGC GAGGGTGCGG CGGGCGTACA CCGAGATGGC GACGGAGTTC GGCGCTGACG TCCTGGTTCA GCCGATGGTC CCGCCCGGCG TGGCCTGCGT GGTGGAGCTG GTGGAGGACC CGGCGTTCGG GCCGGTGGTC GGCTTCGGCG TGGGCGGTGT CGCCACCGAA CTGCTCGGTG ACCGGGCCTG GCGGGCGGTG CCGCTGACCG GCCGGGACGC GGCGGAGCTG GTTGACGAGC CGCGGGCGGC CCCGCTGCTG CGGGGCCATC GTGGGGCGGC ACCGGTGGAC CGGAAAGCCC TGGCTGAGCT GCTGTTGCGG GTCGGGCAGC TGGCCGACGA GCAACCCCGG GCTCGTACGC TGACGCTGAA CCCGGTGCTG GCCCGGCCGG ACGGGCTGTC GGTGCTGCAC GCCAGCGTGG GGCTCGGCTC GGCCGCCGCC CGCCCCGACA CCGGCCCCCG CCGCCTGTGA
Protein sequence | MPVSSHRPVA LTFDETGPNT AGGGTLVVTT GVQPVDVLLS DGTTVGLRPI QPTDAPGIVA MHSRFSERTR YLRYFSPYPR IPERDLRRFV NVDHHDREAF VVLVGDQIVA VGRYERLGPA SPEAEVAFVV EDAYQGRGIG SVLLEHLADA ARRVGIPTFV AEVLPANGAM LRVFADFGYQ VQRQFADGVV HLSFPIAPTE ATLEVQRGRE HRTEARSVAR LLAPRGVAFY GASATGQGVG AAVLGHLRDY GFTGAVVPVH PSARTVAGLP AYPSAAEAGL PVDLAVVAVP PAAVEAVVAD AASAGAHGLV VISAGFAEAG ADGAVAQRRL VRAAHAAGMR IIGPNCLGVA NTGTEVRLNA TLAPRLPVPG RVGLFSQSGA FGVALLAEVD RRGLGLSSLV SAGNRADVSG NDLLQYWQDD PDTDVILLYL ETFGNPRKFA RLARRIGREK PIVALAPPAR LPGLGPSAGP NGGAANPYGG VAGPNAAGPD GGAAGLVAAG PDEVAVSALF AHSGVIRVDT VAELLDVGVL LANQPLPAGD RVGVVGNSSA LTGLAATAAA AAGLTVADGY PRDVGPHAGA ADFATALAAA VADDGVDALV AVFAPPLPGQ LPDAEADFTS ALPAALAGGK PTVATFLAGR APSGVPAYPS VEEAVRALGR VTAYAGWLRR PAGTVPELSD VDRDAAQAAL RPETFDPTGL LAAYGIDVVE SVLAASEQEA AAAARRLGYP VAMKAAAAGL RHRLDLGAVR LDLPDEARVR RAYTEMATEF GADVLVQPMV PPGVACVVEL VEDPAFGPVV GFGVGGVATE LLGDRAWRAV PLTGRDAAEL VDEPRAAPLL RGHRGAAPVD RKALAELLLR VGQLADEQPR ARTLTLNPVL ARPDGLSVLH ASVGLGSAAA RPDTGPRRL
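The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned and its GC fraction computed is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as input, so the result is not the whole-gene GC content.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace used to group bases and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
first_blocks = "GTGCCCGTGT CGAGTCACCG"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content field.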