Gene Information |
Locus tag | Sare_4434 |
Symbol | |
ID | 5705912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5007852 |
End bp | 5009837 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273850 |
Product | acetyl-CoA synthetase |
Protein accession | YP_001539199 |
Protein GI | 159039946 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
|
|
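The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5007852
end_bp = 5009837
gene_length_bp = 1986
protein_length_aa = 661

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons = span // 3
predicted_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("predicted protein length matches:", predicted_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the reported values (1986 bp and 661 aa).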
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000112496 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGAGG CGTTGGCCAA TCTGTTGAAC GAGACGCGCC AGTTCCCGCC GCCGCCCGGA CTCGCCGCGA ACGCCAACGT CACCGGCGAG GCGTACGCCG CGGCGGACGC CGACCGGCTG GCGTTCTGGG AACAGCAGGC CCGGCGACTG GCCTGGGCGA AGCAGTGGGA TCGGGTGCTC GACTGGTCCG AGCCACCGTT CGCGAGGTGG TACGTCGGCG GTCAGCTCAA CGTGGCGTAC AACTGCCTCG ACCGGCATGT CGAGGCCGGC CGGGGTGAGC GGGTCGCCAT CCACTGGGAG GGCGAGCCGG GTGACGTCCG CACGATCACC TACGCGGAGT TGCACAGGCT CACCTGCCAG GCGGCGAACG CGCTGACCGA TCTCGGGGTG ACCGCTGGCG ACCGGGTGGC GATCTACCTG CCGATGGTGC CGGAGGCTGC GGTGGCGATG CTGGCCTGCG CCCGGATCGG TGCCACCCAC AGCGTGGTTT TCGGTGGGTT CTCCGCCGAC GCGCTGACCA ACCGGATCCA GGACGCCGCC GCGAAGGTGG TGATCACCGC CGACGGCGGC TATCGGCGGG GCAGGCCGTC GGCGCTCAAG CCGACTGTCG ACGAGGCGGT GTCGAACTGC CCGTCGGTGG AGCATGTGCT GGTGGTCCGC CGCACCGGCG AGGAGGTCGC CTGGTCGGGG AAGGACCGTT GGTGGCATGA GACGGTGGAG CGGGCGCCCG TCGAGCACCA GGCCGTGGCC TTTGATGCCG AGCACCCGCT GTTCATCCTC TACACCAGCG GCACCACCGC CCGCCCGAAG GGCATTCTGC ACACCACCGG CGGCTATCTG ACCCAGGCTT CGTACACGAT GCATGCGGTC TTCGACCTCA AGCCGGACAC CGACGTCTAC TGGTGCACCG CCGACGTCGG TTGGGTCACC GGCCATTCGT ACATCGTGTA CGGTCCGCTG TCCAACGGCG CCACCCAGGT CATGTACGAG GGCACTCCGG ACACCCCCCA CCGGGGCCGT TTCTGGGAGA TCGTGGACAA GTACCGGGTG ACCATCCTCT ACACCGCCCC GACCCTGATC CGGACGATGA TGAAGTGGGG TGAGGACATC CCCGCTGGGT TCGAACTGTC GTCGCTACGA CTGCTGGGCA GCGTGGGGGA GCCGATCAAC CCCGAGGCCT GGATGTGGTA CAGGCAGCAC GTCGGGCGGG GTGAGCTTCC TATCGTGGAC ACCTGGTGGC AGACCGAGAC CGGCGCCATC ATGATCTCTC CGCTGCCGGG CGTGACCCAC GCCAAGCCGG GATCGGCGAT GACTCCACTG CCGGGGATCA ACGGCGACGT GGTGGACGAC CAGGGCCAGC CGGTGCCCAA CGGCGGGGGC GGCTACCTGG TGGTCAGGGA GCCGTGGCCG TCGATGCTCC GGACCATCTG GGGCGACGAC AACCGGTTCG TCGAGACGTA CTGGTCGCGC TTCGGCGCCG GGGCCGGCGC GGGTGACGAC TGGGTCTACT TCGCCGGGGA CGGCGCCAAG AAGGACGACG ACGGGCACAT CTGGTTGTTG GGCCGGGTCG ACGATGTGAT GCTGGTGTCC GGGCACAACA TTTCCACCAC CGAGGTGGAG TCGGCGCTGG TGTCGCATCC GTCGGTGGCC GAGGCAGCGG TGGTCGGCGC GACCGACCCG ACCACGGGAC AGGCGATCGT CGCCTTCGCC ATCCCCCGAG GCAGCACCGA AACCGGAGGT ACGGCGGGTG CGCGGCTCAT CGCGGACCTA CGCGACCATG TCGCGCGCAC GTTGGGTCCG 
ATCGCGAAGC CGCGGCAGAT CTTGCTCGTG CCGGAGCTGC CGAAGACCCG CTCCGGCAAG ATCATGCGGC GGCTACTTCG GGACGTGGCC GAGAACCGGT CGCTGGGCGA TGTCACGACC CTGCAGGATT CGTCGGTGAT GGACCTCATC TCAGCCGGCA TGGGAACCGG CAAGGGCGAG GACTGA
|
Protein sequence | MSEALANLLN ETRQFPPPPG LAANANVTGE AYAAADADRL AFWEQQARRL AWAKQWDRVL DWSEPPFARW YVGGQLNVAY NCLDRHVEAG RGERVAIHWE GEPGDVRTIT YAELHRLTCQ AANALTDLGV TAGDRVAIYL PMVPEAAVAM LACARIGATH SVVFGGFSAD ALTNRIQDAA AKVVITADGG YRRGRPSALK PTVDEAVSNC PSVEHVLVVR RTGEEVAWSG KDRWWHETVE RAPVEHQAVA FDAEHPLFIL YTSGTTARPK GILHTTGGYL TQASYTMHAV FDLKPDTDVY WCTADVGWVT GHSYIVYGPL SNGATQVMYE GTPDTPHRGR FWEIVDKYRV TILYTAPTLI RTMMKWGEDI PAGFELSSLR LLGSVGEPIN PEAWMWYRQH VGRGELPIVD TWWQTETGAI MISPLPGVTH AKPGSAMTPL PGINGDVVDD QGQPVPNGGG GYLVVREPWP SMLRTIWGDD NRFVETYWSR FGAGAGAGDD WVYFAGDGAK KDDDGHIWLL GRVDDVMLVS GHNISTTEVE SALVSHPSVA EAAVVGATDP TTGQAIVAFA IPRGSTETGG TAGARLIADL RDHVARTLGP IAKPRQILLV PELPKTRSGK IMRRLLRDVA ENRSLGDVTT LQDSSVMDLI SAGMGTGKGE D
|
| |
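The sequences above are displayed in space-separated blocks of ten residues. A minimal sketch of how such a block-formatted nucleotide string might be normalised and its GC fraction computed is given below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from the record (not the full gene).
sample = "ATGAGCGAGG CGTTGGCCAA TCTGTTGAAC"
cleaned = clean_sequence(sample)

print(len(cleaned))
print(cleaned.startswith("ATG"))  # the record's gene sequence begins with ATG
print(round(gc_fraction(cleaned), 3))
```

Applying the same two helpers to the full gene sequence would allow the reported GC content to be compared against a recomputed value.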