Gene Sare_1423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1423 
Symbol 
ID5704812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1645074 
End bp1647803 
Gene Length2730 bp 
Protein Length909 aa 
Translation table11 
GC content75% 
IMG OID641270933 
ProductCoA-binding domain-containing protein 
Protein accessionYP_001536314 
Protein GI159037061 
COG category[C] Energy production and conversion
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming)
[COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000471677 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCCGTGT CGAGTCACCG CCCGGTCGCG CTGACCTTCG ACGAGACAGG CCCGAACACG 
GCCGGGGGAG GGACACTCGT CGTGACCACA GGTGTTCAGC CGGTGGATGT GTTGCTCAGC
GACGGCACCA CCGTCGGATT GCGGCCGATC CAGCCCACGG ACGCGCCGGG CATCGTCGCC
ATGCACTCGC GCTTCTCCGA GCGCACCCGC TACCTGCGTT ACTTCTCGCC GTACCCCCGT
ATTCCAGAGC GAGACCTGCG GCGTTTCGTG AACGTCGACC ACCACGACCG GGAGGCGTTC
GTGGTGCTGG TCGGCGACCA GATCGTCGCG GTCGGCCGAT ACGAGCGGTT GGGCCCGGCC
TCCCCCGAGG CCGAGGTGGC CTTCGTCGTC GAGGACGCCT ACCAGGGCCG GGGCATCGGG
TCGGTGCTGT TGGAACACCT CGCCGACGCG GCCCGGCGAG TTGGCATCCC GACCTTCGTG
GCGGAGGTGC TGCCGGCCAA CGGTGCGATG CTCCGGGTCT TCGCCGACTT CGGATACCAG
GTGCAGCGCC AGTTCGCCGA CGGCGTCGTG CATCTGAGCT TCCCGATCGC GCCGACCGAG
GCGACCCTCG AGGTGCAGCG GGGCCGCGAG CACCGTACCG AGGCGCGGTC GGTCGCGCGG
CTGCTCGCGC CGCGGGGGGT CGCCTTCTAC GGGGCCAGCG CCACCGGGCA GGGCGTCGGG
GCGGCGGTGC TCGGGCACCT GCGCGACTAC GGGTTCACCG GCGCGGTGGT GCCGGTGCAC
CCGAGCGCCC GGACGGTGGC CGGGCTGCCC GCGTATCCAT CCGCGGCCGA GGCGGGCCTG
CCGGTCGACC TGGCGGTGGT GGCGGTGCCG CCGGCGGCCG TGGAGGCGGT CGTGGCGGAC
GCGGCCAGCG CCGGGGCGCA CGGCCTGGTC GTCATCAGCG CGGGCTTCGC CGAGGCCGGG
GCCGACGGCG CGGTCGCGCA GCGCCGGCTG GTTCGGGCGG CCCATGCGGC GGGCATGCGG
ATCATCGGCC CGAACTGCCT GGGGGTGGCG AACACCGGCA CCGAGGTACG GCTGAACGCC
ACGCTGGCCC CACGGCTGCC GGTCCCCGGC CGGGTTGGTC TGTTCAGCCA GTCCGGCGCG
TTCGGGGTGG CGCTGTTGGC CGAGGTGGAT CGGCGGGGGC TGGGGCTGTC CAGCTTGGTG
TCTGCCGGGA ACCGGGCCGA CGTCTCCGGT AATGACCTGT TGCAGTACTG GCAGGACGAC
CCCGACACCG ACGTGATCCT GCTGTACCTG GAAACGTTCG GTAACCCGCG CAAGTTCGCC
CGGCTGGCCC GGAGAATCGG GCGGGAGAAG CCGATCGTCG CGCTGGCACC GCCGGCCCGC
CTGCCCGGTC TCGGCCCGTC GGCCGGTCCG AACGGGGGCG CGGCCAATCC GTATGGGGGC
GTGGCCGGTC CGAATGCGGC CGGTCCGGAC GGGGGCGCGG CTGGCCTGGT TGCGGCCGGT
CCGGATGAGG TCGCGGTCAG TGCGCTGTTC GCCCATTCCG GGGTGATCCG GGTGGACACT
GTCGCCGAGC TGCTCGACGT CGGCGTGCTG CTGGCCAACC AGCCGCTGCC CGCCGGCGAC
CGGGTGGGCG TCGTGGGTAA CTCCTCGGCG CTGACCGGGC TGGCCGCCAC CGCCGCCGCA
GCAGCAGGGC TCACCGTCGC CGACGGCTAC CCCCGCGACG TCGGGCCACA CGCCGGGGCG
GCGGATTTCG CGACCGCTCT CGCCGCCGCC GTGGCCGACG ACGGTGTGGA CGCGTTGGTG
GCCGTGTTCG CCCCGCCGCT GCCAGGCCAA CTGCCCGACG CCGAGGCGGA CTTCACCTCG
GCGCTGCCCG CGGCACTCGC CGGCGGCAAG CCCACCGTGG CGACGTTCCT GGCCGGACGA
GCCCCCTCCG GCGTGCCCGC GTACCCGAGC GTGGAGGAGG CGGTGCGGGC CCTGGGCCGG
GTGACCGCGT ACGCCGGATG GCTGCGCCGA CCCGCCGGCA CGGTCCCCGA GCTGTCCGAC
GTGGACCGGG ACGCGGCTCA GGCGGCACTG CGGCCAGAAA CGTTCGATCC AACGGGTCTG
CTCGCCGCGT ACGGGATCGA CGTGGTCGAG TCGGTGCTGG CGGCGTCCGA GCAGGAGGCC
GCCGCGGCGG CGCGACGCCT GGGGTACCCG GTGGCGATGA AGGCCGCCGC CGCCGGCCTG
CGGCACCGGC TGGACCTTGG CGCGGTCCGC CTGGACCTGC CCGACGAGGC GAGGGTGCGG
CGGGCGTACA CCGAGATGGC GACGGAGTTC GGCGCTGACG TCCTGGTTCA GCCGATGGTC
CCGCCCGGCG TGGCCTGCGT GGTGGAGCTG GTGGAGGACC CGGCGTTCGG GCCGGTGGTC
GGCTTCGGCG TGGGCGGTGT CGCCACCGAA CTGCTCGGTG ACCGGGCCTG GCGGGCGGTG
CCGCTGACCG GCCGGGACGC GGCGGAGCTG GTTGACGAGC CGCGGGCGGC CCCGCTGCTG
CGGGGCCATC GTGGGGCGGC ACCGGTGGAC CGGAAAGCCC TGGCTGAGCT GCTGTTGCGG
GTCGGGCAGC TGGCCGACGA GCAACCCCGG GCTCGTACGC TGACGCTGAA CCCGGTGCTG
GCCCGGCCGG ACGGGCTGTC GGTGCTGCAC GCCAGCGTGG GGCTCGGCTC GGCCGCCGCC
CGCCCCGACA CCGGCCCCCG CCGCCTGTGA
 
Protein sequence
MPVSSHRPVA LTFDETGPNT AGGGTLVVTT GVQPVDVLLS DGTTVGLRPI QPTDAPGIVA 
MHSRFSERTR YLRYFSPYPR IPERDLRRFV NVDHHDREAF VVLVGDQIVA VGRYERLGPA
SPEAEVAFVV EDAYQGRGIG SVLLEHLADA ARRVGIPTFV AEVLPANGAM LRVFADFGYQ
VQRQFADGVV HLSFPIAPTE ATLEVQRGRE HRTEARSVAR LLAPRGVAFY GASATGQGVG
AAVLGHLRDY GFTGAVVPVH PSARTVAGLP AYPSAAEAGL PVDLAVVAVP PAAVEAVVAD
AASAGAHGLV VISAGFAEAG ADGAVAQRRL VRAAHAAGMR IIGPNCLGVA NTGTEVRLNA
TLAPRLPVPG RVGLFSQSGA FGVALLAEVD RRGLGLSSLV SAGNRADVSG NDLLQYWQDD
PDTDVILLYL ETFGNPRKFA RLARRIGREK PIVALAPPAR LPGLGPSAGP NGGAANPYGG
VAGPNAAGPD GGAAGLVAAG PDEVAVSALF AHSGVIRVDT VAELLDVGVL LANQPLPAGD
RVGVVGNSSA LTGLAATAAA AAGLTVADGY PRDVGPHAGA ADFATALAAA VADDGVDALV
AVFAPPLPGQ LPDAEADFTS ALPAALAGGK PTVATFLAGR APSGVPAYPS VEEAVRALGR
VTAYAGWLRR PAGTVPELSD VDRDAAQAAL RPETFDPTGL LAAYGIDVVE SVLAASEQEA
AAAARRLGYP VAMKAAAAGL RHRLDLGAVR LDLPDEARVR RAYTEMATEF GADVLVQPMV
PPGVACVVEL VEDPAFGPVV GFGVGGVATE LLGDRAWRAV PLTGRDAAEL VDEPRAAPLL
RGHRGAAPVD RKALAELLLR VGQLADEQPR ARTLTLNPVL ARPDGLSVLH ASVGLGSAAA
RPDTGPRRL