Gene Sare_0909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0909 
Symbol 
ID5706055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1022485 
End bp1023615 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID641270427 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_001535817 
Protein GI159036564 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.738896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.27677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCAG AGACCATCGA GGGGCCCGGA ATCCGGCTGC GTCCGTTCCA CCTCACCGAC 
GCCCCCGCCA CCGCGACCGC CTGCGCCGAC CCGCTGACCC AACGCTTCCT GCCCGCACTG
CCGTCGCCGT ACACGGAGGC CGACGCCCGG TTGTGGATCA CCGAAGGGGC ACCCGGGGTC
TGGGCCACCG GCGGGGCCGC CTACGCCATC ACGGACCGGG CCACGGACCA GCTCCTCGGC
TCGGTCGGGT TGCACGACGT GATCCCCGGT CGCCAGGAGG CGGCGATCGG CTACTGGGTC
GCCCCGTGGG CGCGGGGACG CGGCGTCGCC ACGGCCGCGA CCCGGACCCT CGCCGAGCGG
GCATTCACCA CCGGGACGAT CCGGCTGGAG CTGCTCACCA CAGCTGAGAA CACCGCCAGT
CAGCGGGTGG CGCTGGCCGC CGGCTTCCGC CACGAGGGCG TGCGCCGGTC GGCGAGCCCC
CGTCGCGGTG GCCAGGGACG AGATGATCTC CTCGCCTGGG CGCGTCTCGC CAACGATCCC
CCGGGTCCGA CCCCGCGGTT GCTGCCAGAC CTGCCCGACG GCCGGGTCAC CGACGGCGTG
GTGGAGCTAC GGGCGCTCGG CCCGCAGCAC GCGGCCCACA TGCACGACCT GAACACGCGG
CCCGAGGTGG TCGCCTCCCG GGTGCCGCCG GAGCCGCCGA CGCGGGCGGA CACCGAGCGG
CACTGCCGGG AGGCGATGTC CCGGTGGCTG TGCGACAAGG CCGCGAACAT GGTCATCCTC
GACGCGACGA GCGGAGCCAC CGCTGGCACC TGCACTCTGG TCCTCGACCA TCCGCCGTTC
CGACAGGCGA TGATCGGCTA CAGCCTGCTG CCGGACTGGC GCGGACGCGG CTTCGCGACC
CGCACGATCC GGCTGCTCAC CGCATGGGGA TTCAACGAGG TCCGGCTCGA ACGGATCTGG
GCGGGTACCC ACTCCGGCAA CGTTGCCTCG GAGCGGGTGC TGGAACGGGC CGGGTTCCGC
CGGGAGGGGC GAACACGCGG GGGCCTTCCC AGCGTCGGCA ATGCCCGGGC GGACTGCACG
CTGTACGGCC TGCTCTCCGG TGATCTCGCG CCACCACCCG GAACGTGTTG A
 
Protein sequence
MTPETIEGPG IRLRPFHLTD APATATACAD PLTQRFLPAL PSPYTEADAR LWITEGAPGV 
WATGGAAYAI TDRATDQLLG SVGLHDVIPG RQEAAIGYWV APWARGRGVA TAATRTLAER
AFTTGTIRLE LLTTAENTAS QRVALAAGFR HEGVRRSASP RRGGQGRDDL LAWARLANDP
PGPTPRLLPD LPDGRVTDGV VELRALGPQH AAHMHDLNTR PEVVASRVPP EPPTRADTER
HCREAMSRWL CDKAANMVIL DATSGATAGT CTLVLDHPPF RQAMIGYSLL PDWRGRGFAT
RTIRLLTAWG FNEVRLERIW AGTHSGNVAS ERVLERAGFR REGRTRGGLP SVGNARADCT
LYGLLSGDLA PPPGTC