Gene Sros_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1054 
Symbol 
ID8664328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1071485 
End bp1072630 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content69% 
IMG OID 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_003336797 
Protein GI271962601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.312348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00410165 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAGAGG CGTACATCGT CGGGGCGGTC CGCACCCCGG TCGGTAAGAA GAAGGGCGGT 
CTGTCCACCG TCCACCCCAC CGATCTGGCC GCGCACACCC TCAAGGCGCT CATCGACCGC
ACGGGCGTCG ACCCGTCCGC GGTGGAAGAC GTCATCATGG GCTGCGTCAT GCAGTTCGGC
CCGCAGAGCA TGGACATCGC GCGTAACGCC TGGCTGTCGG CAGGTCTTCC GGAGAGCGTC
GCCGGGGTGA CCATCGACCG CCAGTGCGGC TCCTCCCAGC AGTCGATTCA CTTCGCGGCC
CAGGGAGTGC TCTCCGGCAC CCAGGACCTC GTGGTGGCCG CCGGAGTGGA GTCGATGAGC
ATCGTGCCGA TGGGCTCGTC GATCACCGCG GCGCTGGAGA AGGGTATGCC CTTCCCGTTC
GGTGACATGT GGGTCGAGCG GTACGGCAAG CAGGAGATCT CCCAGTTCCG CGGCGCCGAG
CTCATGTGCG AGAAGTGGGA TCTGTCGCGG GAGGAACTGG AGCGGTTCGC CTACGAGAGC
CACCAGCGGG CGGCCAAGGC CATCGCGAAC GGTTACTTCA AGGACCAGAT CGCCCCGGTC
AACGGCGTCG AGGACGACGA GGGACCGCGC GCGGACAGCA CGCTGGAGAA GATGGCCTCC
CTGAAGACGC TGCGGGACGG CGGCCGGATC ACCGCCGCGA CCTCCTCGCA GATCTCCGAC
GGCTCGGGTG CGCTCCTCAT CGCCTCCGAG CAGGCGGTCC GCGACCACGG CCTGACGCCC
CGGGCCCGCA TCGTCACCCT GGCGCTCACC GGAGACGACC CGGTCTACAT GCTGACCGCG
CCGATCCCGG CGACCCAGAA GGCGCTGAAA AGGTCCGGCC TGTCCATCGA CGACATCGAC
GTCACCGAGA TCAACGAGGC GTTCGCCCCG GTCCCGCTGG CGTGGATCAA GGACCTCGGC
GCCGACCCGG CCAAGGTCAA CCCGAACGGC GGCGCGATCG CCCTGGGCCA CCCGCTGGGC
GGGACCGGCG CGATCCTGAT GACCAAGCTG CTCCACGAAC TGGAGCGGAC CGGCGGCAGG
TACGGCCTGC AGACGATGTG CGAGGGCGGC GGCCAGGCCA ACGTCACGAT CATCGAGCGC
CTCTGA
 
Protein sequence
MAEAYIVGAV RTPVGKKKGG LSTVHPTDLA AHTLKALIDR TGVDPSAVED VIMGCVMQFG 
PQSMDIARNA WLSAGLPESV AGVTIDRQCG SSQQSIHFAA QGVLSGTQDL VVAAGVESMS
IVPMGSSITA ALEKGMPFPF GDMWVERYGK QEISQFRGAE LMCEKWDLSR EELERFAYES
HQRAAKAIAN GYFKDQIAPV NGVEDDEGPR ADSTLEKMAS LKTLRDGGRI TAATSSQISD
GSGALLIASE QAVRDHGLTP RARIVTLALT GDDPVYMLTA PIPATQKALK RSGLSIDDID
VTEINEAFAP VPLAWIKDLG ADPAKVNPNG GAIALGHPLG GTGAILMTKL LHELERTGGR
YGLQTMCEGG GQANVTIIER L