Gene Sros_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1697 
Symbol 
ID8664974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1811968 
End bp1813155 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content72% 
IMG OID 
ProductAcetyl-CoA C-acetyltransferase 
Protein accessionYP_003337431 
Protein GI271963235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.035668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.106126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGTT CCGTCATCGT CGCCGGAGCT CGCACCCCCA TCGGCCGGTT GCTCGGCTCG 
CTGGCCGGCC TGTCGGCCGT CGAGCTCGGC GGCATCGCCA TCAAGGCCGC GCTGGAGCGC
TCCGGCGTCG CCCCCGAGTC CGTGCAGTAC GTGATCATGG GCCAGGTCCT CCAGGCCGGA
GCGGGTCAGA TCCCCTCCCG CCAGGCGGCC GTCAAGGCCG GGATCCCGAT GACCGTGCCG
TCGCTGACGA TCAACAAGGT CTGCCTGTCC GGGCTGGACG CCATCGCCTT GGCCGACCAG
CTCATCAGGG CGGGCGAGTT CGACGTCGTG GTCGCCGGCG GCATGGAGTC CATGTCGAAC
GCCCCCCACC TGCTGCCCGG CCTGCGCAGG GGAGTGAAGT ACGGCGACGC CGGCATCGTG
GACTCGATGG CCTTCGACGG CCTGACCGAC GCCTACGACC AGGTGTCCAT GGGCGAGTCC
ACCGAGCGGC ACAACGCGCG CCTCGGCCTG ACCCGCGAGG AGCAGGACGC GTTCTCCGCC
CGTTCCCACG AGCTCGCCGC CGCCGCGATC AAGAACGGCG TGCTCGACGA CGAGATCGTT
CCGGTGCCGG TCCCGCAGCG CAAGGGGGAG CCGGTGATGT TCGCCGCCGA CGAGGGCGTG
CGCGGCGACA CCACGGTCGA GACCCTGGGA CGGCTGCGGC CGGCCTTCAG CAAGGACGGC
ACCATCACCG CCGGGTCCGC CTCGCAGATC TCCGACGGCG CCTGCGCGGT GGTCGTGATG
TCCAAGGCCA AGGCCGAGGA ACTGGGCCTG GAGTGGCTGG CGGAGATCGG CGCGCACGGC
AACGTGGCCG GGCCCGACAA CTCGCTCCAG TCCCAGCCCG CCAACGCGAT CAAGCACGCC
CTCGGCAAGC AGGGGCTCTC GGTCGAGGAC CTCGACCTGC TGGAGATCAA CGAGGCCTTC
GCCCAGGTCG TCCTCCAGTC GGCCAAGGAC CTCGGCGTCC CGCTCGACAA GGTCAACGTC
AACGGCGGCG GCATCGCCGT CGGCCATCCG ATCGGCGCCT CCGGCGCCCG CATCGTCCTC
GCCCTCGCCC ACGAGCTCAG GCGCCGGGGC GGCGGGCTCG GTGCCGCGGG CCTGTGCGGC
GGCGGCGGCC AGGGCGATGC GCTGATCATC CGGGTCCCCT CGGCCTGA
 
Protein sequence
MSGSVIVAGA RTPIGRLLGS LAGLSAVELG GIAIKAALER SGVAPESVQY VIMGQVLQAG 
AGQIPSRQAA VKAGIPMTVP SLTINKVCLS GLDAIALADQ LIRAGEFDVV VAGGMESMSN
APHLLPGLRR GVKYGDAGIV DSMAFDGLTD AYDQVSMGES TERHNARLGL TREEQDAFSA
RSHELAAAAI KNGVLDDEIV PVPVPQRKGE PVMFAADEGV RGDTTVETLG RLRPAFSKDG
TITAGSASQI SDGACAVVVM SKAKAEELGL EWLAEIGAHG NVAGPDNSLQ SQPANAIKHA
LGKQGLSVED LDLLEINEAF AQVVLQSAKD LGVPLDKVNV NGGGIAVGHP IGASGARIVL
ALAHELRRRG GGLGAAGLCG GGGQGDALII RVPSA