Gene Sros_8569 details

Gene Information

Locus tag: Sros_8569
Symbol:
ID: 8671903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9455466
End bp: 9456686
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Acetyl-CoA C-acyltransferase
Protein accession: YP_003343954
Protein GI: 271969758
COG category:
COG ID:
TIGRFAM ID:
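
The start, end, gene length and protein length fields in this record agree with one another if the coordinates are read as 1-based and inclusive and the CDS is taken to end in a single stop codon. Both are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, using only values copied from this record (the function names are illustrative):

```python
# Values copied from the Gene Information fields of this record.
START_BP = 9455466
END_BP = 9456686
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Encoded residues, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1221
print(protein_length(GENE_LENGTH_BP))   # 406
```

Under these assumptions, 1221 bp corresponds to 407 codons, of which the last is the stop codon, leaving 406 residues.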


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGG CAGTCATCGT CGCAACCGCG CGCTCGCCGA TCGGACGCGC CTTCAAGGGA 
TCCCTCAAGG ACATCCGTCC CGACGACCTG ACCGCACAGA TGATCAAGGC CGCGCTGGCC
AAGGTCCCCC AGCTCGACCC CACCGGCATC GACGACCTGA TGCTGGGCTG CGGCCTGCCG
GGCGGCGAGC AGGGGTTCAA CATGGCCCGC GTGGTCTCCA CGCTGCTCGG GCTGGACACC
GTGCCCGGCA CCACCGTCAC CCGCTACTGC TCGTCCTCGC TGCAGACCAC CAGGATGGCG
CTGCACGCGA TCAGGGCGGG CGAGGGCGAC GTGTTCGTCT CGGCGGGCGT GGAGTGCGTC
TCCCGCTTCG CCAAGGGCAA CTCCGACTCG CTGCCCGACA CGCAGAACAC GCTGTTCGAC
GAGGCCCGCG GCCGTTCGGC CAAGGCCGCC GAGGGCGGCG GCGAGGTCTG GCACGACCCG
CGCGAGGACG GCACCGTGCC CGACGTCTAC ATCGCGATGG GCCAGACCGC GGAGAACCTC
GCCGGGTTGA AGGGCGTCTC CCGCCAGGAA CAGGACGAGT TCGGCGTCCG GTCCCAGAAC
CTGGCCGAGA AGGCGATCGC CAACGGCTTC TGGGAGACCG ACATCACCCC GGTCACCCTG
CCCGACGGGA CCGTGGTCAG CAAGGACGAC GGTCCCCGCG CGGGCACCAC CTACGACGCG
GTCTCGCAGC TCAAGCCGGT CTTCCGGCCG GACGGGACGG TCACCGCCGG CAACTGCTGC
GCGCTGAACG ACGGCGCCGC CGCGGTGATC GTGATGAGCG ACACCAGGGC CGCCGAGCTG
GGCATCACCC CCCTCGCCCG GATCGTCTCC ACCGGCGTGA CGGGCCTGTC CCCCGAGATC
ATGGGCCTGG GCCCGGTCGA GGCCTCCAGG CAGGCCCTGG CGCGGGCGGG CATGTCGATC
GGCGACGTGG ACCTCGTCGA GATCAACGAG GCCTTCGCCG CCCAGGTCAT CCCGTCCTAC
CAGGATCTCG GCATCGACCT CGACCGGCTC AACGTCAACG GCGGCGCCAT CGCGGTGGGC
CACCCGTTCG GCATGACCGG TGCCCGGATC ACCTCCACGC TGATCAACAG CCTCCGGTTC
CACGACCGGT CGATCGGCCT GGAGACCATG TGCGTGGGCG GCGGTCAGGG CATGGCCATG
GTCCTGGAGC GCCTCAGCTA G
 
Protein sequence
MPEAVIVATA RSPIGRAFKG SLKDIRPDDL TAQMIKAALA KVPQLDPTGI DDLMLGCGLP 
GGEQGFNMAR VVSTLLGLDT VPGTTVTRYC SSSLQTTRMA LHAIRAGEGD VFVSAGVECV
SRFAKGNSDS LPDTQNTLFD EARGRSAKAA EGGGEVWHDP REDGTVPDVY IAMGQTAENL
AGLKGVSRQE QDEFGVRSQN LAEKAIANGF WETDITPVTL PDGTVVSKDD GPRAGTTYDA
VSQLKPVFRP DGTVTAGNCC ALNDGAAAVI VMSDTRAAEL GITPLARIVS TGVTGLSPEI
MGLGPVEASR QALARAGMSI GDVDLVEINE AFAAQVIPSY QDLGIDLDRL NVNGGAIAVG
HPFGMTGARI TSTLINSLRF HDRSIGLETM CVGGGQGMAM VLERLS
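
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration and not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGCCCGAGG CAGTCATCGT CGCAACCGCG CGCTCGCCGA TCGGACGCGC CTTCAAGGGA"
LAST_LINE = "GTCCTGGAGC GCCTCAGCTA G"

print(translate(FIRST_LINE))  # MPEAVIVATARSPIGRAFKG
print(translate(LAST_LINE))   # VLERLS
```

Passing the full gene sequence to `translate` and `gc_fraction` gives a way to check the protein sequence and the GC content field against the nucleotide data.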