Gene Sros_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3621 
Symbol 
ID8666909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4016293 
End bp4017480 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content73% 
IMG OID 
ProductAcetyl-CoA C-acyltransferase 
Protein accessionYP_003339295 
Protein GI271965099 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.905144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG CCGTCATCGT GGAGGCCGTA CGGACTCCGA TCGGGAAGGG GAAGCCCGGC 
GGCGCGCTCG CCGGAGTGCA CCCGGTCGAC CTTCTCGCCC ACACCCTGCG CACCCTCGTC
GGACGCTCCG GCGTCGACCC GGCGCTCGTC GACGACGTGA TCGGCGGCTG CGTCGACCAG
GTGGGCGAGC AGGCGATGAA CACCACCCGT TACGCCTGGC TGTCGGCCGG TTTCCCGGAG
TCTGTCCCGG CGACCACCAT CGACCGCCAG TGCGGCTCCT CCCAGCAGGC CGTCCACTTC
GCCGCCCAGG GCGTGATCTC GGGGGCGTAC GACCTGGTCG TGGCCTGCGG GATCGAGTCG
ATGAGCCGGG TGCCGATGTG GTCCAACGTG CCGCCGGGCG CCGACCCCTT CGGCCCGGGG
CTGGCGGCCC GCTACCCGGA GGGGCTCGTC CCCCAGGGCA TCAGCGCCGA GCTCATCGCG
GCCAAATGGT CGATCGGCAG GGAGGAGATG GACGTCTTCG CCACCTCCTC CCACCGGCGC
GCCGCCCAGG CGCACGCAGA CGGCCTGTTC GACGCCGAGC TCGCGCCTGT CGCGACGGAC
GCGGGCACGG TGACCGCCGA CGAGTCCGTA CGGCCGGGCA CCACCCCCGA GATCCTCGCC
GGGCTCCGCC CCGCCTACGC CGACCCCGCC TACGCAGAGC GCTTCCCGCA GATCGAGTGG
TCGGTGACCG CGGGCAACGC CAGTCCGATC AACGATGGCG CGTCCGCTGT GCTGATCGCC
TCCGGCGAGA CCGCGGCCCG CCTCGGACTG CGCCCGAAGG CGCGGCTGCA CAGCTCCGCG
GTGACCGGCT CCGACCCGCT GACCATGCTG ACCGGCATCA TCCCGGCCAC TGACAAGGTG
CTGCGCAGGG CCGGGCTGCG GCTGGACGAC ATCGACCTGT TCGAGGTCAA CGAGGCCTTC
GCCGGCGTCG TGCTGGCCTG GCTGCGGGAG ACCGGCGCCG ACCCGGCCAA GGTCAACGTC
AACGGCGGCG CCATCGCGCT GGGCCATCCC CTCGGTGCCA GCGGGACCCG CCTGATGGCC
ACCCTCGTCA ACGCCATGCA CCAGCGCGGC GCCCGCTACG CGCTGCAGAC CATGTGCGAG
GCCGGCGGCC TGGCCAACGC GACCATCCTG GAGGCCGTCA TGGCCTGA
 
Protein sequence
MRDAVIVEAV RTPIGKGKPG GALAGVHPVD LLAHTLRTLV GRSGVDPALV DDVIGGCVDQ 
VGEQAMNTTR YAWLSAGFPE SVPATTIDRQ CGSSQQAVHF AAQGVISGAY DLVVACGIES
MSRVPMWSNV PPGADPFGPG LAARYPEGLV PQGISAELIA AKWSIGREEM DVFATSSHRR
AAQAHADGLF DAELAPVATD AGTVTADESV RPGTTPEILA GLRPAYADPA YAERFPQIEW
SVTAGNASPI NDGASAVLIA SGETAARLGL RPKARLHSSA VTGSDPLTML TGIIPATDKV
LRRAGLRLDD IDLFEVNEAF AGVVLAWLRE TGADPAKVNV NGGAIALGHP LGASGTRLMA
TLVNAMHQRG ARYALQTMCE AGGLANATIL EAVMA