Gene Sros_4150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4150 
Symbol 
ID8667444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4619112 
End bp4620380 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content72% 
IMG OID 
Product3-oxoacyl-(acyl-carrier-protein) synthase II 
Protein accessionYP_003339797 
Protein GI271965601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0596786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00141331 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTCAACG ACAGGCGCGT CGTCATCACC GGGATCGGCG TGACGGCGCC GGGCGGCATC 
GGGACCAAGG CGTTCTGGGA GCTGCTCACG GCCGGCCGTA CCGCCACCCG TAACATCACG
CTGTTCGACG CGTCCGGGTT CCGCTCCCGC ATCGGGGCGG AGTGCGACTT CGATCCCGCC
GCGGAGGGCC TGAGCCCGCA GGAGATCCGG CGGATGGACC GGGCCGCGCA GTTCGGCGTC
GTGTGCGCCA GGGAGGCCCT GGCCGACAGC GGCCTGGAGG AGGGCGCGGT CCCGCCGGAG
CGGATCGGCG TCAGCATCGG CAGCGCCGTG GGCTGCACGA TGGGCCTTGA GCAGGAGTAC
GTCGTGCTGA GCGACGGCGG CCGGCGCTGG CTCGTCGACC CCGCCTACGC GGTGCAGCAC
CTGTACGGCT ACATGGTGCC GAGCACGATG GCGTGCGAGG TCGCGTGGGC GGCGGGGGCC
GAGGGGCCGG TGTCGCTGAT CTCCACCGGG TGCACGGCGG GGCTGGACGC GGTCGGCAAC
GGCTGCCAGC TGATCTGGAC GGGCCGGGCC GACGTCGTGA TCGCCGGAGC CACCGACGCG
CCGCTCTCGC CGATCACGTC GGCGTGCTTC GACGCGATCA AGGCGACCTC GCCGAACAAC
GACGACCCGG CACATGCCTC CCGGCCGTTC GACGCCGATC GGGACGGGTT CGTGCTCGGA
GAGGGCGCCG CCGTCTTCGT GCTGGAGGAG CGGGAGGCCG CGCGCCGGCG CGGAGCGCAC
ATCTACGCGG AGATCGTCGG GTTCGCCGGC CGCAGCAACG CCTACCACAT GACCGGTCTC
AAGCCCGACG GGCGCGAGAT GGCCGAGGCG ATCCGCCAGG CCATGCACCT GGGGCGCGTG
GACGCCGCGG ACATCGACTA CATCAACGCG CACGGTTCGG GCACCAAGCA GAACGACAGG
CACGAGACCG CGGCCTTCAA GCGCGCGCTC GGCCAGCGTG CCTACGAGGT GCCGGTCAGC
TCCATCAAGT CGATGATCGG GCACTCGCTC GGGGCCATCG GGGCGATCGA GGTGGCCGCC
TGCGCGCTGG CCATCGAGCA CCAGGTGGTG CCGCCGACGG CGAACCTGCA CACCCGCGAT
CCCGAGTGCG ACCTGGACTA CGTGCCGCTG ACCGCGCGTG AGCACCCGAT CGACTCCGTG
CTCAGCGTCG GCAGCGGGTT CGGCGGCTTC CAGACCGCCA TGGTCATCGT GCGGGACCGG
GCGGCATGA
 
Protein sequence
MVNDRRVVIT GIGVTAPGGI GTKAFWELLT AGRTATRNIT LFDASGFRSR IGAECDFDPA 
AEGLSPQEIR RMDRAAQFGV VCAREALADS GLEEGAVPPE RIGVSIGSAV GCTMGLEQEY
VVLSDGGRRW LVDPAYAVQH LYGYMVPSTM ACEVAWAAGA EGPVSLISTG CTAGLDAVGN
GCQLIWTGRA DVVIAGATDA PLSPITSACF DAIKATSPNN DDPAHASRPF DADRDGFVLG
EGAAVFVLEE REAARRRGAH IYAEIVGFAG RSNAYHMTGL KPDGREMAEA IRQAMHLGRV
DAADIDYINA HGSGTKQNDR HETAAFKRAL GQRAYEVPVS SIKSMIGHSL GAIGAIEVAA
CALAIEHQVV PPTANLHTRD PECDLDYVPL TAREHPIDSV LSVGSGFGGF QTAMVIVRDR
AA