Gene Sros_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4474 
Symbol 
ID8667768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4987198 
End bp4988331 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content72% 
IMG OID 
Productsalicylate hydroxylase protein 
Protein accessionYP_003340084 
Protein GI271965888 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.102807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA CTCCGACACC CCGCATCGCC ATCATCGGGG CCGGTCCCGG CGGCCTGATC 
TGCGCTCGCA TCCTCCAGCA GCACGGCATC ACCGCCGCCG TCTACGACCG CGACGCCGGC
CCCGCCGCCC GCGACCAGGG CGGCACTCTC GACCTGCACG CCGACAACGG CCAGATCGCT
CTGCGCGAAG CCGGCCTCCT GGAGGAGTTC TTCCGACTGG CCCGGCCCGA GGGCCAGGAG
ATGCGCCAGA TGGACCCGGC CGGCACGATC CTCTTCCACC ACGTCCCCGA GCAGGGCGAG
CGGTTCAAAC CGGAAATCGA CCGCGGCAGG CTGCGCGACC TGCTGCTCGA CTCGCTTCAG
CCCGGCACCG TGCGCTGGGG CCATGCCCTG CAGACCGTCA GCGGCCCCGC CGAAGGCCCC
CGGCAGCTGC ACTTCACGGG CGGCACCACC ATCGAAGCCG ACCTCGTCGT CGGCGCCGAC
GGCGCCTGGT CCAAGGTCCG CCGCGCCCTC TCCCAGGCCA CCCCCCGCTA CAGCGGCGTA
AGCTTCCTGG AAGCCTGGTT CCACGATGTC GCGACCCGGC ACCCCGACAT CGCCGAGCTC
GTCGGCCAGG GCGGCGCCGC CGCAGCCGAC GGCGACCGCG GCCTGTTCGC CCAGCGCAAC
AGCGGCGACC ACATCCGCGT CTACATCATC CAGCGCGTCC CGGCCGACTG GATCACCGCC
GGCGGTCTCA CCCCCCAGGC CACCGACGGC ATCCGCGCCC TCCTCCTGGA GCGCTACCGC
GACTGGTCGC CCCGCCTGCG CCGGCTGATC AGCGACAACG ACGGCCCCTA CGTCGACCGC
CCGATCTTCG CCCTGCCCGT CCCGCACGCC TGGGAGCACA ACCCCACGGT GACCCTGCTC
GGCGACGCCG CCCACCTCAT GCCCCCGCTC GGCGTCGGCG TCAACCTCGC CATGCTGGAC
GCATGCGAAC TCGCCCTCGC CATCGCCTGC CACGACACCA TCGACGAAGC CATCCACGCC
TACGAGGAGA CCATGCTTCC CCGCTCCACG GAGATGGCCC AGCTCCTCGA CGGCGCCGCC
GGCGAGCTGC TGTCCACCGA GCTGCCCGAC TTCGCCACCG CCGGCAACCA CTGA
 
Protein sequence
MNTTPTPRIA IIGAGPGGLI CARILQQHGI TAAVYDRDAG PAARDQGGTL DLHADNGQIA 
LREAGLLEEF FRLARPEGQE MRQMDPAGTI LFHHVPEQGE RFKPEIDRGR LRDLLLDSLQ
PGTVRWGHAL QTVSGPAEGP RQLHFTGGTT IEADLVVGAD GAWSKVRRAL SQATPRYSGV
SFLEAWFHDV ATRHPDIAEL VGQGGAAAAD GDRGLFAQRN SGDHIRVYII QRVPADWITA
GGLTPQATDG IRALLLERYR DWSPRLRRLI SDNDGPYVDR PIFALPVPHA WEHNPTVTLL
GDAAHLMPPL GVGVNLAMLD ACELALAIAC HDTIDEAIHA YEETMLPRST EMAQLLDGAA
GELLSTELPD FATAGNH