Gene Sros_8830 details

Gene Information

Locus tag: Sros_8830
Symbol:
ID: 8672168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9748232
End bp: 9749956
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptide synthetase
Protein accession: YP_003344206
Protein GI: 271970010
COG category:
COG ID:
TIGRFAM ID:
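
The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 9748232
end_bp = 9749956

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1725 bp) and Protein Length (574 aa) fields.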


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.426227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000549729
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGACG CGCTGATCGC CCGGGTTGAG CGGTGGGCGA GGGAGATCCC GGACGCTCCC 
GCGTACACCT TCCTGGACTA CTCCGCCGGA CACGACGGGG TGAAGCACAC GCTCTCCTGG
GCGAAGGCGG ATCTCAAGGC GCGGGCGCTG GCCGTACGGC TGCGCGAGGT GACGCTCCCC
GGTGACCGGG CGGCGATCCT GGCGCCGCAG GGGCTGGAGT ACGTGGTCGC GATGCTCGGG
ACCATGTACG CCCGGGTGGT CGCGGTGCCG CTGTTCGCAC CCGACCTGCC CGGCCACGCC
GACCGGCTGA TCCGGGCCTA CGCCGACGCC GACCCGGTCT GCGTGCTCAC CACGACCTCC
GCGCTGGACA GCGTCCACGC CTTCCTGGAC GGCGGGTCCG CGCCCCGGCC CAAGCAGGTC
ATCACGCTGG ACGCGGTCTC CGACCTGCTC GCCGACGAGT GGGAGCCCGA GCCGATCGGG
CTCGGCGACC TGGCCTACCT GCAGTACACC TCCGGCTCGA CCCGCGCCCC CGCGGGAGTG
GAGATCAGCC ACGCCAACTT CACCGCGAAC GCCGAGCAGC TCTGGGAGGC CTTCCGGGCC
ACCCCCCGGG TGTCCACGGC GGCCCTGTGG CTGCCGCTCT TCCACGACAT GGGCCTGATC
GCCACGATCG CCGCGCCGAT GGTGGGCGGA AACCAGGCGG TGTTCATGGA CCCGGTCGCG
TTCGTCATGC ACCCGGTGCG GTGGCTGCGG ATGCTCAGCG AGTACGACGA CGTGTTCACC
GGCGGCCCCA ACTTCGCCTT CGAGTACACG GCCGGCCGGG TCACCGACGA GGAGAAGGCC
ACGCTCGACC TGTCCGGGGT CTCGGTCATG CTCAACGGCG CCGAGCCGCT GCGCGGCAGC
ACGATCGACC GGTTCTCCGA GACGTTCGCC GCGTGCGGGC TGCGGCCCGA GGCGCACACC
CCCGGGTACG GCCTGGCGGA GGCGACGGTG TTCGTCACGG TGATGGACCG GGACCTGCCG
GCCCGGGTGA CCCTGTTCGA CAGGGACGCG CTGACCGCCG GGCGGGCCGT GCCGTACACG
GGTGAGGGAC GGGTCAGCGA CCTGGTCTCC TGCGGGGTGC CGACGGGCCA GCGGGTCGCC
ATCGTCTCGG AGTCCGGTAC GGCGAAGCCG GACGGCGAGG TGGGCGAGAT CTGGGTGCAG
GGGCCGAACG TGGCGCGGGC CTACTGGCGG GACGAGGAGC GCAGCGCCGA GGTCTTCGGC
AACGTGCTCG ACGGCGCGGA CGGCACCTGG CTGCGGACCG GGGACCTGGG CGTCGTCCAC
GAGGGCGAGC TCTACATCAC CGGGCGGATC AAGGACCTGA TCATCGTCGA CGGGCGCAAC
CACTATCCAC AGGATGTGGA GGTGACCGTG CAGGAGGCCG ACCAGGCCGT CCGCCGGGAC
CACGTGGCGG CGTTCGCCCT GCCGGGGGAG GAGACCGAGC GGCTGGTCGT GGTGGCCGAG
CGGTCCCGCA GGGCCGCCGG GCGCGACCTC GCCGAGGTCA CGGCCAACAT CCGCGCGGCT
GTCGCGAAAA ACCATGATCT GCGGCTGCAT GACTTCGTGC TGACCGAGGC GGGAGTCGTG
CCGCGCACGT CGAGCGGGAA GATCGCCCGC AGGGCGTGCG TACTGGCCTA CCTGGACGGC
GCGTTCGGCC CCCGGCCCGC CGGGCCCAGG GACGCCGGCG TCTGA
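
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence printed in blocks of ten bases. This is a minimal sketch, not the method used to generate this record; the function name `gc_content` is hypothetical, and it assumes the input contains only the letters A, C, G and T plus whitespace.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to format the sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAGCGACG CGCTGATCGC CCGGGTTGAG CGGTGGGCGA GGGAGATCCC GGACGCTCCC"
print(f"GC fraction of first line: {gc_content(first_line):.3f}")
```

Passing the full gene sequence instead of the first line gives the value for the whole gene.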
 
Protein sequence
MSDALIARVE RWAREIPDAP AYTFLDYSAG HDGVKHTLSW AKADLKARAL AVRLREVTLP 
GDRAAILAPQ GLEYVVAMLG TMYARVVAVP LFAPDLPGHA DRLIRAYADA DPVCVLTTTS
ALDSVHAFLD GGSAPRPKQV ITLDAVSDLL ADEWEPEPIG LGDLAYLQYT SGSTRAPAGV
EISHANFTAN AEQLWEAFRA TPRVSTAALW LPLFHDMGLI ATIAAPMVGG NQAVFMDPVA
FVMHPVRWLR MLSEYDDVFT GGPNFAFEYT AGRVTDEEKA TLDLSGVSVM LNGAEPLRGS
TIDRFSETFA ACGLRPEAHT PGYGLAEATV FVTVMDRDLP ARVTLFDRDA LTAGRAVPYT
GEGRVSDLVS CGVPTGQRVA IVSESGTAKP DGEVGEIWVQ GPNVARAYWR DEERSAEVFG
NVLDGADGTW LRTGDLGVVH EGELYITGRI KDLIIVDGRN HYPQDVEVTV QEADQAVRRD
HVAAFALPGE ETERLVVVAE RSRRAAGRDL AEVTANIRAA VAKNHDLRLH DFVLTEAGVV
PRTSSGKIAR RACVLAYLDG AFGPRPAGPR DAGV
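
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is illustrative and makes some assumptions: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted). It does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, which encode the first 10 residues.
print(translate("ATGAGCGACG CGCTGATCGC CCGGGTTGAG"))  # prints MSDALIARVE
```

The result matches the first block of the protein sequence, and translating the final bases `GACGCCGGCG TCTGA` yields `DAGV` before the TGA stop codon, matching the end of the protein.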