Gene Sros_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3398 
Symbol 
ID8666686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3738599 
End bp3739825 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content74% 
IMG OID 
ProductDyp-type peroxidase 
Protein accessionYP_003339078 
Protein GI271964882 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.528141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.50105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACC CCCGCTTGAC TCGCAGGGGA CTCCTCGCGG GAGGCGCCGC CGCAGCGGCC 
GGCGCCCTCG CGGGGTGCGC CCCCGGGCAG GTCGCCCTCT CCGGGCCGGC CGTCCCCCCC
ACCACGCCGG AAACGGCTCC CCCGATCTCC GGCGCCTCGG CGACGGAGCC CTTCCACGGC
CCCCACCAGG CCGGGATCGC CACCACCCCC CAGACCCACG CGGTGTTCGT CGGCCTGGAC
CTGCTGCCCG GCACCGGCCG CGAAGCCGTC GTCCGGATGA TGCGCCTGCT CACCGACGAC
GCCCGCCGCC TGAGCGAGGG CCGCCCCGCC CTGGCCGACA CCGAGCCGGA ACTGGCCGCG
CCCCCCGCCC GGCTGACCGT CACCTTCGGC TTCGGTCCCG GCCTGTTCGC CGCGGCCGGA
GTCCAGGACC GGCGGCCCGG GTCGATCGCG CCCCTGCCCG GGTTCGTGGT CGACAAGCTG
GAGAAGCGGT GGACAGGCGC GGACCTGCTG CTGCAGCTCT GCGCCGACGA CCCCGTCACC
CTCGCCCACG CCCTGCGCAT GACGATCAAG GACGCCCGCT CCTTCGCCCG GGTGCGCTGG
ACCCAGCGGG GGTTCCGCCG CAGCCCGCAG GCCGCGGCCC CCGGCACGAC CCAGCGCAAC
CTCATGGGCC AGCTGGACGG GACCGTCAAC CCCCAGCCGG GCACGCCGGA CTTCGACCGG
GCCGTCTGGG TCGGCGACGG CCCGCGGTGG CTGCATGGCG GCACCACCCT GGTGCTGCGG
CGCATCCGCC TCAAGCTGGA GACCTGGGAC GCCGCCGACC GGGTGGCCAA GGAGTTCACC
ATCGGCCGCC GCCTGGACAC CGGCGCCCCG CTGACCGGGC AGAAGGAGCG CGACGAGCCC
GACTTCGACA AGCTCAACGC GGTCGGCTTC CCGGTCATCT CCGAATACGC CCACATCCGC
CGGGCCCACG TCACCGACCC GGGCATGCGG ATCCTGCGCC GGGTCTACAA CTACGACGAG
GGCCTCACCC CCGAGGGACA CGCCGACTCC GGACTGCTGT TCGCCTCCTA CCAGGCCGAC
ATCGACCGCC AGTTCGTCCC CATCCAGAAG AGACTGGCCG AGGCCGACCT GCTCAACGAG
TGGACGACCC CCATCGGCTC GGCGGTCTTC GCCATCCCTC CCGGATGCGC ACGCGGCGGA
TGGGTAGGAG AGACCCTCCT GTCCTGA
 
Protein sequence
MPDPRLTRRG LLAGGAAAAA GALAGCAPGQ VALSGPAVPP TTPETAPPIS GASATEPFHG 
PHQAGIATTP QTHAVFVGLD LLPGTGREAV VRMMRLLTDD ARRLSEGRPA LADTEPELAA
PPARLTVTFG FGPGLFAAAG VQDRRPGSIA PLPGFVVDKL EKRWTGADLL LQLCADDPVT
LAHALRMTIK DARSFARVRW TQRGFRRSPQ AAAPGTTQRN LMGQLDGTVN PQPGTPDFDR
AVWVGDGPRW LHGGTTLVLR RIRLKLETWD AADRVAKEFT IGRRLDTGAP LTGQKERDEP
DFDKLNAVGF PVISEYAHIR RAHVTDPGMR ILRRVYNYDE GLTPEGHADS GLLFASYQAD
IDRQFVPIQK RLAEADLLNE WTTPIGSAVF AIPPGCARGG WVGETLLS