Gene Sros_4429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4429 
Symbol 
ID8667723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4942467 
End bp4943819 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content69% 
IMG OID 
Productdyp-type peroxidase family protein 
Protein accessionYP_003340042 
Protein GI271965846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0133678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGG AGGCAAGCAC GAGCGGAGCC AGTGCGGGAA TCGAGATCGA CGACATCCAA 
AGCGGGGCAT TGCATCCACG GCCTGCACCG TACGAGGGAA GGTTCATCTT CCTGCGGGTA
GATGACCGTC ACGCGGGGCG TTCCCTGCTG CGGCGGCTGC TCCCGGCGAT CGAGGGGGGT
TTCCACAGCG CGGACCCGAG CCAGGATGCC TGGGTGGCGG TGGCGTTCAC CTACCAGGGG
CTGCGGGCCC TGGGGGTGCC CCAGGAGTCG CTGGACAGCT TTCCGCGGGC GTTTCGCCAG
GGCATGGCTG CGCGTGCGGA CCTGATCGGC GACGTGGGTG AGAGCGCCCC GGCTCACTGG
GAGCCGCCGT TCGGAACAGC CGATGTGCAT ATCGCGTTGA GTGCTCTGTC ACCCGATGCG
GCGCGGCTGG ACAAGGCTCT GGAGCGAGCC CGCATCGCCT GCCGGGACAC CCCTGGTGTC
CAGGTGATCT GGCAGCAGGA GGTCCGCCAG CTCCCGACCG GGCGTACCAC CTTCGGCTTC
CGCGACGGCA TCAGCCATCC GAACATCGAG GGCCTCGGGC TGCCCGGCTC CAACCCCCGG
GAAGCCCCTC TCAAGGCCGG CGAGTTCATC CTCGGCTACC CCGACGAGAC CGGCAATCTG
CCGCCCATGC CCAGCCCCGA TGTGCTGGGG CGCAACGGGA CCTACGTCGC TGTGCGCAAG
ATTCACACCA AGGTGGCGGC CTGGCGCCAG TACCTGCGCG CGAACACCTC CAGCGCCCGG
GAAGAGGCGC TCCTGGCGGC GAAGATGGTC GGGCGCTGGC CCAGCGGGGC ACCGTTGACG
CTGACCCCGG AGCACGACGA CGCGGCGCTG GGCGCCGATC CGCACCGCAA CAACGACTTC
CTGTACCGGG AGAACGACGA TCGAGGCTTC CGATGCCCCG CTGGTGCGCA CATCCGGCGC
ACCAACCCCC GCGATGCCGC CATCATCGGC GACGCACGGA TGCACCGCCT CATCCGCCGC
GGCACCAGCT ACGGCCCGCC GCTGCCAGAG GGCGTGCTGG AGGACGACGG CGCCGACCGG
GGCCTGGTCG GAGTCTTCAT CGGAGCTCAT ATCGAACGAC AGTTTGAATT CATCAAGGCC
GAGTGGGTCA ACGACGGCAA CTTCATCGGC TTCCCCGGCG AGAAGGATCC GGTGGCCGGG
CATCACGACG GAACCGGCAG CGCCACCATC CCGGAGAGGC CGATCCGGCG GCGCCTGCAG
AACCTGCCCA GCTTCGTGGT CACCCGAGGC GGCGAGTACT GCTTCCTGCC GGGTCTGCGC
GCCCTGCGCT GGCTGGCCGA ACTGAGGGAC TGA
 
Protein sequence
MNAEASTSGA SAGIEIDDIQ SGALHPRPAP YEGRFIFLRV DDRHAGRSLL RRLLPAIEGG 
FHSADPSQDA WVAVAFTYQG LRALGVPQES LDSFPRAFRQ GMAARADLIG DVGESAPAHW
EPPFGTADVH IALSALSPDA ARLDKALERA RIACRDTPGV QVIWQQEVRQ LPTGRTTFGF
RDGISHPNIE GLGLPGSNPR EAPLKAGEFI LGYPDETGNL PPMPSPDVLG RNGTYVAVRK
IHTKVAAWRQ YLRANTSSAR EEALLAAKMV GRWPSGAPLT LTPEHDDAAL GADPHRNNDF
LYRENDDRGF RCPAGAHIRR TNPRDAAIIG DARMHRLIRR GTSYGPPLPE GVLEDDGADR
GLVGVFIGAH IERQFEFIKA EWVNDGNFIG FPGEKDPVAG HHDGTGSATI PERPIRRRLQ
NLPSFVVTRG GEYCFLPGLR ALRWLAELRD