Gene Sros_3236 details


Gene Information

Locus tag: Sros_3236
Symbol:
ID: 8666524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3524522
End bp: 3525982
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Coproporphyrinogen dehydrogenase
Protein accession: YP_003338922
Protein GI: 271964726
COG category:
COG ID:
TIGRFAM ID:
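
The length fields above are related by simple arithmetic, which the following sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3524522, 3525982)
print(length, protein_length(length))  # 1461 486
```

Both results agree with the Gene Length and Protein Length fields of this record.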


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0363144
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0797924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTGA CTCCGAGTTC CGGCCGGACA CGGACGCGGG TGCTGGAGAC GGTGGCCCTC 
GGGCCGGACG GCATCCCGGC GCGACCGGCC TCCGACGGCG CGACCGGCGC GGGGCCGTAC
CAGGGGTACG TGTACGCCTA CCCGCACAAG ACGGCATACC GGCCGCTGGA GCCGCGTCCG
TCGCTGCGGG AGGTGTGGTC GGGGGAGCCG CTCGGGAGCC TCTTCCTCTA CCTGCACATC
CCGTTCTGCG AGATGCGCTG CGGGTTCTGC AACCTGTTCA CCCGGACCGG CGCCCCCGAG
GAGCTCGTCG CCGCCTACCT GGACGCCCTG GAACGCCAGG CGGAGGCCGT ACGGGACGCC
CTGGAGGAGC CGAGGTTCGT CACCGCGGCC ATCGGCGGCG GCACGCCCAC CTACCTGAGC
GCGGCCGAGC TCACCCGGAT GTTCGACCTG ACCGAACGGA TCATGGGCGC CGACCTGCGG
GCCGTGCCGC TGTCGGTGGA GACCTCGCCG GCGACCGCGA CGGCCGACCG GCTCGCCGTC
CTGGCCGAGC GGGGTACGAC CCGGATCTCG ATCGGCGTGC AGAGCTTCGT CGACGCCGAG
GCCCGCGCGG CGATACGCCC GCAGAAACGG CAGGAGGTCG AGACGGCGCT CGGACACATC
AGGGAGACCG GGTTCGAGGT GCTCAACATC GACCTGATCT ACGGGATCGA CGGGCAGACC
GAGCGGTCCT GGCGCCACTC GCTGGACGCC GCGCTCGCCT GGAAGCCCGA GGAGATCTAC
CTCTACCCGC TGTACGTCCG CCCGCTCACC GGCCTCGGCC GCCGCGCCCG CGACTGGGAC
GACCACCGGC TCGGCCTCTA CCGGCAGGGC CGTGACCACC TGCTGGCCGC CGGATACGAG
CAGGTGTCGA TGCGGATGTT CCGGCTGCGC GGATCCTCCG GCGCGACCGG CTACGACTGC
CAGAGCGACG GCATGGTCGG GCTCGGCTGC GGGGCCCGGT CCTACACCTC CGGCCTGCAC
TACTCCTACG AGTACGCGGT GGGCGCCGGT CAGGTGCGCG CGATCATCGA CGACTACGTA
CGGCTCGCGC CCGGGGAGTT CGCGCTCGCC AACGTGGGGT TCCGGCTGGA CGAGGACGAG
CGGCGCCGCC GCCATCTGAT CCAGTCGCTG CTCCAGGCCG AGGGGCTGGA CCCGGCCGCC
TACCGGGCGC GCTTCGGCAC CGAGGTGACG GCCGACTTCG GCGAGGACCT GGAGCGCCTG
GCCGGCCGGG GCTGGCTGGA GGCGACCGGC TCGGAGGCCG GGACGCGACC CGGCCGGAGC
GGAGCAGACG AGAGCCACGA TGGGGCGGGC GGCGGTCGGC TCCGGCTCGC CGCCGAGGGG
CTGGCGCACT CCGACGCCAT CGGCCCGGCG CTGTTCTCCG GCCGGGTCCG CGAGCTGATG
GCGGGATACG AGAACAGTTG A
 
Protein sequence
MAVTPSSGRT RTRVLETVAL GPDGIPARPA SDGATGAGPY QGYVYAYPHK TAYRPLEPRP 
SLREVWSGEP LGSLFLYLHI PFCEMRCGFC NLFTRTGAPE ELVAAYLDAL ERQAEAVRDA
LEEPRFVTAA IGGGTPTYLS AAELTRMFDL TERIMGADLR AVPLSVETSP ATATADRLAV
LAERGTTRIS IGVQSFVDAE ARAAIRPQKR QEVETALGHI RETGFEVLNI DLIYGIDGQT
ERSWRHSLDA ALAWKPEEIY LYPLYVRPLT GLGRRARDWD DHRLGLYRQG RDHLLAAGYE
QVSMRMFRLR GSSGATGYDC QSDGMVGLGC GARSYTSGLH YSYEYAVGAG QVRAIIDDYV
RLAPGEFALA NVGFRLDEDE RRRRHLIQSL LQAEGLDPAA YRARFGTEVT ADFGEDLERL
AGRGWLEATG SEAGTRPGRS GADESHDGAG GGRLRLAAEG LAHSDAIGPA LFSGRVRELM
AGYENS
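
The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch like the one below. It assumes that the whitespace in the sequence blocks is only layout, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `translate` and `gc_content` are hypothetical, not part of any tool referenced by this record.

```python
from itertools import product

# Standard codon-to-amino-acid assignments (shared by translation table 11),
# listed in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence.
print(translate("ATGGCGGTGA CTCCGAGTTC CGGCCGGACA"))  # MAVTPSSGRT
# Last 21 bases; in frame because the 1440 preceding bases are a multiple of 3.
print(translate("GCGGGATACG AGAACAGTTG A"))  # AGYENS
```

The two fragments reproduce the first ten and last six residues of the protein sequence. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field, though the rounding convention used by the database is not stated.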