Gene Sros_4537 details


Gene Information

Locus tag: Sros_4537
Symbol:
ID: 8667831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5054939
End bp: 5056174
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: carotenoid oxygenase
Protein accession: YP_003340145
Protein GI: 271965949
COG category:
COG ID:
TIGRFAM ID:
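
The length fields above can be cross-checked from the coordinates. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(5054939, 5056174)
print(length_bp, protein_length(length_bp))
```

For this record the coordinates give 1236 bp, which corresponds to 411 residues once the stop codon is excluded, in agreement with the Gene Length and Protein Length fields.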


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.368379
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCA CGGAGCACGT CGACCCGATC ATGGCGCTCA TGTCCGCGGC GCCCCGCCGC 
CACCGGACGC ACGACGAGCC GCTGCCGGTC GACGGCGTCA TACCTTCCGG CCTGGACGGC
GCGTTCGTGC AGGCGAGCAC GTATCCCGGG GGATCGTGGC AGTCGCACGC CACGGCCGGC
CCCGTCCTCT TCTCCGGCGT ACGGCTCGGC GGCGGCACCG CTCGGCGCCT CACGACGGCC
GGGGAGTTCG GCGGCCACCC GCTGGAGCGG ATGCCGGACC TCGCGACCTG GATCCGGCCC
GCCGGATCCG CCGCCCGGCC GCCGGACGGG CCGTGGAGCG CGAGCCTCGC GCCGCCGGTC
CAGGACCGGG CCACGGCCGA ATGGCACACG ATCGCCACCT ATCCCGGTCT GGGCTGCGCG
GAGCACCTGA CCCTCGGGAC GGACGGCGGC ATCCGCGACG CCCGGCCCTT CGCCCTCGAC
GGTGCACCGC TCATGCACGC GGTCGCGCTC ACCGAACGGT TCGTCGTGGT GTTCGACCTG
CCCGTGACCT ATCACCGGGC GGCGGCGATG GTCGGCACCC GATTCCCCTA CCGCTGGCGG
CGGGACCGGC CGGCGCGCAT CGGACTGCTG TCACGGCGGC CCGGCGACGC GACGGAACCC
CGCTGGTTCC CGATCGACCC CTGCTACGTG TCCCATTCGG TCAACGCCTA CGACGACGGC
GGCCGCGTCG TCGTGGACGC CGTCCGCCAC GAGCGGGCCT TCGACGCTCC GTCGTGGGAC
GGCGAGGACG GCGCCGGGGC GCCGCGGGTG CACCGGTGGA CGCTCGACCT GGGGAGCGGC
GCGGCGGAGG AGCGGCCGCT GGTCGACTCC ATGACGCTGG CGTCGGTCGA CTCCCGGCGG
GCCGGCCGCA GGCATCAGCT GATGTTCGGC CGCACCCCCG GCGGGCGGGC GCTGGTCGGC
CACGACCTCG CGGCCGGCAG CACGCAGGTG CGGGAGCTCG CCCCGGGCCT GCGCGCCGGC
CAGCCGGTCT TCGTCCCCCG TGGCCGCGCC GAGGGAGACG GCTGGCTCGT GGTCCTCACG
CAGGACGGCG CGCGGCGGCG GAGCGAGCTG CTCGTGCTCG ACGCGCTCCA CCTGAACGGC
CGGCCCCAGG CGGTGGTCCA CCTCCCGGCC CTCCTGCCGG ACGCGCGGCA CACCACCTGG
ATGACCACAC CCGCCGGGCG TGCGCACCGG CGGTGA
 
Protein sequence
MTVTEHVDPI MALMSAAPRR HRTHDEPLPV DGVIPSGLDG AFVQASTYPG GSWQSHATAG 
PVLFSGVRLG GGTARRLTTA GEFGGHPLER MPDLATWIRP AGSAARPPDG PWSASLAPPV
QDRATAEWHT IATYPGLGCA EHLTLGTDGG IRDARPFALD GAPLMHAVAL TERFVVVFDL
PVTYHRAAAM VGTRFPYRWR RDRPARIGLL SRRPGDATEP RWFPIDPCYV SHSVNAYDDG
GRVVVDAVRH ERAFDAPSWD GEDGAGAPRV HRWTLDLGSG AAEERPLVDS MTLASVDSRR
AGRRHQLMFG RTPGGRALVG HDLAAGSTQV RELAPGLRAG QPVFVPRGRA EGDGWLVVLT
QDGARRRSEL LVLDALHLNG RPQAVVHLPA LLPDARHTTW MTTPAGRAHR R
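
The sequences above are printed in blocks of 10 characters, 60 per line. The following is a minimal Python sketch for joining such blocks and computing a GC fraction. The helper names are hypothetical, and the sample string is only the first line of the gene sequence, not the full record, so its GC fraction is not expected to equal the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10).
first_line = "ATGACGGTCA CGGAGCACGT CGACCCGATC ATGGCGCTCA TGTCCGCGGC GCCCCGCCGC"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence instead of the single line gives a value that can be compared with the GC content field.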