Gene Sros_1535 details

Gene Information

Locus tag: Sros_1535
Symbol:
ID: 8664811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1623736
End bp: 1625187
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: 9-cis-epoxycarotenoid dioxygenase
Protein accession: YP_003337271
Protein GI: 271963075
COG category:
COG ID:
TIGRFAM ID:
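
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1623736
end_bp = 1625187
gene_length_bp = 1452
protein_length_aa = 483


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions the listed gene length matches the coordinate span, and 1452 bp corresponds to 484 codons, that is 483 residues plus the stop codon.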


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.194369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.146884
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAC CTGTGGTGGT CATCGACGAG TCGGGCGAGC CGAATCCGTA CCTGATGGGC 
GTCTACGCCC CGGTCCAGGA CGAGATCACG GCGGAGAACC TCAAGGTCGT CGGGGAGATC
CCGAAGGATC TCAACGGCGT CTACCTGCGC AACGGGCCGA ACGCGCGGTT CCCGGCCAAG
GGGCGCTACC ACTGGTTCGA CGGGGACGGC ATGGTGCACG CCGTGCACTT CGAGAACGGC
AGGGCCCGCT ACCGCAACCG CTATGTCAGG ACCAGGGCGT TCGAGGCGGA GTCGGCCGCC
GGAAGATCGC TCTGGACCGG CGTCATGGAG AACCCCAAGG GCAACCCGTT CGGCAACACC
CGCGGGCTCA ACCTCAAGGA CTCGGCCAAC ACCGATGTGA TCTTCCATCG GGGCAAGGTC
CTCACGACCT GGTACCTGTG CGGCGCGCCG TACGGCATGG ACCCCCTCAG CCTGGAGGCG
CTGGGCGCGG AGACCTTCCT CGACACCCTG ACCGGCGACT TCATGGCGCA CCCCAAGCTG
GACGAGCGGA CCGGGGAGCT GTTCTGGTTC GACTACGGGC CCGGCAGGCC CTATCTGCGC
TACGGGGTGG TCGGCGCCGG AGGCCGGGTG GAGCACTGCG TGGAGCCGGA CCTGCCCGGG
GCACGGCTGC CGCACGACAT GGCGATCACC GCCAACCACG CGATCCTCAT GGACCTGCCG
CTCTACCAGG ACATGGACGC CGCCCGGCAG GGCCGCTACA AGCTGACCTT CAACCGGGAG
CTGCCCTCCC GCTTCGGGGT CATCCCGCGC CGGGGCCAGG CGCACGAGAT CCGCTGGTTC
GAGGCGGAAC CCTGCTACAT CTACCACGTC GTCAACTCCT GGGAGGAGAG CGACGAGATC
GTCATGGACG TGTGCCGGGT GTCGCGGCCC GCGCCCGCCG GGAGCGGGAG CCCGCTGGCC
CGGATGATCT CCTACCTCAA GCTCGACGCC CGGATGCACC GCTACCGGTT CGACCTGCGC
ACCGGCCGGA CGCACGAGGA GTCCGTCGAC CCGGACCACA ACACCGAGTT CCCGTCGATC
GACGCCCGCC TGACCGGCCG GAGGTCGCGC TACGCCTACA ACGTCTCGGT CAAGGACGCC
GCCACCAACC TCTTCGACGG CCTGGTCCGC TACGACAACG TGACCGGCGC CAAGGAGACC
TACTCCTACG GCGAGCACCG CTACGGCAGC GAGGCGCCGT TCGCCCCGCG TGACGGCGCC
ACCGCCGAGG AGGACGGCTA CCTCGTCAGC TTCGTCACCG ACGAGCGCGA GGGCACCTCC
GAGGTGCAGG TCCTGCACGC CGCGGACCTG AGTGCCGGGC CGGTAGCCCG GATCATCCTC
CCCCAGCGCG TGCCGCTCGG CTTCCACGCC ACCTGGGTCC GCGCCGACCA GCTGAAGGGC
GGGACCGGAT GA
 
Protein sequence
MAEPVVVIDE SGEPNPYLMG VYAPVQDEIT AENLKVVGEI PKDLNGVYLR NGPNARFPAK 
GRYHWFDGDG MVHAVHFENG RARYRNRYVR TRAFEAESAA GRSLWTGVME NPKGNPFGNT
RGLNLKDSAN TDVIFHRGKV LTTWYLCGAP YGMDPLSLEA LGAETFLDTL TGDFMAHPKL
DERTGELFWF DYGPGRPYLR YGVVGAGGRV EHCVEPDLPG ARLPHDMAIT ANHAILMDLP
LYQDMDAARQ GRYKLTFNRE LPSRFGVIPR RGQAHEIRWF EAEPCYIYHV VNSWEESDEI
VMDVCRVSRP APAGSGSPLA RMISYLKLDA RMHRYRFDLR TGRTHEESVD PDHNTEFPSI
DARLTGRRSR YAYNVSVKDA ATNLFDGLVR YDNVTGAKET YSYGEHRYGS EAPFAPRDGA
TAEEDGYLVS FVTDEREGTS EVQVLHAADL SAGPVARIIL PQRVPLGFHA TWVRADQLKG
GTG