Gene Sros_5723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5723 
Symbol 
ID8669017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6263422 
End bp6264747 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content72% 
IMG OID 
Product9-cis-epoxycarotenoid dioxygenase 
Protein accessionYP_003341214 
Protein GI271967018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00560936 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC CGCTCTACCT GCGAGGCTTC CTCGCGCCGG TGCCCGACGA GATCGACGCC 
TTCGACCTGC CGGTCAGCGG GGCCCTTCCC CCCGCGCTCA CCGGCCGCTA CTTCCGCAAC
GGCCCCAACC CGCTGCCCGG CCGCGACCCC GGCCACTGGT TCACCGGGCC CGGGATGGTG
CACGGCGTAC GGCTGCGCGA CGGCCGCGCC GAGTGGTATC GCAACCGCTG GGTGCGGACC
CGCGAGTTCA CCGAGGACGC GCCCTTCGTC CGGGACGACC TCTCCGTCGA CCTCACCGCG
GTGCCCGCCA ACACCCACGT CGTCCCGCAC GGGGACAAGA TCTTCGCGCT GGTCGAGAAC
GGCCTGCCGT ACGAACTCAC CGCCGGGCTG GAGACGGTCG GCCCGTGTGA CTTCGGCGGC
CTGCTGACCA CCGCCATGAC CGCCCACCCC AAGCGCGACC CGCTCACCGG CGAGCTGCTC
TTCTTCGGCT ACGGCTTCCT GCCGCCCTAT CTCACCTACC ACCGGCTCTC GGCGGACGGA
GAGCTCGTGG AGAGCCGCGA GGTCCCGGTG CCGGGGCCGA CGATGATGCA CGACTTCGCC
ATCACCGCCG GTCACGTCGT GTGGATGGAC CTGCCCGTCG TGTTCGACCT GGCGCTCGCC
GAGGGCGGCG GCATGCCGTA CCGGTGGGAC GACCGCTACG GCGCCCGGCT CGGGGTCATG
CCGCGCACCG GCGACGCCGG CGTGACCTGG TCCGACATCA ACCCGTGCTA CGTCTTCCAC
ACGGCCAACG CCCACGAGGA CGGCTCCGGC CGCGTCGTGC TCGACACCGT CCGCTACACC
CCCGCCGAGT TCGCCGCCGT GTGGGACGAC ATCGGCGGCA GTGCCCACCC GGCGGCCAGG
GCGGCCGTGA GCGGGACGGC CCACCTGCAC CGGTATGTCC TCACTCCCGG CGCCGCCTCC
CACGAGGAGC AGCTCGACGA CCTCGACGTG GAGTTCCCCA CGCTGCACGA CGGCCGGACC
GGCGACCGCA ACCGCTACCT CTACGCGGTC TCCTCCGGGG CGATCGTCAA ATACGACGTG
CGGAGCGGGG CGAGCATCCT CCACAAGACA GGACCGGACC GGATGGCAGG CGAGGCGGTC
TTCGTCCCCG CCGAGGACGC GCGGGGAGAG GACGAGGGCT GGCTGATCTC CATCGTCACC
GGCGGGCCCG GTGTCGGCTC CGAACTGCTG GTCCTCGACG CCGTCGACCT GTCCCGGGTG
GCCTCGGTAC GGCTGCCGCG GCGCGTCCCG GCCGGTTTCC ACGGCTCCTG GATCCAGGAC
GGCTGA
 
Protein sequence
MTTPLYLRGF LAPVPDEIDA FDLPVSGALP PALTGRYFRN GPNPLPGRDP GHWFTGPGMV 
HGVRLRDGRA EWYRNRWVRT REFTEDAPFV RDDLSVDLTA VPANTHVVPH GDKIFALVEN
GLPYELTAGL ETVGPCDFGG LLTTAMTAHP KRDPLTGELL FFGYGFLPPY LTYHRLSADG
ELVESREVPV PGPTMMHDFA ITAGHVVWMD LPVVFDLALA EGGGMPYRWD DRYGARLGVM
PRTGDAGVTW SDINPCYVFH TANAHEDGSG RVVLDTVRYT PAEFAAVWDD IGGSAHPAAR
AAVSGTAHLH RYVLTPGAAS HEEQLDDLDV EFPTLHDGRT GDRNRYLYAV SSGAIVKYDV
RSGASILHKT GPDRMAGEAV FVPAEDARGE DEGWLISIVT GGPGVGSELL VLDAVDLSRV
ASVRLPRRVP AGFHGSWIQD G