Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5723 |
Symbol | |
ID | 8669017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 6263422 |
End bp | 6264747 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | 9-cis-epoxycarotenoid dioxygenase |
Protein accession | YP_003341214 |
Protein GI | 271967018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00560936 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCC CGCTCTACCT GCGAGGCTTC CTCGCGCCGG TGCCCGACGA GATCGACGCC TTCGACCTGC CGGTCAGCGG GGCCCTTCCC CCCGCGCTCA CCGGCCGCTA CTTCCGCAAC GGCCCCAACC CGCTGCCCGG CCGCGACCCC GGCCACTGGT TCACCGGGCC CGGGATGGTG CACGGCGTAC GGCTGCGCGA CGGCCGCGCC GAGTGGTATC GCAACCGCTG GGTGCGGACC CGCGAGTTCA CCGAGGACGC GCCCTTCGTC CGGGACGACC TCTCCGTCGA CCTCACCGCG GTGCCCGCCA ACACCCACGT CGTCCCGCAC GGGGACAAGA TCTTCGCGCT GGTCGAGAAC GGCCTGCCGT ACGAACTCAC CGCCGGGCTG GAGACGGTCG GCCCGTGTGA CTTCGGCGGC CTGCTGACCA CCGCCATGAC CGCCCACCCC AAGCGCGACC CGCTCACCGG CGAGCTGCTC TTCTTCGGCT ACGGCTTCCT GCCGCCCTAT CTCACCTACC ACCGGCTCTC GGCGGACGGA GAGCTCGTGG AGAGCCGCGA GGTCCCGGTG CCGGGGCCGA CGATGATGCA CGACTTCGCC ATCACCGCCG GTCACGTCGT GTGGATGGAC CTGCCCGTCG TGTTCGACCT GGCGCTCGCC GAGGGCGGCG GCATGCCGTA CCGGTGGGAC GACCGCTACG GCGCCCGGCT CGGGGTCATG CCGCGCACCG GCGACGCCGG CGTGACCTGG TCCGACATCA ACCCGTGCTA CGTCTTCCAC ACGGCCAACG CCCACGAGGA CGGCTCCGGC CGCGTCGTGC TCGACACCGT CCGCTACACC CCCGCCGAGT TCGCCGCCGT GTGGGACGAC ATCGGCGGCA GTGCCCACCC GGCGGCCAGG GCGGCCGTGA GCGGGACGGC CCACCTGCAC CGGTATGTCC TCACTCCCGG CGCCGCCTCC CACGAGGAGC AGCTCGACGA CCTCGACGTG GAGTTCCCCA CGCTGCACGA CGGCCGGACC GGCGACCGCA ACCGCTACCT CTACGCGGTC TCCTCCGGGG CGATCGTCAA ATACGACGTG CGGAGCGGGG CGAGCATCCT CCACAAGACA GGACCGGACC GGATGGCAGG CGAGGCGGTC TTCGTCCCCG CCGAGGACGC GCGGGGAGAG GACGAGGGCT GGCTGATCTC CATCGTCACC GGCGGGCCCG GTGTCGGCTC CGAACTGCTG GTCCTCGACG CCGTCGACCT GTCCCGGGTG GCCTCGGTAC GGCTGCCGCG GCGCGTCCCG GCCGGTTTCC ACGGCTCCTG GATCCAGGAC GGCTGA
|
Protein sequence | MTTPLYLRGF LAPVPDEIDA FDLPVSGALP PALTGRYFRN GPNPLPGRDP GHWFTGPGMV HGVRLRDGRA EWYRNRWVRT REFTEDAPFV RDDLSVDLTA VPANTHVVPH GDKIFALVEN GLPYELTAGL ETVGPCDFGG LLTTAMTAHP KRDPLTGELL FFGYGFLPPY LTYHRLSADG ELVESREVPV PGPTMMHDFA ITAGHVVWMD LPVVFDLALA EGGGMPYRWD DRYGARLGVM PRTGDAGVTW SDINPCYVFH TANAHEDGSG RVVLDTVRYT PAEFAAVWDD IGGSAHPAAR AAVSGTAHLH RYVLTPGAAS HEEQLDDLDV EFPTLHDGRT GDRNRYLYAV SSGAIVKYDV RSGASILHKT GPDRMAGEAV FVPAEDARGE DEGWLISIVT GGPGVGSELL VLDAVDLSRV ASVRLPRRVP AGFHGSWIQD G
|
| |