Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1773 |
Symbol | |
ID | 8665051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 1888814 |
End bp | 1890469 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Urocanate hydratase |
Protein accession | YP_003337506 |
Protein GI | 271963310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0949014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA GCCGGATCGT GCGCGCGCCG CGCGGCACCA CGCTCACCGC CAAGGGGTGG CCGCAGGAGG CCGCGCTCCG CATGATCCAG AACAATCTGG ATCCCGAGGT GGCCGAGCAC CCGGAGCAGC TCGTCGTCTA CGGCGGTTCG GGCAGGGCGG CGCGCGACTG GCGCTCCTTC GACGCGATCA CCCGCACCCT GACCACGCTG GAGGGCGACG AGACGCTGCT GGTGCAGTCC GGCCGGCCGG TGGGGGTCTT CCGCACGCAC GAGTGGGCGC CGCGGGTGCT CATCGCCAAC TCCAACCTCG TGCCCGACTG GGCGAACTGG GAGGAGTTCC GCCGCCTGGA GGCCGCGGGC CTGACCATGT ACGGGCAGAT GACGGCCGGG TCCTGGATCT ACATCGGCAC CCAGGGCATC CTGCAGGGAA CCTACGAGAC CTTCGCCGCG GTCGCCGCCA AGCGGTTCGG CGGCTCCCTG GCCGGGACGA TCACCCTGAC CGCCGGGCTC GGCGGCATGG GCGGCGCCCA GCCGCTCGCC GTCACCATGA ACGACGGCGT GGTGATCTGC GTCGACTGCG ACCCCAGGTC GATCGACCGG CGGATCGAGC ACCGCTACCT GGACGTCAGG GCCAAGGACC TCGACGAGGC GCTGCGCCTG GCCTACGAGG CCCGTGACCT GCGCAGGCCC CTGAGCATCG GTGTCGAGGG CAACGCGGCC GAGGTGCTGC CCGAGCTGCT CCGCCGGGGT GCCGAGATCG ACATCGTCAC CGACCAGACG TCGGCGCACG ACCCGCTGAT GTACCTGCCG ATCGGCGTGG CCTTCGAGGA CATGGCCGCC GAGCGGGAGA AGGACCCGGC CGGGTTCACG ACGAAGGCGC GCGAGGCCAT GGCCACGCAC GTCGAGGCTA TGGTCGGCTT CCAGGACGCG GGCGCCGAGG TCTTCGACTA CGGCAACTCC ATCCGGGGCG AGGCGCAGCT CGCCGGCTAC GCCCGCGCGT TCGACTTCCC CGGCTTCGTG CCCGCCTACA TCCGGCCGCT GTTCTGCGAG GGCAAGGGCC CCTTCCGCTG GGCCGCGCTG TCCGGATCCG CCCAGGACAT CGCCAAGACC GACCGGGCGA TCCTGGAGCT GTTCCCCGAC AACGAGCCGC TGGCCCGGTG GATCCGGATG GCCGAGGAGC GGGTCCACTT CCAGGGCCTG CCCGCGCGGA TCTGCTGGCT CGGGTACGGC GAGCGCCATC TGGCCGGTGA GCGGTTCAAC GACATGGTGG CCTCCGGCGA GATCGAGGCC CCGCTGGTGA TCGGCCGCGA CCACCTCGAC TGCGGTTCGG TCGCCTCGCC GTACCGGGAG ACCGAGGGCA TGGCCGACGG CTCCGACGCG ATCGCCGACT GGCCGCTGCT GAACGCCATG CTCAACGTGG CCTCCGGCGC CGCCTGGGTC TCCATCCACC ACGGCGGCGG CGTCGGCATC GGCCGCTCCA TCCACGCCGG CCAGGTCACC GTCGCCGACG GCACCAGGCT CGGAGCCGAG AAGCTCAACC GGGTCCTCAC CAACGACCCG GGCATGGGCG TGATCCGTCA CGTCGACGCG GGCTACGACG GGGCCGTCAC CGTGGCGGAG GAGCGGGGCG TCCGCGTCCC GATGCGCGAG TCCTGA
|
Protein sequence | MTGSRIVRAP RGTTLTAKGW PQEAALRMIQ NNLDPEVAEH PEQLVVYGGS GRAARDWRSF DAITRTLTTL EGDETLLVQS GRPVGVFRTH EWAPRVLIAN SNLVPDWANW EEFRRLEAAG LTMYGQMTAG SWIYIGTQGI LQGTYETFAA VAAKRFGGSL AGTITLTAGL GGMGGAQPLA VTMNDGVVIC VDCDPRSIDR RIEHRYLDVR AKDLDEALRL AYEARDLRRP LSIGVEGNAA EVLPELLRRG AEIDIVTDQT SAHDPLMYLP IGVAFEDMAA EREKDPAGFT TKAREAMATH VEAMVGFQDA GAEVFDYGNS IRGEAQLAGY ARAFDFPGFV PAYIRPLFCE GKGPFRWAAL SGSAQDIAKT DRAILELFPD NEPLARWIRM AEERVHFQGL PARICWLGYG ERHLAGERFN DMVASGEIEA PLVIGRDHLD CGSVASPYRE TEGMADGSDA IADWPLLNAM LNVASGAAWV SIHHGGGVGI GRSIHAGQVT VADGTRLGAE KLNRVLTNDP GMGVIRHVDA GYDGAVTVAE ERGVRVPMRE S
|
| |