Gene Sros_1773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1773 
Symbol 
ID8665051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1888814 
End bp1890469 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content72% 
IMG OID 
ProductUrocanate hydratase 
Protein accessionYP_003337506 
Protein GI271963310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0949014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCA GCCGGATCGT GCGCGCGCCG CGCGGCACCA CGCTCACCGC CAAGGGGTGG 
CCGCAGGAGG CCGCGCTCCG CATGATCCAG AACAATCTGG ATCCCGAGGT GGCCGAGCAC
CCGGAGCAGC TCGTCGTCTA CGGCGGTTCG GGCAGGGCGG CGCGCGACTG GCGCTCCTTC
GACGCGATCA CCCGCACCCT GACCACGCTG GAGGGCGACG AGACGCTGCT GGTGCAGTCC
GGCCGGCCGG TGGGGGTCTT CCGCACGCAC GAGTGGGCGC CGCGGGTGCT CATCGCCAAC
TCCAACCTCG TGCCCGACTG GGCGAACTGG GAGGAGTTCC GCCGCCTGGA GGCCGCGGGC
CTGACCATGT ACGGGCAGAT GACGGCCGGG TCCTGGATCT ACATCGGCAC CCAGGGCATC
CTGCAGGGAA CCTACGAGAC CTTCGCCGCG GTCGCCGCCA AGCGGTTCGG CGGCTCCCTG
GCCGGGACGA TCACCCTGAC CGCCGGGCTC GGCGGCATGG GCGGCGCCCA GCCGCTCGCC
GTCACCATGA ACGACGGCGT GGTGATCTGC GTCGACTGCG ACCCCAGGTC GATCGACCGG
CGGATCGAGC ACCGCTACCT GGACGTCAGG GCCAAGGACC TCGACGAGGC GCTGCGCCTG
GCCTACGAGG CCCGTGACCT GCGCAGGCCC CTGAGCATCG GTGTCGAGGG CAACGCGGCC
GAGGTGCTGC CCGAGCTGCT CCGCCGGGGT GCCGAGATCG ACATCGTCAC CGACCAGACG
TCGGCGCACG ACCCGCTGAT GTACCTGCCG ATCGGCGTGG CCTTCGAGGA CATGGCCGCC
GAGCGGGAGA AGGACCCGGC CGGGTTCACG ACGAAGGCGC GCGAGGCCAT GGCCACGCAC
GTCGAGGCTA TGGTCGGCTT CCAGGACGCG GGCGCCGAGG TCTTCGACTA CGGCAACTCC
ATCCGGGGCG AGGCGCAGCT CGCCGGCTAC GCCCGCGCGT TCGACTTCCC CGGCTTCGTG
CCCGCCTACA TCCGGCCGCT GTTCTGCGAG GGCAAGGGCC CCTTCCGCTG GGCCGCGCTG
TCCGGATCCG CCCAGGACAT CGCCAAGACC GACCGGGCGA TCCTGGAGCT GTTCCCCGAC
AACGAGCCGC TGGCCCGGTG GATCCGGATG GCCGAGGAGC GGGTCCACTT CCAGGGCCTG
CCCGCGCGGA TCTGCTGGCT CGGGTACGGC GAGCGCCATC TGGCCGGTGA GCGGTTCAAC
GACATGGTGG CCTCCGGCGA GATCGAGGCC CCGCTGGTGA TCGGCCGCGA CCACCTCGAC
TGCGGTTCGG TCGCCTCGCC GTACCGGGAG ACCGAGGGCA TGGCCGACGG CTCCGACGCG
ATCGCCGACT GGCCGCTGCT GAACGCCATG CTCAACGTGG CCTCCGGCGC CGCCTGGGTC
TCCATCCACC ACGGCGGCGG CGTCGGCATC GGCCGCTCCA TCCACGCCGG CCAGGTCACC
GTCGCCGACG GCACCAGGCT CGGAGCCGAG AAGCTCAACC GGGTCCTCAC CAACGACCCG
GGCATGGGCG TGATCCGTCA CGTCGACGCG GGCTACGACG GGGCCGTCAC CGTGGCGGAG
GAGCGGGGCG TCCGCGTCCC GATGCGCGAG TCCTGA
 
Protein sequence
MTGSRIVRAP RGTTLTAKGW PQEAALRMIQ NNLDPEVAEH PEQLVVYGGS GRAARDWRSF 
DAITRTLTTL EGDETLLVQS GRPVGVFRTH EWAPRVLIAN SNLVPDWANW EEFRRLEAAG
LTMYGQMTAG SWIYIGTQGI LQGTYETFAA VAAKRFGGSL AGTITLTAGL GGMGGAQPLA
VTMNDGVVIC VDCDPRSIDR RIEHRYLDVR AKDLDEALRL AYEARDLRRP LSIGVEGNAA
EVLPELLRRG AEIDIVTDQT SAHDPLMYLP IGVAFEDMAA EREKDPAGFT TKAREAMATH
VEAMVGFQDA GAEVFDYGNS IRGEAQLAGY ARAFDFPGFV PAYIRPLFCE GKGPFRWAAL
SGSAQDIAKT DRAILELFPD NEPLARWIRM AEERVHFQGL PARICWLGYG ERHLAGERFN
DMVASGEIEA PLVIGRDHLD CGSVASPYRE TEGMADGSDA IADWPLLNAM LNVASGAAWV
SIHHGGGVGI GRSIHAGQVT VADGTRLGAE KLNRVLTNDP GMGVIRHVDA GYDGAVTVAE
ERGVRVPMRE S