Gene Sros_9004 details


Gene Information

Locus tag: Sros_9004
Symbol:
ID: 8672346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9952615
End bp: 9953682
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Dihydroorotate oxidase
Protein accession: YP_003344378
Protein GI: 271970182
COG category:
COG ID:
TIGRFAM ID:
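
The start, end, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. The sketch below checks that arithmetic. The function names are hypothetical, and the coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(9952615, 9953682)
print(length, protein_length(length))  # 1068 355
```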


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.687517
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCGAT TCGTGTTCAC GCAGCTCCTC AGCCGCATCG ACCCGGAGGA TGCGCACCAT 
CTGACGGTCG GCGCGCTCCG CCTTCTCTCG GTGACGCCCG TGGTCAAGCG GCTCGTCCAC
CGCCGGCTCG CGCCGCGCGA CCCCGCCCTG CGCGTGCAGG CCTTCGGAGT GCACTTCCCC
GGCCCCTTCG GGCTGGCGGC GGGCTTCGAC AAGGACGCCG GCTGCGTGGA GGCCATGTCC
GCACTCGGCT TCAGCCATGT CGAGGTCGGC ACGATCACCG CGCACGCCCA GCCCGGAAAC
GCCAGGCCCC GGCTGTTCCG CCTGGTCAAG GACCGGGCGA TCATCAACCG GATGGGCTTC
AACAACGCCG GCGCGAGCGC CGCGGCGCGG GCGCTGAGCA GGCCCCGCGG TGTCCCGGCG
GTGGTCGGCG TCAACATCGG CAAGACCAAG GTCGTGCCTG AGGCGGAGGC CGTCCACGAC
TACGTGGCCG GCGCCAAGGA GCTGGCTCCG CTGGCCGACT ACCTGGTCGT CAACGTGAGC
TCGCCGAACA CGCCCGGTCT GCGCAACCTC CAGGCCGTCG AGCTCCTGCG CCCGCTGCTC
CAGGGGGTCA AGGAGGTCGC CGACAGCACC CCCAGGCGCA CCCCGCTGCT GGTCAAGATC
GCCCCCGACC TGGCCGACGA CGACGTGGAC GCGGTCGCCG ACCTGGCCCT CGAACTCGGG
CTCGACGGCA TCATCGCGAC CAACACCACG ATCAGCCGCG AGGGCGTCTC CAGCACCGAG
CCCGGCGGCC TGTCCGGCCG CCCGCTGAAG GCCCGCTCGC TGGAGGTGCT GCGCAGGCTC
CGCGCCCGGG TGGGCGACCG CCTGGTGCTC GTCTCCGTCG GCGGCGTCGA GAACGTGGAC
GACGTCTGGG AGCGCCTGCT CGCCGGGGCC ACCCTGGTCC AGGGCTACAG CGCGCTGATC
TACGAGGGCC CCCTCTGGGC CCACCGCATC CATCGCGGGC TCTCCCGGCG CCTGCGCCGT
CACGGTGTCA GGGACCTGCG CGAAATCATC GGCCGCACCG CGACCTGA
 
Protein sequence
MYRFVFTQLL SRIDPEDAHH LTVGALRLLS VTPVVKRLVH RRLAPRDPAL RVQAFGVHFP 
GPFGLAAGFD KDAGCVEAMS ALGFSHVEVG TITAHAQPGN ARPRLFRLVK DRAIINRMGF
NNAGASAAAR ALSRPRGVPA VVGVNIGKTK VVPEAEAVHD YVAGAKELAP LADYLVVNVS
SPNTPGLRNL QAVELLRPLL QGVKEVADST PRRTPLLVKI APDLADDDVD AVADLALELG
LDGIIATNTT ISREGVSSTE PGGLSGRPLK ARSLEVLRRL RARVGDRLVL VSVGGVENVD
DVWERLLAGA TLVQGYSALI YEGPLWAHRI HRGLSRRLRR HGVRDLREII GRTAT
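
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC percentage, and translate a CDS. All helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon assignments are the standard ones, which NCBI translation table 11 shares with table 1; table 11 differs only in which codons may act as start codons. That difference does not matter here because this gene starts with ATG. Only the first three blocks of the gene sequence are used as a demonstration.

```python
# Standard codon assignments (shared by NCBI translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three ten-base blocks of the gene sequence listed above.
first_blocks = clean("ATGTATCGAT TCGTGTTCAC GCAGCTCCTC")
print(translate(first_blocks))  # MYRFVFTQLL
print(round(gc_content(first_blocks), 1))
```

Passing the full gene sequence through `clean` and `translate` in the same way gives a string that can be compared against the listed protein sequence after it, too, has been cleaned.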