Gene Sros_4164 details


Gene Information

Locus tag: Sros_4164
Symbol:
ID: 8667458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4634518
End bp: 4635765
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: putative monoxygenase
Protein accession: YP_003339811
Protein GI: 271965615
COG category:
COG ID:
TIGRFAM ID:
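
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4634518
END_BP = 4635765


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1248 415
```

Under these assumptions the computed values agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields.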


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0321225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACCC CTCGCGACCG GGGCCGGCCG CTGCGGATCA TCATCATCGG CGGCGGCATC 
GGCGGACTGT GCCTCGCCCA AGGGCTGAGG CAGGCCGGTA TCGACGACAT CGTCGTATAC
GAACGTGACG AATCCGCCCG GGGGCGGATG CAGGGATACC GGTTGCGGAT CAGTCCCGAG
GGGGAGCGGG CACTACGGCA GTGCCTGCCC CGCCAGGCGC AGGACCTGCT CACCGCGACG
TCGAACAAGC GGCACGAGGA GGGCCTGGCG GCCTACGACG ATCAGCTCAA CCCGCAGTGG
GCCCCCGCGT TCGACGATCC GCGCGGCGAC GCGCCGGACA AGGTGGACGC GGTCGACCGG
GTGACGCTGC GCCGCATACT GCTCGCCGAT CTCGACGGTG TGGTGCGCTT CGGCAAACGG
TTCACCCACT ACGAGCAGGT GGACGGGGAG GTCGTGGCGC ACTTCGCCGA CGGCGGCTCG
GACACCGGCG ACGTGCTGGT GGCCGCTGAC GGGGCGAACT CCCAGGTACG GGCCCAGCTA
CGGCCGGCCG ACCGCGCCCA CGACCTCGGC GTGCGCGCGA TCCTGTCTCG CACCCCGCGG
GCCGGCGCGA TCGAGGCCGG GTTGCCGGAG GTCCTGCGCG ACCGGTTCGT CAACGTGACG
GGATCGAACG GACTCCGTCT CGCGCTGATG CCCATGGTCT TCCGCACCCC ACCGCGGGAG
GCCGCCGAGC GGTTCTGGCC CGGCCTGGGA TTCGACGACA CCGAGGACTA CTACATGTCG
GTGTTCAGCG TGCACCGCGA GGTTCTGGGG CTGCCCGACG ACTCGTTCTT CGCCATGACC
GGCGAGGAGC TCCGCGAGCT GGTGCTCGAA CGCACCGCCG GCTGGCATCC GCACCTGCGC
GGCGTGTTCG CCCACTCCGA GGCGGAGGAG ACCTACCCGC TCGCGCTGAG GGCCACCCTG
CCCGTCGAGC CCTGGGCGCC GGGGAACGTG ATCCCGCTCG GCGACGCGGT GCACACGATG
CCGCCGACCG GCGGGGTCGG AGCCAACACG GCGCTGCGCG ACGCCGCCTC GCTGTGCCGC
GCGCTGACCG CGGTGACGCG TGGCGAGCGG CCACTGCTGG ACGCCGTGGC CGAATACCAG
GCGGAGATGG TCCGGTACGC GACCGAGGCG ACGAACATGT CGCTGAAGAT CGCCAAATGG
TCCATGCAGA AGATCGACCT CAGTGAGAAG AAGCTCTCCC AAGCGTAA
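
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch, assuming plain A/C/G/T characters with no ambiguity codes. The helper names clean_sequence and gc_fraction are hypothetical. Applied to the full sequence, gc_fraction gives a value that can be compared with the GC content field of the record.

```python
def clean_sequence(text):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGACACCC CTCGCGACCG GGGCCGGCCG CTGCGGATCA TCATCATCGG CGGCGGCATC"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```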
 
Protein sequence
MDTPRDRGRP LRIIIIGGGI GGLCLAQGLR QAGIDDIVVY ERDESARGRM QGYRLRISPE 
GERALRQCLP RQAQDLLTAT SNKRHEEGLA AYDDQLNPQW APAFDDPRGD APDKVDAVDR
VTLRRILLAD LDGVVRFGKR FTHYEQVDGE VVAHFADGGS DTGDVLVAAD GANSQVRAQL
RPADRAHDLG VRAILSRTPR AGAIEAGLPE VLRDRFVNVT GSNGLRLALM PMVFRTPPRE
AAERFWPGLG FDDTEDYYMS VFSVHREVLG LPDDSFFAMT GEELRELVLE RTAGWHPHLR
GVFAHSEAEE TYPLALRATL PVEPWAPGNV IPLGDAVHTM PPTGGVGANT ALRDAASLCR
ALTAVTRGER PLLDAVAEYQ AEMVRYATEA TNMSLKIAKW SMQKIDLSEK KLSQA
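
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record. It builds the codon table from the usual TCAG ordering, relies on translation table 11 assigning the same amino acids to codons as the standard code, and ignores the alternative start codons of table 11, which does not matter here because the gene begins with ATG. The name translate is a hypothetical helper.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGACACCC CTCGCGACCG GGGCCGGCCG"))  # prints: MDTPRDRGRP
```

The printed residues match the first block of the protein sequence listed above.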