Gene Sros_3879 details


Gene Information

Locus tag: Sros_3879
Symbol:
ID: 8667169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4315442
End bp: 4316563
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: monooxygenase
Protein accession: YP_003339539
Protein GI: 271965343
COG category:
COG ID:
TIGRFAM ID:
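
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4315442, 4316563)
print(length)                  # 1122
print(protein_length(length))  # 373
```

Both values match the Gene Length and Protein Length fields of this record.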


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.535527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.325627
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCAC TGTCGATCCT CGACCTGTCC CCCGTCCCCT CCGGGGGCAC GACGGGCGAC 
GCGCTGCGCA ACACCCTGGA CCTGGCCAGG CGCGCCGAGG AGTTCGGCTA CCGCCGTTAC
TGGCTGGCCG AGCACCACTT CGCGCCCGGC GTCGCCGGCG CCGCCCCCGC CGTGCTCATC
GCCCTCGTGG CGGCCGCGAC CAGCACGATC CGGGTCGGCT CCGGCGCCGT GCAGCTCGGC
CACCAGACGG CGCTCGCCGT GGTCGAGCAG TTCGGCCTGA TCGACGCGCT GTACCCGGGC
CGCCTCGACC TGGGCCTCGG CCGGTCGGGC CGGCGCGGGA GCGAGTTCGC CGAGCTCGCC
AAGAGGCCCC CGCAGCCGCC CGGACCGGCC AGGGTCGTGG ACGGCCTGCT CATCCCGGAA
CCGTTCTCCT TCGCCGCCCT GGCCGCGTCG CCGCTCCTGG CGCTGTACGG CTCGCTGCTG
CAGCAGCCGG GCGCGGAGAG CCCCGACTTC GCCGACCAGG TGGACGACAT CCTCGCGCTG
CTCGCCGGGA CCTACCGGTC GGCCGAGGGG GTGGCGGCGC ACGCCGTACC GGGCGAGGGC
GCGGACGTGG AGCTGTGGGT GCTGGGCAGC AGCGGCGGTC AGAGCGCGCA GGTGGCGGGG
GAGCGCGGGC TGCCGTTCGC GGCGAATTAC CATGTCAGCC CGTCCACCGT GCTGGAGGCG
GCCGAGGCCT ACCGGGAGGC GTTCAAGCCG TCGGAGACCC TCGCCGAACC CCATCTGATC
GTCTCGGCCG ACGTGGTCGT GGCCGAGGAC GACGACACGG CCCGCGAGCT CGCCTCGCCG
TACGGATTGT GGGTGCGCAG CATCCGCACC GGCGCGGGCG CGATCCCGTT CCCGACGCCG
GAGGAGGCCG CGGCGCACGA GTGGAGCGAG GAGGACCACG CGCTGGTCGC CGACCGGGTG
GCGACCCAGT TCGCCGGCTC GCCGCGGACC GTCGCCGAGA GGCTGCGCGT CCTGCGCGAC
GTCACCGGCG CCGACGAGCT GCTCGTCACC ACCATCACCC ACGACCACGC CGACCGGGTC
CGCTCCTACG AGCTGCTCGC AAAGGAGTGG GCCGGCGGCT GA
 
Protein sequence
MTSLSILDLS PVPSGGTTGD ALRNTLDLAR RAEEFGYRRY WLAEHHFAPG VAGAAPAVLI 
ALVAAATSTI RVGSGAVQLG HQTALAVVEQ FGLIDALYPG RLDLGLGRSG RRGSEFAELA
KRPPQPPGPA RVVDGLLIPE PFSFAALAAS PLLALYGSLL QQPGAESPDF ADQVDDILAL
LAGTYRSAEG VAAHAVPGEG ADVELWVLGS SGGQSAQVAG ERGLPFAANY HVSPSTVLEA
AEAYREAFKP SETLAEPHLI VSADVVVAED DDTARELASP YGLWVRSIRT GAGAIPFPTP
EEAAAHEWSE EDHALVADRV ATQFAGSPRT VAERLRVLRD VTGADELLVT TITHDHADRV
RSYELLAKEW AGG
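
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that whitespace, computes the GC fraction, and translates the DNA codon by codon. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ only in the permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, chosen for illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino acid
# assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCTCAC TGTCGATCCT CGACCTGTCC"
print(translate(first_blocks))  # MTSLSILDLS
print(f"{gc_content(first_blocks):.0%}")
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared against the protein sequence and the GC content field of this record.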