Gene Sros_3451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3451 
Symbol 
ID8666739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3794272 
End bp3796404 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content78% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339130 
Protein GI271964934 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.975495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.147632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCGG TGGCGGTGAT GAGGGCGGCC CGTACCGTCC GCGACGAGCT TGACCATTTG 
CTGGGCGCGG CCTCCGGCGA GCTGCGGGAG CGGCTCGATC CGCTGCTGGC CGAGGAGGGC
AGGCAGGATC CGGGGCCGCT GGCCGACGTC ATCGTGATGC TCATCGCCCA GTACGGCCCC
GCCTGGGAGC GGTTCGCCCT GCTGCGCCGC CTGAACGAGC AGGCCCTGAC CGAGGTGCCG
GACCTGTCGT CACCGGGCCC CATCGGCCCG GCCCCGTCAC CGGGCCCCGC CGGCCCGGCC
CCATCGCCTG GCCTCACCGG TCCGGACCTG CCGCCGGGCC CCTCCGGCCC GGACGGCCCG
GACACGCCGC CCGGCTCCGC GCCCCCGGAC CCGCCACCGG GGCACACGAG TCCGGCTCCG
CCTCCGGGGC ACCCGGCCCC GCCCCCCGGA CCCGCCGGTC CGGGGCACCC GCCGTACGGC
CCCGCCGGTC CGGGGCACCC GCCGTACGGC CCCGCCGACG CGGGGCACCC GCCGCACGGG
CCCGTCGGCG CGGGGCACCC GCCGCACGGC CAGGACGGGC CGGACCGGCC GGGGGCCGGC
CAGGCACCGG CCGGCACGTC CCCCGGCCAC GGACCTCAGC CCCAGCAGGG GGCACCTCCC
CCGTCCGACC GTGGCCACTC CGTGCTCGGC CGGCTGACCG GGGCCTTCCG CAGGCGGCGC
AGGCAACCCG CTCCCACCGG GCCGGAGTGG GAGGCGTTCC CGCTGATCGA GGCGCCCGGC
GAGGTGGTGG CGGGCTGGGG GATCGAGATC GAGGTCGGCC TCTCCCCCAG AGCCGACTCC
TCCGGAACCG CCCCGTTCCC CGGGGCGGGA ACCCCACCCG GCACGGGGGC GCCGCCCGGG
GCCGACGCCT CACCCGGAGC CGCTCTTGCC TCCGGAGCCG TACCCGGCGG CCAGGAGGCG
CCCTTCACCC ACGAGCCGGT CGACCTGGAC ATCCAGATCG TCGCCGAGGG GTTCGAGGCG
CCCGGCGGGT GGCGGGTGCG GCTCCGCGTC GACGGGGCCT CGCCGTACCC GAAGGCGACG
GTCCGCCTGG TCGCCCTCCC TCAGGACCAG CCGGCGGTCG CCCGGCAGAT CCAGGCGGTC
CACTCCGTCG GCGGTCAGGT GACCGGGTTC GGCGTGCGCG CGCTCGCGGT CCTCGACTCC
CCCGGGCTGC TCGGCCGGGA GAGCGTCCCT CAGCCGGTCG CGGGAACCCG GGTGCGGGTG
GCCGGCGGCG GGGAGCCGGC GGACGTCACG GTGGTCATCG TCCACGGCGA CCGGCCGGGG
CTGCTGTGGT GGACCTACCA GTCCCCGCAC TTCACCACCC CCGACCAGGC GGAGGCGTGC
GACCTGGGGA TGCGGACCTC CGAGTTCGGG CGGCATGTGG CGGAGCGGGC GCGGACGGTC
GAGGACATGG GCCGGCAGGT CGCCTCGGAG ATCCCGCGGG GGTTCTGGGA GCTGCTGAAC
GCGGTGGCCG GGCGCGTGGC CCCGCGACGC CCGAGCGTGC TGGTCCTCTC CCAGGAGCCG
CACATCCCCT GGGAGCTGGC CACGCTGGAG CAGCCGTTCG ACCGCTCCGC CCCCGCCTTC
CTGAACTGCC AGACGGTGAT CGGCCGCTGG CCCCTCGGCG GGCGGCGTCC CGAGCTGCCT
CCGCCGGTGC GCGCGCGAGG CGACAGCATG GCGGTCGTGT ACGGCGGGGC GGAGCCCCAT
CCGCTGGTCA CCGCCTACGG CGCCGCCCGG GTCACCCCGG TGCTCGGCGA GGTGCTGGCG
ATGCTGGGCG ATCCACCGGA CATGATCCAC TTCACCTCCG GCGGCGAGGC GTTCGACTCC
CTGGGCCGGG GCATGGGCGA GGGCCCGTTC GTCTTCCTTG AGGAGCCCGG AGACTGCCAG
GCGTTCCTGC TCGCGGGCGC GAGCGGGGTC GTCGCCCCGC TCTGGCCCGT CGGCGACGGC
CTGGCACCGG AGTTCTACCG CCGCTGCCTC GGCGGCGAGC CCCCGGCCGA GGTGCTGCGC
TCCCTGCGCT GCCAGGTCCC CGCCACGGGG CCCGCCTACC GGTTCTTCGG CCACCCGTCG
CTCACGCTGT CGCGGGGTCA CTCGAGCGCG TGA
 
Protein sequence
MEPVAVMRAA RTVRDELDHL LGAASGELRE RLDPLLAEEG RQDPGPLADV IVMLIAQYGP 
AWERFALLRR LNEQALTEVP DLSSPGPIGP APSPGPAGPA PSPGLTGPDL PPGPSGPDGP
DTPPGSAPPD PPPGHTSPAP PPGHPAPPPG PAGPGHPPYG PAGPGHPPYG PADAGHPPHG
PVGAGHPPHG QDGPDRPGAG QAPAGTSPGH GPQPQQGAPP PSDRGHSVLG RLTGAFRRRR
RQPAPTGPEW EAFPLIEAPG EVVAGWGIEI EVGLSPRADS SGTAPFPGAG TPPGTGAPPG
ADASPGAALA SGAVPGGQEA PFTHEPVDLD IQIVAEGFEA PGGWRVRLRV DGASPYPKAT
VRLVALPQDQ PAVARQIQAV HSVGGQVTGF GVRALAVLDS PGLLGRESVP QPVAGTRVRV
AGGGEPADVT VVIVHGDRPG LLWWTYQSPH FTTPDQAEAC DLGMRTSEFG RHVAERARTV
EDMGRQVASE IPRGFWELLN AVAGRVAPRR PSVLVLSQEP HIPWELATLE QPFDRSAPAF
LNCQTVIGRW PLGGRRPELP PPVRARGDSM AVVYGGAEPH PLVTAYGAAR VTPVLGEVLA
MLGDPPDMIH FTSGGEAFDS LGRGMGEGPF VFLEEPGDCQ AFLLAGASGV VAPLWPVGDG
LAPEFYRRCL GGEPPAEVLR SLRCQVPATG PAYRFFGHPS LTLSRGHSSA