Gene Sros_8343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8343 
Symbol 
ID8671677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9208541 
End bp9211492 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343734 
Protein GI271969538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.715387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCTTCC GGACCCCCGG AGCCGGTAGG GCAATGCGCT TGCCCCGCCG GCCACGACTG 
CTACTTCCTG TCGCGATCGC CATCGTCGCG CTCGTTGCGT TGTTCTTCCT CTTCGCCGGC
ATCTTCACCG ACTACCTCTG GTACGACTCG GTGAACTACA CGACGGTCTT CTCCGGTGTG
ATCGTCACAC AGATCGTGCT CTTCATCGTG GGCGCCGTGC TGATGGTCGG CCTCGTGGGC
GGCAACATGC TGCTGGCCTA CCGGATGCGG CCGATGTTCG GCCCGGGGAT GTTCGGCGGC
GCCAGCGGCG CCGACCGCTA CCGGATGGCC CTCGACCCGC ACCGCAAGCT GATCTTCCTG
ATCGGCGTCG CCGTGCTCGC CCTGTTCTCC GGCTCGTCGT TCTCCAGCCA GTGGAAGACC
TGGCTGGAGT TCACCAACGC CACGCCGTTC AAGGAGACCG ACGCGCTGTT CGGCATGGAC
GTGTCGTTCT TCATGTTCAC CTACCCGTTC CTGCGGATGG TGCTGAACTT CCTGTTCACG
GCGGTGGTGC TGTCGGCCAT CATGGCGGCG ATCGTGCACT ACCTGTACGG CGGCTTCCGG
CTCCAGTCGC CGGGCGTGCA CGCCTCGCGG GCCGCCCGGG TGCACCTGTC GGTGCTGCTC
GGCGTGTTCG TGCTGCTGAA GGCGGTCGCC TACTGGATCG ACCGGTACGG ACTGGTCTTC
TCCGACCGGG GCTTCGCGTA CGGCGCGTCC TACACCGACG TCAACGCCGT GCTGCCGGCC
AAGACCATCC TGGCGATCAT CGCCCTCATC TGCGCCGCCC TGTTCTTCGC CGGAGTCGTA
CGGCCCGGCG GCATGCTGCC CGGCGTCTCC TTCGGACTGC TGGTGCTCTC GGCCATCCTG
ATCGGCGGGG TCTATCCGGC CCTGGTCGAG CAGTTCCAGG TCAAGCCGAA CCAGCAGGAC
AAGGAACAGG TCTACATCAA GCGCAACATC GACGCCACCA GGAAGGCCTA CGGCGTCGAC
AAGTCCGAGG TCACCGACTA CACGGCGCAG GGTGACGCCA GCAAGGTCAA CGTCACCGCC
GACAGTTCGA TCTCGGGCGT GCGGCTGCTC GACCCCAACC TGGTCGCCAA GACCTACCAG
CAGAAGCAGC GCATTCGCGG CTACTACGAC TTCCATGACC CGCTCGACGT CGACCGCTAC
CCGGATGAGT CGGGCAAGCT GCGCGACACC GTGGTGGCCG TGCGAGAGCT CACCGGCCCG
CCGCGCGAGC AGGACAACTG GATCAACCGG CACCTGGTCT ACACCCACGG CTACGGCTTC
GTGGCCGCGC CGGGCAACGA GGTCGACTCG CAGGGCCTGC CCAACTTCGA CGCCAAGGAC
ATGCCGGTGA CCGGCCCCCT GGTGCAGCGG ACCGGGCTGA AGGAGTCGCG GATCTACTTC
GGCGAGTCCC CGACCGCACC CGAGTACGTC GTGGTGGGCG GTGACAAGAA GCAGGAGCTC
GACTACCCGG AGAGCGGCGG CACCGGCCAG CAGAACACCA CCTACGTGGG CAAGGGCGGC
GTCCCGGTCG GCTCGTTCCT CAACCGGGTT CTCTACGCGG CCAAGTACGG CGAGAAGAAC
CTGCTGCTGT CGAGCGACAT CAACGACAAG TCGAAGATCC TCTACGAGCG CAACCCGCTG
GACCGCATCG CCAAGGTGGC GCCGTTCCTG AGCCTGGACG ACAACCCCTA CCCGGCGATC
GTCGACGGCA GGGTGGTCTG GATCGCCGAC GCCTACACCA CCTCCAACGC CTACCCCTAC
TCCGACAGCC GGAGCCTGGA GGCGATGACG CGGGACACCG TCACGGACCC GCGCCTGGTG
GTGCAGCAGC CGCGCGACCA GATCAACTAC ATGCGCAACG CGGTCAAGGC CACGGTCGAC
GCCTACGACG GCACCGTCAA CCTGTACGCC TGGGACGAGA CGGACCCGAT CCTGCAGACC
TGGCGCAAGG CCTTCCCCGG GATCATCAAG CCGCAGAGCG AGATGGGCGA CGGTCTCAAG
CAGCACCTGC GCTACCCGGA GGCGCTGTTC AAGGTCCAGC GTGACGTGCT GTCCCGCTAC
CACATCGAGG ACCCGAACGC CTTCTACAGC GGTCAGGACT TCTGGAACGT CCCGAACGAC
CCGTCGTCGG GCGAGCGCGA CGTCAAGCAG CCGCCGTACT ACCTGTCGGT CAAGATGCCC
GACACGACGG CCCCGACGTT CTCGCTGACC ACCACGTTCG TGCCGCGCCA GGGTCCGAAC
CTGGCGGCGT TCATGGCAGT GGACGCCACC CCGGGAGCGG ACTACGGAAA GCTCCGCATC
CTGCGGATGC CCTCCAACAC CACGATCCCC GGCCCCGGCC AGGTGCAGAA CAACTTCCAG
AACAAGTTCT CCGGCGAGCT CAACCTGCTC GGCCTCGGCC AGGCGAAGGT CCGCTACGGC
AACCTGCTGA CCCTGCCGTT CGCCGGGGGC CTGGTCTACG TCGAACCGGT CTACGTGGAG
ATCGCCGCGG CCTCCGGCCA GGAGCCGTAC CCGATCCTCC GCCGGGTCCT GGTCTCCTAC
GGAGACAAGG TCGGCTCCGC CGACACACTG GAAGCCGCGC TGCAGCAGGT CTTCGGGGAA
GGCGCCGCGC CGCCGGCCAA GCCTGACGTC ACCAAGCCGA ACACGCCCGC CCAGCCGAGC
ACCGCGCTGA GCCAGGCGAT CGGCGAGGCG CAGCAGGCCT ACGAGAAGGC GCAGGCGGCC
CTGGCGAAGA ACCCGCCGGA CTGGACCGCG TACGGCGAGG CACAGAAGGA GCTGGAGAAG
GCCCTGGAGA AGTTGAAGGG CGTCGCGGTC CCGTCGGCCA CGGCCACTCC GGTCCCCTCG
CAGAGCCCGA CTCCGGCCCC TTCCGGAAGC CCCACACCAG CGGCGACACC GACGTCCTCA
CCGAGTCCAT AG
 
Protein sequence
MSFRTPGAGR AMRLPRRPRL LLPVAIAIVA LVALFFLFAG IFTDYLWYDS VNYTTVFSGV 
IVTQIVLFIV GAVLMVGLVG GNMLLAYRMR PMFGPGMFGG ASGADRYRMA LDPHRKLIFL
IGVAVLALFS GSSFSSQWKT WLEFTNATPF KETDALFGMD VSFFMFTYPF LRMVLNFLFT
AVVLSAIMAA IVHYLYGGFR LQSPGVHASR AARVHLSVLL GVFVLLKAVA YWIDRYGLVF
SDRGFAYGAS YTDVNAVLPA KTILAIIALI CAALFFAGVV RPGGMLPGVS FGLLVLSAIL
IGGVYPALVE QFQVKPNQQD KEQVYIKRNI DATRKAYGVD KSEVTDYTAQ GDASKVNVTA
DSSISGVRLL DPNLVAKTYQ QKQRIRGYYD FHDPLDVDRY PDESGKLRDT VVAVRELTGP
PREQDNWINR HLVYTHGYGF VAAPGNEVDS QGLPNFDAKD MPVTGPLVQR TGLKESRIYF
GESPTAPEYV VVGGDKKQEL DYPESGGTGQ QNTTYVGKGG VPVGSFLNRV LYAAKYGEKN
LLLSSDINDK SKILYERNPL DRIAKVAPFL SLDDNPYPAI VDGRVVWIAD AYTTSNAYPY
SDSRSLEAMT RDTVTDPRLV VQQPRDQINY MRNAVKATVD AYDGTVNLYA WDETDPILQT
WRKAFPGIIK PQSEMGDGLK QHLRYPEALF KVQRDVLSRY HIEDPNAFYS GQDFWNVPND
PSSGERDVKQ PPYYLSVKMP DTTAPTFSLT TTFVPRQGPN LAAFMAVDAT PGADYGKLRI
LRMPSNTTIP GPGQVQNNFQ NKFSGELNLL GLGQAKVRYG NLLTLPFAGG LVYVEPVYVE
IAAASGQEPY PILRRVLVSY GDKVGSADTL EAALQQVFGE GAAPPAKPDV TKPNTPAQPS
TALSQAIGEA QQAYEKAQAA LAKNPPDWTA YGEAQKELEK ALEKLKGVAV PSATATPVPS
QSPTPAPSGS PTPAATPTSS PSP