Gene Sros_4060 details

Gene Information

Locus tag: Sros_4060
Symbol:
ID: 8667354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4519352
End bp: 4522531
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339711
Protein GI: 271965515
COG category:
COG ID:
TIGRFAM ID:
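
The start, end, gene length and protein length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4519352, 4522531)
print(length_bp)                  # 3180
print(protein_length(length_bp))  # 1059
```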


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00663013
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0372283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCT ACCTGCGATT CCCGACGATC TTCGGCGACC GGGTCGTCTT CGCCGCCGAG 
GACGACCTGT GGATGGTGCC CGTCACCGGC GGACGGGCCT TCCGGCTGAC CGCCGGGGTG
GCCGAGGCGG GCTACCCCCG GTTCTCCCCG TGCGGCGACC AGCTCGCCTT CGCCGGCCGC
GAGGAGGGGC CGGAGGAGGT CTACGTGATG CCCGCCGACG GCGGGGCGGC CCGGCGGATC
ACCTATCACG GGGCCCGCTC CACGGTCACC GGCTGGGATC CCGACGGCGC CGTCCTGTAC
GCCAGCGACG AGTCCCAGCC CTTCGAGGGG CAGAAGTGGC TGCACCGGAT CCACCCGGAC
GGGATTCCGG AGCGCCTGCC GTACGGCCCG GCCAACTCCA TCTCCTACGG CCCGCAGATC
GTGCTGGGCC GCAACACCGC CGACCCGGCC CGCTGGAAGC GCTACCGGGG CGGCACCGTG
GGCGACCTGT GGATCGGCAC CGAGGAGTTC CGGCGGCTCA TCGCCCTGCC GGGCAACCTG
GCCTCGCCCT GCTGGGCCGG GGAGCGGGTC TACTTCATCT CCGACCACGA GGGTGTCGGC
AACGTCTACT CCTGCACCGC GGACGGCGGG GACCTGCGCA GGCACTCCGA CCACGCCGAC
TACTACGCCC GCAACCTGTC CGGTGACGGC CACCGGCTGG TCTACCACGC CGGAGCCGAG
CTCTACCTGG TCGAGGACGG GGAGTCACAC CGTGTCGAGG TGTGCCTGCG CAGCTCCCGC
ACCCAGCGCA ACCGCCGTTT CGCCGCGGCC GAGGACTTCC TCGACAGCGC CACGCTCAGT
CCCGACGGCA GCGGCCTGGC CATCACCACC CGGGGCAAGG CGTTCTCCTT CGCCGACTGG
GAGGGCCCGG TCCGCCAGCA CGGCGCGCCG TACGGCGTCC GCTACCGCCT GCTGACCTGG
CTGAACGACG ACGAGCGGCT GATCGCGGCG GCCAGCGACG ACGGCGACCG CGAGGTGCTG
TCCATACTCA CCGCCGACGG CAGCGCCGAA CCGGTCCAGC TCGACCACCT CGACACCGGG
CGCGTCACCG CGCTGGAGGT CTCCCCCAAG GACGACAGGG TCGCGATCGC CAACCACCGC
AACGAGCTGC TCGTGGTCGA CCTCACCGGG GGCACGGTGA CCGGCGCCCG GAGCACGGTG
ATCGACGCCA GCAGGTTCGG CGCGATCGAG GACCTCGCCT GGTCTCCCGA CGGCCGCTGG
CTGGCCTACG CCTGCCGTGA CACCGCGCAG ACCATGGCCG TCAAGCTGTG CCGGATCGAG
ACCGGCGAGA CGTTCTTCGC CACCCGCCCG GTGCTGTGGG ACAGCGGCCC CGCCTTCGAC
CCCGGCGGCG ACTACCTCTA CTTCATCGGC CAGCGCGTCT TCAACCCGGT CTACGACGAG
CTCCAGTTCG ACCTGGGATT CCCCCTCGGC TCCCGCCCCT ACGCCATCGG GCTCCGCGCC
GACGTCCGCT CCCCCTTCGT CCCCGAACCC CGGCCGCTCA AGGACGACGA CGATGACGAC
GACGGTGACG GTGACGGTGA CGACGGGCAG GAGACCGAGG TGGTCATCGA CCTGGCGGGC
ATCCAGGACC GCGTCGTCGC CTTCCCCGTC CCCGAGGGAC GCTACGACCG CATCGCCGGG
ATCAAGGGCA AGGCCGTCTA CCTGACGTTC CCCGTGGAGG GCAGCCTCGG CGACGACTAC
GCCGACTCCT CCGACGGCAC GCTGCAGGTC TACGACTTCG CCGGCCAGAA GCAGGAGACC
CTGGTCGGGG ACGTCTCGGA GTTCCAGCTG GGCCGTGACG GCACCACCCT GCTCTACCAG
GCCGGAAAAC GGCTGCGGGT GATCAAGGCG GGCGAGGCGC CCGAGGACGA CGACACGCCG
AGCCGCGGCA GCGGATGGGT CGACCTGTCG CGGGTCAAGG TGTCCATCCG CCCGGAGGCC
GAGTGGCGGC AGATGTTCCG CGAGGCCTGG CGGCTGCAGC GGGAGAACTT CTGGACCCAG
GACATGGCCG GGATCGACTG GGAGGGTGTC TACCGGCGCT ACCTCCCGCT GGTGGACCGG
GTCACCACCC GGGGAGAGTT CTCCGACCTG CTGTGGGAGC TGCTCGGCGA GCTCGGCACC
TCCCACGCCT ACGAGAGCGG CGGCGCCTAC CCGTCCCGGC CGCACTACCG GCAGGGCAAG
CTCGGCGTCG ACTGGTCCTT CGAGGACGGC CTCTACCGGG TCGCCCGGAT CGTCAACGGC
GACCGCTGGG ATCCCGAGGT CACCTCGCCG CTCAACCGCC TCGGGGTGGA CGTACGGCCC
GGCGACGTGG TGCTGGCCGT CAACGGCCAG CCCGTCGGCC CGTCGGCCGG CCCGGACGAA
AGGCTGGTCA ACCAGGCCGA TCAGGAGGTC CAGCTCACCG TCAGGCGCGG GCAGGACAAG
CGGACCTTCA ACGTGAAGGC CATCGGCGAC GAGCAGCCGG GCCGCTACCG CGACTGGGTG
GAGGCCAACC GGACCCACTG CCACGAGCGC AGCGGCGGCC GGGTCGGCTA CCTGCACATC
CCCGACATGG GGCCGGACGG CTACTCCGAG TTCCACCGCG GCTTCCTCAC CGAATACGAC
CGGGAGGGCC TGATCGTGGA CGTCCGGTTC AACGGCGGCG GCCACGTGTC GGCCCTGCTG
CTGGAGAAGC TCTCCCGCCG CCGCCTCGGC TACAACTTCC CGCGGTGGAG CGTGCCCGAG
CCCTACCCCG ACGAGTCCCC CCGGGGTCCG ATGGTCGCGA TCACCAACGA GTGGGCCGGC
TCCGACGGCG ACATCTTCAG CCACACCTTC AAACTGCTCG GCCTGGGCCC GCTGATCGGC
AAGCGCACCT GGGGCGGGGT GATCGGCATC TGGCCCCGGC ACCAGCTCGC CGACGGCACG
GTCACCACCC AGCCGGAGTT CTCCTTCGCC TTCGACGACG TGGGCTGGCG GGTGGAGAAC
TACGGCACCG ACCCCGACAT CGAGGTGGAC ATCACCCCGC AGGACTACGC CCGCGGCGTG
GACACCCAGC TCGACAAGGC GATCGAGGTC GCCCTGGAAC GCCTGCTCCT CCATCCCCCG
CACACGCCCA ATCCGGCCGA CCGGCCGCGG CTCACGGTCC CGCGCCTGCC ACCTCGCTGA
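
GC content is the percentage of G and C bases in a nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied in the whitespace-separated 10-base blocks used in this record; the function name gc_percent is hypothetical. It is applied here to the first displayed line only, so the result differs from the whole-gene figure reported in the Gene Information section.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence as displayed in this record.
first_line = (
    "ATGTCCGCCT ACCTGCGATT CCCGACGATC TTCGGCGACC GGGTCGTCTT CGCCGCCGAG"
)
print(round(gc_percent(first_line), 1))  # 66.7 (40 of 60 bases are G or C)
```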
 
Protein sequence
MSAYLRFPTI FGDRVVFAAE DDLWMVPVTG GRAFRLTAGV AEAGYPRFSP CGDQLAFAGR 
EEGPEEVYVM PADGGAARRI TYHGARSTVT GWDPDGAVLY ASDESQPFEG QKWLHRIHPD
GIPERLPYGP ANSISYGPQI VLGRNTADPA RWKRYRGGTV GDLWIGTEEF RRLIALPGNL
ASPCWAGERV YFISDHEGVG NVYSCTADGG DLRRHSDHAD YYARNLSGDG HRLVYHAGAE
LYLVEDGESH RVEVCLRSSR TQRNRRFAAA EDFLDSATLS PDGSGLAITT RGKAFSFADW
EGPVRQHGAP YGVRYRLLTW LNDDERLIAA ASDDGDREVL SILTADGSAE PVQLDHLDTG
RVTALEVSPK DDRVAIANHR NELLVVDLTG GTVTGARSTV IDASRFGAIE DLAWSPDGRW
LAYACRDTAQ TMAVKLCRIE TGETFFATRP VLWDSGPAFD PGGDYLYFIG QRVFNPVYDE
LQFDLGFPLG SRPYAIGLRA DVRSPFVPEP RPLKDDDDDD DGDGDGDDGQ ETEVVIDLAG
IQDRVVAFPV PEGRYDRIAG IKGKAVYLTF PVEGSLGDDY ADSSDGTLQV YDFAGQKQET
LVGDVSEFQL GRDGTTLLYQ AGKRLRVIKA GEAPEDDDTP SRGSGWVDLS RVKVSIRPEA
EWRQMFREAW RLQRENFWTQ DMAGIDWEGV YRRYLPLVDR VTTRGEFSDL LWELLGELGT
SHAYESGGAY PSRPHYRQGK LGVDWSFEDG LYRVARIVNG DRWDPEVTSP LNRLGVDVRP
GDVVLAVNGQ PVGPSAGPDE RLVNQADQEV QLTVRRGQDK RTFNVKAIGD EQPGRYRDWV
EANRTHCHER SGGRVGYLHI PDMGPDGYSE FHRGFLTEYD REGLIVDVRF NGGGHVSALL
LEKLSRRRLG YNFPRWSVPE PYPDESPRGP MVAITNEWAG SDGDIFSHTF KLLGLGPLIG
KRTWGGVIGI WPRHQLADGT VTTQPEFSFA FDDVGWRVEN YGTDPDIEVD ITPQDYARGV
DTQLDKAIEV ALERLLLHPP HTPNPADRPR LTVPRLPPR
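
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming the standard codon assignments (NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs from the standard table only in its permitted start codons); the names CODON_TABLE and translate are hypothetical. It is shown on the first and last displayed lines of the gene sequence rather than the full 3180 bp.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of the three
# codon positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGTCCGCCT ACCTGCGATT CCCGACGATC TTCGGCGACC GGGTCGTCTT CGCCGCCGAG"
last_line = "CACACGCCCA ATCCGGCCGA CCGGCCGCGG CTCACGGTCC CGCGCCTGCC ACCTCGCTGA"
print(translate(first_line))  # MSAYLRFPTIFGDRVVFAAE
print(translate(last_line))   # HTPNPADRPRLTVPRLPPR
```

Both outputs match the start and end of the protein sequence listed above; the final TGA codon is read as a stop and is not emitted.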