Gene Sros_3338 details

Gene Information

Locus tag: Sros_3338
Symbol:
ID: 8666626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3649182
End bp: 3655010
Gene Length: 5829 bp
Protein Length: 1942 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Type II secretory pathway pullulanase PulA and related glycosidase-like protein
Protein accession: YP_003339020
Protein GI: 271964824
COG category:
COG ID:
TIGRFAM ID:
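
The reported lengths follow from the coordinates above. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (both assumptions are consistent with the listed values but are not stated by the record; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(3649182, 3655010)
print(length)                  # 5829
print(protein_length(length))  # 1942
```

Both results match the Gene Length and Protein Length fields of the record.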


Plasmid Coverage information

Number of covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 14
Fosmid unclonability p-value: 0.0998506
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCTGGC CACTCCTCCC ACGACGCGCG AGAGCGACCC TCCTGTCGGG TGCCGTCCTC 
ACGGCGGCCC TGATCATCCC CCTCGCCGCA CCCCTCCCCC CCGCCGCCGC CGAAGCACGG
CGACCGGCCC TGCCCGCCGG AGCCGTGACG GCGCCACAGC CGCCCGCCTC CCCCGTGGTC
TCCGAGCTGA GCCGTACGGC GGGGCCCTCC GACCGCGACC TCGCCCGGGA CGCGCTCCGC
GCCGACCTGA GCCGCGAGCG CTTCTACTTC GCGATGACCG ACCGGTTCGC CAACGGCGAC
ACCGGCAACG ACCGGGGCGG GCTGTCCGGA GACCGGGACG CCACCGGCTA CGACCCCACG
CACAAGGGCT TCTACCAGGG CGGCGACCTC AAGGGACTGC TCGGGAAGCT CGACTACGTC
AAGAACCTCG GCAGCACCGC GATCTGGATC ACCCCGGCGT TCCGGAACCG TCCCGTCCAG
GGCACGGGCG CGAACGTCTC GGCCGGCTAC CACGGCTACT GGATCACCGA CTTCACCCGG
ATCGACCCGC ACCTGGGCAC GAACGCCCAG ATGAAGCAGC TCGTGCGCGA GGCGCACCGG
CGCGGGATGA AGGTCTTCTT CGACATCATC ACCAACCACA CCGCCGACGT GATCGACTAC
CGGGAGAAGA CCTACTCCTA CCGGTCGAAG GGGGCCTACC CCTACGTAGA CGCCGGCGGC
GCGCCGTTCG ACGACCGGCG ATACGCCGGC GGGGACACCT TCCCCGAGGT CGGCACCGGC
TCCTTCCCCT ACACGCCGGT CGCGCCCGGG GACGCCAAGA CCCCGTCCTG GCTGAACGAC
CCGACGATGT ACCACAACCG GGGCGACTCC ACCTTCAGCG GCGAGAACGA CCAGTACGGC
GACTTCTTCG GCCTCGACGA CCTGTGGACC GAACGGCCCG AGGTCGTCAA GGGCATGACC
GACATCTACA AGACCTGGGT GCGCGACACC GGTCTCGACG GCTTCCGCAT CGACACCGCC
AAGCACGTCA ACATGGAGTT CTGGGAGAGG TTCTCCCCCG CCCTGCGCGG CTACGCGGCC
GGACTCGGCA ACAAGCGCTT CTTCATGTTC GGCGAGGTCT ACTCCAGCGA GCCGGCCTTC
ACCAGCCGCT ACTCCACCCG CGGCGGGATG AACGCCACCC TGGACTTCCC GTTCCAGGAG
GCCGCCCGCT CCTTCAGCGG CGGTACGGCC GGAGCCGCCA GGCTGGCCCA GCTCTTCGCC
GGAGACGACC ACCACACCGA CGCCGACGGC AACGCGGCGT CGCTGCCGAC CTTCCTCGGC
AACCACGACA TGGGGCGCAT CGGCAGGTTC ATCGCCCAGG ACAACCCCGG CGCCGCCGAC
TCCGAACTCC TGCGCAGGGA CCTGCTCGCC CACGAGCTGA TGTATCTCAC CCGGGGCCAG
CCGGTCGTCT ACTACGGCGA CGAGCAGGGC TTCACCGGCA AGGGGGGCGA CCAGGACGCC
CGCGCCTCCA TGTTCGCCTC CCGGACCGGC AGCTACCTGA GCGACGACCT GATCGGCACC
GAGGCCACCC ACGCCCAGGA CAACTTCGTC CCCGGCCACC CGCTCTACCA GGGCATCTCG
GCCCTGGCCC GGCTCCGTGA CGCCCACCCG GCCCTGGCCG ACGGCGCCCA GGTCGAGCGG
CTCGCCGCGG GCGGCGTCTA CGCCTTCTCC AGGATCGACG CCAGAGCCCA GGTCGAATAC
GTCGTCGCGG TCAACAACGC CGAGCAGCCG GCCGCCGTGA ACGTGCCGAC CTTCTCGGCG
GACGTGGCCT TCACCAAGGT GTACGGCGAG GCCGCCGCGG CGCTCACGAC GGGTGCGGAC
ACCAAGCTGG CGGTGACCGT GCCGGCCCTG TCGGCGGTGG TCTACCGGGC CTCGGCGAGG
CTCGCCGCCC CGGCGGCGAC CCCGGCGGTG TCGATCGCCC TGCCCGGCGC CGAGATCAGG
GGGACCGCCG AGGACGGCCG GATCCCCGTC ACCGCGACCG TGCCGGGAAC CGGCTTCGAC
CAGGTGACCT TCGCCGCCAA GGTGGGCGGC GGCTCCTGGA AGGTGCTCGG CACCGACGAC
GCGCCCAACG CCGTGCCGGG CGCGAGCCGT ACCTTCCGCG TCTTCCACGA CCTGCGCGGC
ATCCCGGCGG GAACGGAGAT CGCCTACAAG GCGGTCGTGA AGGACTCCGC CGGCAGGTTC
GCCTCGGCGA CCGCGACGGC CGCCGTCGGC CCCGAGCCCG CCCAGGAGGA GCCCAGCGCG
GTCAAGCGCG ACTGGCTGGT CGTGCACTAC CAGCGCGAGG ACGCCGACGG CTGGGGCCTG
CACGTCTGGG GCGACGTCGA CAAGCCGACC GAGTGGGGCA GCCCGCTCCC GCTGACCGGC
GAGGACTCCT ACGGCAGGTT CGCCTGGATC AAGCTGAAGC CCGGCGCCTC CAACGTCGGG
ATCATCGTCC ACAAGGGCGA CGAGAAGGAC GGCGGCGACC GCGTCGTCAA CCCGGCCAGG
ACCGGCGAGG TCTGGCTGGC GGCGGGCGAC CCCGCCACCC ACGCCTCCCG CGCGGCGGCC
CAGGGCTACG CGACCGTCCA CTACCGCCGC CCCGACGGGA ACTACCAGGG CTGGGGCCTG
CACCTGTGGG GCGACGGGCT GGCCGACGGA GTGCCCACCG AGTGGGCCGC GCCGCGCCCT
CCGGACGGCA CCGACGCCTA CGGCGTGTTC TGGAAACTCC CGCTGAAGAA CGCCTCAGCC
CCGGTGAACC ACATCATCCA CAGGGGCGAC ACCAAGGACC CCGGCCCCGA CCAGACGTTC
ACCCCCGCCC TCCAGCCGGA CGCCTACGTC GGCTCGGGTG TGGCGAAGGT CCATCCGACC
CGGGCCGCGG CCGAGAACGT CGCGATCCTG CATTACCACC GGCCGGACGG GAATTACGAA
GGCTGGGGAC TACACCTGTG GGGTGACGTG GCCACTCCCA CCGAGTGGGC CACCCCGCTC
CAGCCGGCGG GTGAGGACGG GTTCGGAGTC CACTTCCGCG TCCCCCTGAC CGAGGGGGCG
AAGAACGTGA GCTACATCAT CCACAAGGGC GACGAGAAGG ACCTGCCGGA CAACCAGGCG
CTGGACCTCA CCGCCGCGGG CCACGAGGTC TGGCGGGTGG CGGCCACCGA GGGCCACCTG
CTGCCCCAGC CCCCGGCCCG CGGCGCGGCC GCCGACCTGA GCAAGTCGGC GGCCCACTGG
ATCGACCGGG ACACCGTAGC CTGGAAGGTC GAGCCGTCCG CCTCGCTCCA CCATTCCCTG
GCCTTCTCGG AGAAGGGCGA CATCGCCTAC GCCAAGGGGG ACCTCACCGG CGACCTGCGG
ATCATCCGGC TGATCCCCGG CGAGCTCACC GATGCCCAGA AGGCCAAGTG GCCGCACCTG
GCCGGCTACG CCGCGCTGAA GGTGGACCCG CGCGACGCCG GCCTGACCGG TAAGGCCCTG
CGCGGCCAGG TCGTGGCCGT CGAGCGGGAC GCCTCGGGTG TGCTGCTCAC CGCCACCGGC
GTGCAGATCC CCGGCGTGCT CGACGACGTC TACGCCAAGG CGGCCGGCGT CGAGCTCGGC
CCGGTCTGGC GCGGGACCCC CAGGTTGTCG GTCTGGGCGC CGACCGCGCG GAAGGTCGAG
CTGGCGCTCC ACCGCGACGC CGCGGGCGGC GGCCGTACCG TCCACGAGAT GCGGCGCGAC
GACGAGACCG GCGTCTGGTC GGTGCGGGGC CTGTCCTCCT GGAAGGGGCG CTACTACACC
TTCCTGGTCA CGGTCTACTC GCCCGCGGCG GGCAAGGTCG TCACCAACGA GGTGACCGAC
CCCTACAGCC TGTCGCTGGC CGCCGACTCG GTCCGCAGCC AGGTGGTGGA CCTGTCCGAC
CGGTCCCTCG CGCCCGGCGG CTGGTCCTCT CTGGCCAAGC CCCAGGCCGT GACCCAGGAC
AGGGCGTCGG TCTACGAGCT GCACGTGCGT GACTTCTCCG CCTCCGACGC CTCGGTCCCG
GCGGACCGGC GGGGCACCTA CGCGGCGTTC GCCGGGGACG GCGCCGGGAT GAAGGAGCTG
CGCAAGCTCG CCGAGGACGG CCTGACCCAC GTGCACCTGC TGCCGGTCTT CGACATCGCG
ACCGTCCCGG AGAGGAAGGC CGACCGCGCC GAGCCGGACT GCGACCTGGC CTCGATGCCC
GCCGACTCCG ACCAGCAGCA GGCCTGCGTC GCGAAGGTCG CGGCCAAGGA CGCCTTCAAC
TGGGGATACG ACCCGCGGCA CTACACCGTG CCCGAGGGCT CCTACGCGTC CGACCCGGAC
GGCTCCGGCC GGATCAAGGA GTTCCGGGGC ATGGTGGCGG GGCTGAACGG AGCCGGGCTC
CGGGTGGTCA TGGACGTGGT CTACAACCAC ACCCACGCCG CCGGCCAGGA CCCCACCTCG
GTGCTCGACC GCATCGTGCC CGGCTACTAC CACCGGCTGC TCGACGACGG TGCCGTGGCC
ACCTCCACCT GCTGCGCCAA CACCGCGCCG GAGCACGCGA TGATGGGCAA GCTCGTCGTC
GACTCCGTCG TCACCTGGGC CCGCGACTAC AAGGTGGACG GCTTCCGGTT CGACCTGATG
GGCCACCACC CGAAGGCGAA CATGCTCGCG GTCCGCAAGG CCCTGGACGG GCTGACCCTC
GCCAAGGACG GCGTGGACGG CAAGTCGATC ATCCTGTACG GCGAGGGCTG GAACTTCGGC
GAGGTGGCCG GAGACGCCCG CTTCGAACAG GCCACCCAGC TCAACATGGC CGGCAGCGGG
ATCGGCACCT TCAGCGACCG CCTGCGTGAC GCGGTGCGCG GCGGCAGCCC CTTCGACGCC
GACCCCCGGG TCCAGGGCTT CGGCTCGGGC CTGGCCGGAG CGCCCAACGG ATCGCCGGCC
AACGGCACGG CCGAGCAGCA GCGGGCCCGC CTCCTGGGCT ACCAGGACCT GATCAAGGTG
GGGCTCACCG GCAACCTGCG CGACTACACC TTCACCGCCT CCGGCGGCAG GCAGGTCAAG
GGCTCGGAGG TCGACTACAA CGGCTCCCCG GCCGGCTACA CCGCCTCGCC GGGCGAGGTC
GTGACCTACG TGGACGCCCA CGACAACGAG ACGCTGTTCG ACGCCCTGGC CTACAAGCTG
CCGCAGGCGA CCACGATGGC CGACCGGGTG CGGATGCAGT CGCTCTCCCT GGCCACGGCC
GTGCTGGCGC AGGGCACCTC CTTCGTCCAC GCGGGCAGCG AGCGGCTGCG CTCGAAGTCG
CTCGACCGCA ACTCCTTCGA CTCCGGTGAC TGGTTCAACC GGCTGCTGTG GGACTGCTCG
CAGGGCAACG GCTTCGGCGC GGGCCTGCCG CCCAGGGCCG ACAACGAGGA CAAGTGGGCC
TACGCCAGGC CGCTGCTCGC CGACCCGGCG CTCAGGCCCG ACTGCGCGTC GATCGGCTCC
GCGCGGGCCA GGTACGGCGA GCTGCTGAGG ATCCGTTCCT CCTCGCCCGC CTTCGCGCTC
GGCTCCCTGG CCGAGGTGCA GAAGCGGCTG ACCTTCCCCA CCAGCGGCGC CGCGGAGACC
CCGGGCGTCG TCACCATGCA CCTGGACGCC TCAGGCATCG ACCCGCGCTG GAAGTCGATC
ACCGTCGTCT TCAACGCGAC CCCCGAAACG CAGCCGCAGA CCGTGACCGC GCTGAAGGAC
GCCCAGGTGA CGCTCCACCC CGTCCAGACG GCCGGCGACG ACGCCGTCGT GAAGCAGTCG
GCCTTCGATC CCATCACCGG CACGCTGACG GTCCCGGCCC GCACGGTGGC CGTCTTCGTG
CGTTCCTGA
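
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The following is a rough sketch of how the blocks could be cleaned and a GC fraction computed, for comparison with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used here for brevity:

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace; upper-case it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "GTGGTCTGGC CACTCCTCCC ACGACGCGCG AGAGCGACCC TCCTGTCGGG TGCCGTCCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Passing the full 5829 bp sequence through the same two functions would give the gene-wide value.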
 
Protein sequence
MVWPLLPRRA RATLLSGAVL TAALIIPLAA PLPPAAAEAR RPALPAGAVT APQPPASPVV 
SELSRTAGPS DRDLARDALR ADLSRERFYF AMTDRFANGD TGNDRGGLSG DRDATGYDPT
HKGFYQGGDL KGLLGKLDYV KNLGSTAIWI TPAFRNRPVQ GTGANVSAGY HGYWITDFTR
IDPHLGTNAQ MKQLVREAHR RGMKVFFDII TNHTADVIDY REKTYSYRSK GAYPYVDAGG
APFDDRRYAG GDTFPEVGTG SFPYTPVAPG DAKTPSWLND PTMYHNRGDS TFSGENDQYG
DFFGLDDLWT ERPEVVKGMT DIYKTWVRDT GLDGFRIDTA KHVNMEFWER FSPALRGYAA
GLGNKRFFMF GEVYSSEPAF TSRYSTRGGM NATLDFPFQE AARSFSGGTA GAARLAQLFA
GDDHHTDADG NAASLPTFLG NHDMGRIGRF IAQDNPGAAD SELLRRDLLA HELMYLTRGQ
PVVYYGDEQG FTGKGGDQDA RASMFASRTG SYLSDDLIGT EATHAQDNFV PGHPLYQGIS
ALARLRDAHP ALADGAQVER LAAGGVYAFS RIDARAQVEY VVAVNNAEQP AAVNVPTFSA
DVAFTKVYGE AAAALTTGAD TKLAVTVPAL SAVVYRASAR LAAPAATPAV SIALPGAEIR
GTAEDGRIPV TATVPGTGFD QVTFAAKVGG GSWKVLGTDD APNAVPGASR TFRVFHDLRG
IPAGTEIAYK AVVKDSAGRF ASATATAAVG PEPAQEEPSA VKRDWLVVHY QREDADGWGL
HVWGDVDKPT EWGSPLPLTG EDSYGRFAWI KLKPGASNVG IIVHKGDEKD GGDRVVNPAR
TGEVWLAAGD PATHASRAAA QGYATVHYRR PDGNYQGWGL HLWGDGLADG VPTEWAAPRP
PDGTDAYGVF WKLPLKNASA PVNHIIHRGD TKDPGPDQTF TPALQPDAYV GSGVAKVHPT
RAAAENVAIL HYHRPDGNYE GWGLHLWGDV ATPTEWATPL QPAGEDGFGV HFRVPLTEGA
KNVSYIIHKG DEKDLPDNQA LDLTAAGHEV WRVAATEGHL LPQPPARGAA ADLSKSAAHW
IDRDTVAWKV EPSASLHHSL AFSEKGDIAY AKGDLTGDLR IIRLIPGELT DAQKAKWPHL
AGYAALKVDP RDAGLTGKAL RGQVVAVERD ASGVLLTATG VQIPGVLDDV YAKAAGVELG
PVWRGTPRLS VWAPTARKVE LALHRDAAGG GRTVHEMRRD DETGVWSVRG LSSWKGRYYT
FLVTVYSPAA GKVVTNEVTD PYSLSLAADS VRSQVVDLSD RSLAPGGWSS LAKPQAVTQD
RASVYELHVR DFSASDASVP ADRRGTYAAF AGDGAGMKEL RKLAEDGLTH VHLLPVFDIA
TVPERKADRA EPDCDLASMP ADSDQQQACV AKVAAKDAFN WGYDPRHYTV PEGSYASDPD
GSGRIKEFRG MVAGLNGAGL RVVMDVVYNH THAAGQDPTS VLDRIVPGYY HRLLDDGAVA
TSTCCANTAP EHAMMGKLVV DSVVTWARDY KVDGFRFDLM GHHPKANMLA VRKALDGLTL
AKDGVDGKSI ILYGEGWNFG EVAGDARFEQ ATQLNMAGSG IGTFSDRLRD AVRGGSPFDA
DPRVQGFGSG LAGAPNGSPA NGTAEQQRAR LLGYQDLIKV GLTGNLRDYT FTASGGRQVK
GSEVDYNGSP AGYTASPGEV VTYVDAHDNE TLFDALAYKL PQATTMADRV RMQSLSLATA
VLAQGTSFVH AGSERLRSKS LDRNSFDSGD WFNRLLWDCS QGNGFGAGLP PRADNEDKWA
YARPLLADPA LRPDCASIGS ARARYGELLR IRSSSPAFAL GSLAEVQKRL TFPTSGAAET
PGVVTMHLDA SGIDPRWKSI TVVFNATPET QPQTVTALKD AQVTLHPVQT AGDDAVVKQS
AFDPITGTLT VPARTVAVFV RS
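
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial code) allows for an alternative initiator codon. Below is a minimal sketch of such a translation in plain Python. The codon-to-residue assignments are those of the standard code, which table 11 shares. The initiator set is an assumption based on the NCBI description of table 11 and should be checked against it. The function name is illustrative:

```python
BASES = "TCAG"
# Residues in NCBI order: first base T, C, A, G; then second base; then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiator codons for table 11; each is read as M at position 1 only.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon always yields methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed above.
print(translate_cds("GTGGTCTGGC CACTCCTCCC ACGACGCGCG"))  # MVWPLLPRRA
# Last 9 bases of the gene sequence: two codons followed by the TGA stop.
print(translate_cds("CGTTCCTGA"))  # RS
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above.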