Gene Sros_7875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7875 
Symbol 
ID8671198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8684340 
End bp8686598 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content72% 
IMG OID 
Productpolysaccharide deacetylase 
Protein accessionYP_003343279 
Protein GI271969083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACAC AGGTGTGGCC CCCGCCGAAC CTGTCCGGGA CGAACCCGGC CGAGACGGCG 
AGACTCGACG TCTCCGCCAC CATGCCCCTC ACCGACCTGA GGCCTCCCCC TCGCGAGCAG
CCCCCCGGCA GAGGGAAGCG CCCCGTCCGG CGGCGCCGCC AGGGCCAGGC CGACCCGCGC
GGACACTGGT TCCTCGTCGC GGTCGGCATC GTGCTGATGG TCGGAGCGCT GGTGCTCAAC
GGATACGCCA CCAACGCGGT GGGCGAGACG CACGCGGCCC CGCCCCCGGC CGGCCGGCCT
TCCGCGGACC TCGACCTCGT ACGGCGCAGC GGTCCGGTGC TCAACCTCAC CGGCGACAAA
CCGGACAGCG CCCGCCTCCC GGCCCGCACG GTGGCGCTCA CCTTCGACGA CGGCCCCGAC
CCGCGCTGGA CCCCTCAGCT CCTCGATGTG CTGGCCCGGC ACGGCGCCAA GGCGACGTTC
TTCGCCGTCG GCGCCCGGAT CGCCGAGAAC CCGGAGCTGA CCCGGCGGAT CATCGCCGAA
GGGCACGAGA TCGGCAACCA CACCTACGCC CACGCCGACC TCACCGCCGT GCCGGAGTGG
CGGCTGCGGC TGGAGCTGTC GCTGACCCAG AAGGCACTGG CCGGTACGGC GGGCATGCAC
ACCCGCCTGG TGCGCCCGCC GTACTCCTCC TCGCCGGCGG CCGTCACCGG CCCCCAGTTG
CGGGCGCTGC ACGCCATGGG TGGTGAGGGA TATCTCGTCG CGCTTACCGA CCTGGACACC
AAGGACTGGG CCAGGCCCGG CGTCGAGCAG ATCGTCCGCG CCGCACTTCC CCGCCGGGGC
AGAGGCGCGG TGATCATGAT GCACGACTCC GGAGGGGACC GCGGCCAGAC GGTCGCTGCG
GTGGATCGGC TGCTCACCCA GCTCGCCGAG CAGAGATACC GGGCGACGAC GCTCACCACC
GCGCTCGGCA TGCCGTCGGC CCTCACCCCG GCGGGAGGCA TAGACCGTTT CGTCGGCACC
GGCCTGAGCC TCGCCCAGCG CGGCGCGACC GGCTTCGTTA CGGCGATGTC GTGGATCCTG
GTCGTCGCCG GCGTGATCAC CTTGCTGCGG CTGCTGTTCT TCCTGGTCCT GGCCTGGGTG
CACGCCCGGC GGGTGCGGGG TGGCAAGAGG CGCGCCGGGC GCGCTCCGGC ATGGCCCGAG
CCACCGGCCG TCACCGTAAT CGTCCCCGCC TACAACGAGG CGGCCGGGAT CGAGGCGACC
GTACGCTCGC TGGTGAACAC CGACTACCCA GGCGTGCTGG AGGTGGTCGT CGTGGACGAC
GGCTCCTCCG ACGACACCGC CGCCATTGCC GCCTCCCTCG GCCTGCCCGG AGTGCGGGTG
ATCCGTCAGG AGAACGGCGG CAAACCCTCG GCGCTCAACA CCGGCATCGC GCACGCCTCG
CACGACATCC TGGTCATGGT GGACGGCGAC ACCGTCTTCG AGCCCGCCAC CATCGGGCAC
CTGGTCCGGC CGCTCTCCGA CCCGGCCGTA GGCGCGGTCA GCGGCAACAC CAAGGTCGGC
AACCGACGCG GCATGATCGG CCGCTGGCAG CACATCGAAT ACGTCATCGG CTTCAACCTC
GACCGCCGCG CCTTCGACCT GCTGGGCTGC ATGCCGACCG TACCGGGGGC GATCGGAGCC
TTCCGCCGCT CGGCGCTCCA GGAGATCGGC GGCGTCAGCG TCGACACCCT GGCCGAGGAC
ACCGACCTGA CCATGGCGAT GTGCCGGGGC GGTTGGCGGG TGGTCTACGA GGAGAACGCC
CTGGCCTGGA CCGAGGCGCC GACCTCGCTC AGCCAGCTGT GGCGCCAGCG TTACCGGTGG
TGCTACGGCA CGTTGCAGGC GATGTGGAAG CATCGCCGCG CGATCACGGA GCCGAGCCCC
TTCGGCCGTC GCTGCCTGGG TTATCTGACG CTCTTCCAGG TGGTGCTGCC GCTGCTCGCC
CCGGTCGTGG ACGTCATGGC CGTCTACAGC GTGGTGATGG GAGATCCGTT GCCGGTGGTG
GCGGTCTGGG CTGGATTCGT GCTGGTCCAG GCGTTCAGCG GCTGGTACGC GCTGCGGCTG
GACCGCGAGC GGGCCTCGGT GCTGTGGGTG CTGCCGTTGC AGCAGTTCGT CTACCGGCAG
CTGATGTACC TGGTGGTGAT CCAGTCGGTG GCGACCGCCG TACTCGGAGT GCGCCTGCGC
TGGCAGACGA TCCGCAGGGA GGGCACCTTC GCGGCCTGA
 
Protein sequence
MGTQVWPPPN LSGTNPAETA RLDVSATMPL TDLRPPPREQ PPGRGKRPVR RRRQGQADPR 
GHWFLVAVGI VLMVGALVLN GYATNAVGET HAAPPPAGRP SADLDLVRRS GPVLNLTGDK
PDSARLPART VALTFDDGPD PRWTPQLLDV LARHGAKATF FAVGARIAEN PELTRRIIAE
GHEIGNHTYA HADLTAVPEW RLRLELSLTQ KALAGTAGMH TRLVRPPYSS SPAAVTGPQL
RALHAMGGEG YLVALTDLDT KDWARPGVEQ IVRAALPRRG RGAVIMMHDS GGDRGQTVAA
VDRLLTQLAE QRYRATTLTT ALGMPSALTP AGGIDRFVGT GLSLAQRGAT GFVTAMSWIL
VVAGVITLLR LLFFLVLAWV HARRVRGGKR RAGRAPAWPE PPAVTVIVPA YNEAAGIEAT
VRSLVNTDYP GVLEVVVVDD GSSDDTAAIA ASLGLPGVRV IRQENGGKPS ALNTGIAHAS
HDILVMVDGD TVFEPATIGH LVRPLSDPAV GAVSGNTKVG NRRGMIGRWQ HIEYVIGFNL
DRRAFDLLGC MPTVPGAIGA FRRSALQEIG GVSVDTLAED TDLTMAMCRG GWRVVYEENA
LAWTEAPTSL SQLWRQRYRW CYGTLQAMWK HRRAITEPSP FGRRCLGYLT LFQVVLPLLA
PVVDVMAVYS VVMGDPLPVV AVWAGFVLVQ AFSGWYALRL DRERASVLWV LPLQQFVYRQ
LMYLVVIQSV ATAVLGVRLR WQTIRREGTF AA