Gene Sros_2256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2256 
Symbol 
ID8665538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2434914 
End bp2438045 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content72% 
IMG OID 
Productchondroitinase 
Protein accessionYP_003337981 
Protein GI271963785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0824242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGAGA TCACTCGTAG GACCGCGCTC AAGGCCGGAG CGGCGGGTGT GGCGGGAGCC 
GTCTTCCTGC GGCCCGGTAC GGCGTCCGCC GCCGTGAGCG GCGACCTGGA ACAGCGGGCC
CTGGCCATGA ACCCGCCGGT CTTCCTGCTG GAGACCGCCG TCCCGCCGCA GATGACCGCC
GGGCAGGGCT CGACGCTGAG CATCACCGAC CGGGCCGCGA TCTGCGGGAG CCACTCGCTC
CGCTGGGAGC ACGGCAGCAG GTCCACGATC ACGGTCAGCG GCCCCATCGG CTTCGCCCCC
GACCCCTACC GGCCCATGGA CGACCAGGCC TGGCAGGGGA CCGTGGACAC CTTCTCGGTC
TGGATCTTCA ACGAGACGCC GGTGGACGAC GCGGTCCGCT TCGAGTTCGG CAGGGGGGAC
CGGACCGACG CCTGGTTCGA GTTCCGGCTC GACTTCACCG GCTGGCGGAC GGCCTGGGTC
CGCTACGACT ACGACATGCA CGGCCGGCCC CATCCCGGCA TGGACACCCT GCGGATCATC
GCGCCCCGCC GCAGCGGGAC GCTCCATCTC GACCAGCTCA TGCTCAACGT GGCGCTGCGG
CCGGACGCCC CCACCCGCGA CGCCCAGGTG CCGGACGTCG CCAAGGAGGG CGAGGACTAC
GACAACCAGC ACTGGCAGGC GCTCTACCTC TACGACACCC TGCTGACCAG GGCCCGGCCG
GACATGTCCG CGCCGTCGGC GGAGGAGACG GCCTCGCTGC GCGCGCTGGT CACCCGCTAC
CGGGACGAGT ACCTCGTCTC ACCGGTGAAG GTGGACGACG CGTCGGTGGC CGCGCTCACC
GGCCAGGTCG CCGCGCTCGG CGTCCCCGAG GTGTCCTCGA GCGGGGTGGG CCGCCCAGTC
TTCTCCTACC AGTCGCAGAT CTACCCGCCG GAGATCGCCG CCGAGCTCAA GACGTTCGTC
AACGCGGTCA CGCTGCGCGC CTACACCGAC TTGATGAGGA CGGTGGCGCT GGCCTTCACC
TCGGCCGGCG CGGCGCACCG GCCGGCGCTC GCCGGCCTCT ACACCCGGCT GGTCGTCCAC
CTGCGCGACC AGGGCTGGAC CCGGGGAAGC TGCCAGGGCA CCATCCACCA CCTGGGCTAC
GACGCGCGCG GCTACTACGA CTCGGTCTAC CTCATGAAGG ACGCGCTGCG CGAGGCCGGA
CTGTTCGAGC AGGTCCGCGC CGACCTGGCC TGGCTGACCG GGCTCGGCCG GATCTTCCGG
GGCTGGGAGC ACCGCATGGC CTACGGCAGC GTCATCGACA TCCTCAACAC CACCTTCCGG
GGGATGCTCG CCGCCGTGCT GCTCCGCGAC ACCGAGGCCG AGCAGGTCGC CTACCTGCGG
GCGTTGCGGG CCTGGCTGAA CCGGGCGCTG CTGCCCAGCG CGGGCATCCA GGACGGGCTC
AAGGCGGACG GCGCCACCTT CCACCACGTC GGATTCTTCC CCGACTACGC CCGCGACGGC
TTCGTCGGAC TGGCCCCGCT GGTCTACGTG CTCAGCGGCG GAGCGTTCCG CCTCGCCGCC
GAGTCGCACG CCTCGCTCAA ACGGGCCGTG CTCGCCATGC GCGTCTACGC CAACAAGAAC
CACTGGCCCA TCTCGATCAG CGGCCGCAAC CCCAGCGGGC TGACCGCCCT GTCACCGGTG
CCCTACCAGT GGCTGGCGAT CTCCGGCACC CCCGACGGTT CAAGCGACGT GGACCCCGAG
CTCGCCGCCG CGTTCCTGCG CCTGCTGCCC GCCGCGCCGA ACACCCAGCA GAAGCAGCTC
GCCGTACGGC TGGCCGCACG CGGCATCGTC GCCGAACCCG ACCCCAACGG CAACTGGACG
ATCAACTACG CCGCCCTGGC CGTGCACCGG CGGGAGAACT GGCAGGTCAC GGTGCGCGGG
CACAACCGCT ACCTGTGGAG CACCGAGGTC TACCACGGCG CGAACTGGTA CGGCCGCTAC
AACACCTACG GGCAGATCCA GGTGCTGCAC CGGGGCAACC CGGTCAACAA CGCCGACAGC
GGCTACGCCC AGGCGGGCTG GGACTGGAAC CGCAGGCCGG GCACCACCGT CGTCCACCGG
CCGCTGGACG AGCTGCGGGC CGACCTGACC GGGGCGATCG AGGAGATGCT GCTCACCGAC
TCGCGGTTCG GCGGGTCGAA CACCATCGAC GGCCGCAACG GCATGTTCGC CATGGAGCTG
CGCGAGCACC CGAAGTACGT GGGCTCACAC CGCGCGCTGA AGTCGGTCTT CCTGTTCGAC
GACCGGATCG TGGCCGTCGG CACCGGGATC GAGAACGACT CCCGGCAGGA GACCGAGACC
ACGCTCTTCC AGACCCGCCT GCCCGGCCGG ACGGCTCCCA CCTACGTCGA CGGGACCGAG
CCGGTCGTGG CCTTCCCGCA CGCCGCGCCG GACCTCACGC CGCGCTGGCT GCTGGATGAC
AAGGGCATCG GCTACCACCT GGCGGCCGGG CAGAAGGTCG GCCTGACGCG CTCCACCCAG
ACCTCGCGCG ACAACGCGAC CGAGGCCCCG ACCAGCGGCG ACTTCGCGAC CGCCTGGATC
AAGCACGGCA CCGCTCCACG GGGCGGGAGC TACCGGTACG CCATGGTCGT CAACACCACT
CCCGAGCGGA TGGCCGCGTT CGCCGACGCG ATGGACGACC CGGCGAGCGC GCCGTACGCC
GTGCTGCGCG CCGACACCTC CGCACACGTG GTGAGCGACC GGGCGACCGG GATCACCGGT
TACGCGGTCT TCGCACCCCT GGAGCTGACC GAGGGGCCGG TACGCAAGGT GGACACGCCG
TCCATGGTGC TGGTCAGGGG CGACGGGGAC GACGGCCTGG TGCTGGCGGT GTGCGACCCC
GACCTGCGGC TCTACAGCGG CGTCGACCAC GACCAGTACG AGCGCGGCCG GTACGTCGGC
CACTACAGCC CCTGGTCGCG GCCGTGGCTG ACCAGCCCGA GCCACCCCCA CCGGATGCGG
GTGACCCTCG ACGGACGCTG GCGCGCGGAC GGCGACCAGC CGTGCGAGGT GCGGGCACGG
CACGACCGCA CCGTGGTCGA GTTCGAGACG GTGGACGGGC GGCCGGTCCA GGTCCGCCTG
CTCAGGGAAT GA
 
Protein sequence
MLEITRRTAL KAGAAGVAGA VFLRPGTASA AVSGDLEQRA LAMNPPVFLL ETAVPPQMTA 
GQGSTLSITD RAAICGSHSL RWEHGSRSTI TVSGPIGFAP DPYRPMDDQA WQGTVDTFSV
WIFNETPVDD AVRFEFGRGD RTDAWFEFRL DFTGWRTAWV RYDYDMHGRP HPGMDTLRII
APRRSGTLHL DQLMLNVALR PDAPTRDAQV PDVAKEGEDY DNQHWQALYL YDTLLTRARP
DMSAPSAEET ASLRALVTRY RDEYLVSPVK VDDASVAALT GQVAALGVPE VSSSGVGRPV
FSYQSQIYPP EIAAELKTFV NAVTLRAYTD LMRTVALAFT SAGAAHRPAL AGLYTRLVVH
LRDQGWTRGS CQGTIHHLGY DARGYYDSVY LMKDALREAG LFEQVRADLA WLTGLGRIFR
GWEHRMAYGS VIDILNTTFR GMLAAVLLRD TEAEQVAYLR ALRAWLNRAL LPSAGIQDGL
KADGATFHHV GFFPDYARDG FVGLAPLVYV LSGGAFRLAA ESHASLKRAV LAMRVYANKN
HWPISISGRN PSGLTALSPV PYQWLAISGT PDGSSDVDPE LAAAFLRLLP AAPNTQQKQL
AVRLAARGIV AEPDPNGNWT INYAALAVHR RENWQVTVRG HNRYLWSTEV YHGANWYGRY
NTYGQIQVLH RGNPVNNADS GYAQAGWDWN RRPGTTVVHR PLDELRADLT GAIEEMLLTD
SRFGGSNTID GRNGMFAMEL REHPKYVGSH RALKSVFLFD DRIVAVGTGI ENDSRQETET
TLFQTRLPGR TAPTYVDGTE PVVAFPHAAP DLTPRWLLDD KGIGYHLAAG QKVGLTRSTQ
TSRDNATEAP TSGDFATAWI KHGTAPRGGS YRYAMVVNTT PERMAAFADA MDDPASAPYA
VLRADTSAHV VSDRATGITG YAVFAPLELT EGPVRKVDTP SMVLVRGDGD DGLVLAVCDP
DLRLYSGVDH DQYERGRYVG HYSPWSRPWL TSPSHPHRMR VTLDGRWRAD GDQPCEVRAR
HDRTVVEFET VDGRPVQVRL LRE