Gene Sros_2250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2250 
Symbol 
ID8665532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2428040 
End bp2429476 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content74% 
IMG OID 
Productsulfatase 
Protein accessionYP_003337975 
Protein GI271963779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.739264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAGTC ATGATCCCGG CCGGCCCAAC GTCCTGCTCA TCCTGAGTGA CGACCAGGGC 
CCGTGGGCCC TGGGCTGCGC GGGCAACGAC GACATCCACA CCCCCGCCCT CGACGCCCTG
GCGGCTTCGG GCGTGCGGCT GTCACGGTTC TTCTGCGCCT CCCCGGTGTG CTCCCCGGCC
CGCGCGTCGC TGTTCACCGG GGAGATCCCG TCCCAGCACG GCGTGCACGA CTGGATCAGC
CGGGGGCACG TGGGCGACGA CGGGGCGGAC TTCCTGGCCG GGCGCAGGCT CTTCACCGAC
GACCTGCACG ACGCGGGGTA CCGGCTCGGC CTGGTCGGCA AGTGGCACCT GGGCGCCAAC
GACCGGCCCC GGCCGGGGTT CGTCCGCTGG CTGGCGCACG AGAGCGGCGG CGGGCCGTAC
CTGGACACCG TCCTCTACGA CGGGGGCGAG CGGGTCGAGG CTCCCGGATA CCTCACCGAC
GTGCTGACCG GCGAGGCGTG CGCGTTCCTG TCGCGCGAGG CGGAGCGCGA GGAGCCGTTC
TACCTGTCGC TGCACTACAC CGCGCCGCAC ACGCCGTGGA AGGGGCAGCA CCCGGAGGCG
TTCGAGGCGC TCTACGACGG CTGCGCGTTC GACTCCTGCC CGCAGGGGCC GCCGCACCCC
TGGCAGCCGG TGGGCCCGGA CGGGGCCCCG GTCGGCGGAG AGCCCGACGT GCGCGCCGCG
CTGACCGGTT ATTTCGCGGC GGTGTCGGCG ATGGACGCCG GGATCGGCCG GGTGCTGTCC
CGGCTGGCGG AGCTGGGGCT CACGGAGTCG ACGCTGGTGG TCTTCACCAG CGACAACGGC
TTCAACTGCG GCCACCACGG GATCTGGGGC AAGGGCAACG GCACCTTCCC GCAGAACGTC
TACGACAGCT CGGTCATGGT CCCGGCGATC GTGAGCCAGC CGGGGCGCGT CCCCGGGGGC
CGGGTGTGCG AGGCGCTGCT GTCGGCCTAC GACCTACGGC CCACGCTGCT GGAACATCTC
GGGCTTCCCG CCGCCGCCGG ATCGCGGCCC GGCCGGTCGT TCGCCGACAT CCTCGGAGGC
TCGCCGGGCG AGGATCGGCC GAGGGTGGTG GTGTTCGACG AGTACGGGCC GGTCCGGATG
ATCCGGACCG AAGCCTGGAA ATACGTGCAC CGGCATCCGT ACGGCCCGCA CGAGCTCTAC
GACCTGGTGA CCGATCCGGA TGAGAGGCGC AACCTGGCCG ACGCCCCCGA GGCGGCGGCC
GTGCGCGCCG GCCTGGCCGG AGAGCTGGAC GCCTGGTTCG CCCGGTACGT CGACCCGGTG
GCGGACGGCA GGGGGCTGCC CGTCAGCGGG GCCGGCCAGG CCGCTCCGCT ACGGCCGGGC
TCCGGGCTCG GCGCCTTCGA GGCCGCGCCC TTCCCGACGG TCCCCCGGTA CGGATGA
 
Protein sequence
MASHDPGRPN VLLILSDDQG PWALGCAGND DIHTPALDAL AASGVRLSRF FCASPVCSPA 
RASLFTGEIP SQHGVHDWIS RGHVGDDGAD FLAGRRLFTD DLHDAGYRLG LVGKWHLGAN
DRPRPGFVRW LAHESGGGPY LDTVLYDGGE RVEAPGYLTD VLTGEACAFL SREAEREEPF
YLSLHYTAPH TPWKGQHPEA FEALYDGCAF DSCPQGPPHP WQPVGPDGAP VGGEPDVRAA
LTGYFAAVSA MDAGIGRVLS RLAELGLTES TLVVFTSDNG FNCGHHGIWG KGNGTFPQNV
YDSSVMVPAI VSQPGRVPGG RVCEALLSAY DLRPTLLEHL GLPAAAGSRP GRSFADILGG
SPGEDRPRVV VFDEYGPVRM IRTEAWKYVH RHPYGPHELY DLVTDPDERR NLADAPEAAA
VRAGLAGELD AWFARYVDPV ADGRGLPVSG AGQAAPLRPG SGLGAFEAAP FPTVPRYG