Gene Sros_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0115 
Symbol 
ID8663379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp116463 
End bp117788 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content71% 
IMG OID 
ProductPhosphoprotein phosphatase 
Protein accessionYP_003335913 
Protein GI271961717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.11266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CACTCCGTTA CGCCGCCCGC TCTGACGTCG GCCTCCTCCG CGAAGGTAAC 
GAGGACTCGG CGTACGCCAG CGGTCGTCTG CTCGCCGTCG CCGACGGTAT GGGCGGCCAC
GCACACGGTG AGGTGGCCAG CTCGGTCGCC ATCGCCGCGA TGTCCTCCCT CGACGAGGAC
CCGCAGGGGG GTGACCTGCT CAGCGCCGTC GAGGCGGCGG TCAGAGACGC CAACCGCAGG
CTCCACGAGA TGGTGGGACG GGACCCGAGC CTCAAGGGCA TGGGCACCAC CCTGACCGCC
ATGCTGTGGT CGGGCACGAG GGTCGCGCTG GTCCACGTCG GCGACTCCCG CGCCTATCTG
CTGCGCGCCG GGGAGCTCTA CCAGATCACG CACGACCACA CCCTGGTGCA GTCCCTGGTG
GACGACGGCC GGATCACCCT GGAGGAGGCC GCCACCCACC CGCAGCGGTC GATCCTGCTG
CGCGCCCTCG ACGGCAGCGG CGAGGTCGAC CCCGACCTGT CGCTGCGCGA GGCCCAGGTC
GGCGACCGCT ACCTGCTCTG CTCCGACGGG CTGTCCGGCG TGGTGAGCGC GGAGACGATG
CACCACACGC TCTCCACGAT CGAGGACCCC GAGACGGTGG TCCGCACGCT CATCGACCTG
GCCAACCGCG GCGGCGGCCC CGACAACATC ACCTGCGTGC TCGCCGACGT CCTGGAGGTG
GACGAGGGTC TCGCCCTCCC CGTCGAGGCC GCCGTGGTGG GCGCCGCCGG GTCCACCCGG
CCGCGGACCC AGCTCCCGGA CACCCCGGCG GGCCACGCCG CGGGGATCAC CATGCCCCAG
CCCGTCATCA CGGACGACGA TCTCGAGGAG CCGGTCGCCA GGGCCACGGG GCGGCCGGCC
AGGCGCCGTC GACTGTGGCC GCTCATGGCC TCGGTGGGAG GCGTCGTCCT GGTCGGCGGC
GGCCTAGGGT GGTACTTCGG GAGCCAGTGG CTCGACGACC AGTACTTCGT AGGGGTGAAA
GGGGATGAGA TCGTGGTTTT CCAGGGCGTG AAGACCAACC TCGGCCCCAT CGAGCTCTTC
GACGTCGCCC GGAGCACCAC CGAGTCGGTC ACGGCCCTTG GCGCGTTCCA GCAGGGCCAG
GTCCGCGACG GCATCCCCGT CGCCAGCGTC GACGAGGGCC TGAAGAAGAT CGAGGAGCTC
AAGACGTCCG CGGCGAAGCC CGCGACGAAA CCGACGGCGA AGCCCGAGTC CAAGCCTGAC
GGGAAGGGCA AGCAGACCTC CCAGCCGAGC GGCACCGCAT CCCCGGAACC CACAAGGTCG
CAGTAG
 
Protein sequence
MTIALRYAAR SDVGLLREGN EDSAYASGRL LAVADGMGGH AHGEVASSVA IAAMSSLDED 
PQGGDLLSAV EAAVRDANRR LHEMVGRDPS LKGMGTTLTA MLWSGTRVAL VHVGDSRAYL
LRAGELYQIT HDHTLVQSLV DDGRITLEEA ATHPQRSILL RALDGSGEVD PDLSLREAQV
GDRYLLCSDG LSGVVSAETM HHTLSTIEDP ETVVRTLIDL ANRGGGPDNI TCVLADVLEV
DEGLALPVEA AVVGAAGSTR PRTQLPDTPA GHAAGITMPQ PVITDDDLEE PVARATGRPA
RRRRLWPLMA SVGGVVLVGG GLGWYFGSQW LDDQYFVGVK GDEIVVFQGV KTNLGPIELF
DVARSTTESV TALGAFQQGQ VRDGIPVASV DEGLKKIEEL KTSAAKPATK PTAKPESKPD
GKGKQTSQPS GTASPEPTRS Q