Gene Sros_1739 details


Gene Information

Locus tag: Sros_1739
Symbol:
ID: 8665016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1854516
End bp: 1856156
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: Sulfite oxidase-like protein
Protein accession: YP_003337473
Protein GI: 271963277
COG category:
COG ID:
TIGRFAM ID:
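
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1854516
end_bp = 1856156

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1641 bp
print(protein_length, "aa")   # 546 aa
```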


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.316961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCGT GGGCGGCGGC GCTGATCGGC CTGGTGTCCG GTGCGGTGGC GGTGGGGGTT 
TCCCTGCTGG CCGCGGGCCT GGTGAAAGCC TCGGCCTTCC CGGTTGTCGC GGTCGGCAAC
GCCGCCGTCG ACCTCACCCC GGCCGCGCTG AAGGACTTCG CCATCCGCAC CTTCGGCGAG
AACGACAAGA TGGTCCTGCT GACGGGCATC TTCCTGGTCC TCGCCGCGAT CGCCGCCGCC
GTCGGAGTCC TGGCGGTCCG GGACCTCCGG TACGGCCTGG CGGGCCTGGC CGCCTTCGGC
GTCGTCGGCG TCCTGGCCGT CCTGACCCGC CCCGGCGCCG CGGTCGTGGA CGTCGTCCCC
ACGGTGGCGG GCGTCGCCGC CGCCATGTTC GCCCTGCACC GCCTCACCGC CCGCGCCCTG
GCCCCGCCGG CCGGCCCGCG CGAGGCGGGT CCGCCGGCCG GCCCGCACGG CACCGGCCCG
AGCGGCGGGG AACCCGCCGC CGGAGCTGGA GGGGAGGAGC GGTACGGCGC GCCGGTCCCG
CCGGTCATGC GGGCGGGGAA CGGCCCCTAC CCGTTCGACC GGCGCAGGCT GCTGATCGGG
ACGCTGGGTG GAGTCGCCGT CGCCGGAGCG GCCGGCGTGG CCGGGCGGAT GCTGTCGGGC
CGGGCGGAGG TGGCCGCGGC CCGGGTCGGC ATGGCACTGC CCCGCCCCGC CGTCCCCGCC
GCGCCGCTCC CGGCCGGCGC AGACCTGAAG ATCAGAGGGC TGTCGCCGTT CGTCACCCCG
AACCACGACT TCTACCGGGT GGACACCGCC CTCGTGCTGC CCCAGGTGGA CCCCCGCGAC
TGGACCCTGC GGATCCACGG CATGGTGGAC AGGCCCGTCG AGCTGACCTT CGCCGACCTG
ATGAAACGCC CCCTGGAGGA GGCCGACATC ACGCTGTGCT GCGTCTCCAA CGAGGTCGGC
GGCCCGTACA TCGGCAACGC CCGCTGGCTG GGCACCAGCC TGGCGGGCGT CCTGCGCGAC
GCGGGGGTGC GGAAGGGGGC CGACATGCTG CTCAGCACCT CCGCCGACGG CTGGACCTGC
GGCACCCCGG TGGACGTCGT CCTCGACGGC CGCGACGCGC TGCTGGCCTT CGGGATGAAC
GGCGAGGCGC TCCCGGTCGC GCACGGCTTC CCGGTCCGCC AGGTGGTCCC CGGCCTCTAC
GGCTACGTCT CGGCGACCAA GTGGGTGACG GAGATCAAGG TCACCAGGTT CGACCGGGAC
GAGGCCTACT GGACGCCCAA GGGGTGGTCG GCCAGGGGGC CGGTCAAGAC GCAGTCGCGC
ATCGACCTGC CGAGGGACGG CGCCCGCGTC GCGCCGGGCC GTACGGTGAT CGCGGGAGTC
GCCTGGGCGC AGCACAAGGG GGTGGACGCC GTCGAGGTGC GGATCGACCG GGGGCAGTGG
CGCCAGGCGC GCCTGGCCGT GGCGCCGACC GCCGACACCT GGCGCCAGTG GGTGGTCGAC
GACTGGGACG CCACCCCCGG CAGCCACACC ATCGAGGTGC GGGCCACCGA CGCCACCGGC
TACACCCAGA CCCCCGACCT CGCCCCGGTG GCCCCCGACG GGGCCACCGG CTGGCACAGC
GTCAGCGTCG ACGTCGCCTG A
 
Protein sequence
MPPWAAALIG LVSGAVAVGV SLLAAGLVKA SAFPVVAVGN AAVDLTPAAL KDFAIRTFGE 
NDKMVLLTGI FLVLAAIAAA VGVLAVRDLR YGLAGLAAFG VVGVLAVLTR PGAAVVDVVP
TVAGVAAAMF ALHRLTARAL APPAGPREAG PPAGPHGTGP SGGEPAAGAG GEERYGAPVP
PVMRAGNGPY PFDRRRLLIG TLGGVAVAGA AGVAGRMLSG RAEVAAARVG MALPRPAVPA
APLPAGADLK IRGLSPFVTP NHDFYRVDTA LVLPQVDPRD WTLRIHGMVD RPVELTFADL
MKRPLEEADI TLCCVSNEVG GPYIGNARWL GTSLAGVLRD AGVRKGADML LSTSADGWTC
GTPVDVVLDG RDALLAFGMN GEALPVAHGF PVRQVVPGLY GYVSATKWVT EIKVTRFDRD
EAYWTPKGWS ARGPVKTQSR IDLPRDGARV APGRTVIAGV AWAQHKGVDA VEVRIDRGQW
RQARLAVAPT ADTWRQWVVD DWDATPGSHT IEVRATDATG YTQTPDLAPV APDGATGWHS
VSVDVA
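
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard amino acid assignments, which, as far as I know, translation table 11 shares with the standard code (the two differ in permitted start codons). The `translate` helper and the two excerpts are illustrative; the excerpts are the first 30 and last 21 nucleotides of the gene sequence above.

```python
from itertools import product

# Codon table in TCAG order. Assumption: table 11 uses the same amino acid
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGCCCCCGT GGGCGGCGGC GCTGATCGGC"
last_21 = "GTCAGCGTCG ACGTCGCCTG A"

print(translate(first_30))  # MPPWAAALIG
print(translate(last_21))   # VSVDVA
```

Both results match the start and end of the protein sequence listed above.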