Gene Sros_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1039 
Symbol 
ID8664313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1054911 
End bp1056074 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content70% 
IMG OID 
Productputative oxidoreductase 
Protein accessionYP_003336782 
Protein GI271962586 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0582367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0597058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACA CCAACGTCCT CATCTCCGGC GCCAGCATCG GCGGCCCCGC CCTCGCCTAC 
TGGCTCGACC GCTACGGCTT CAACGTCACC GTCGTCGAGA AGGCCCCGGC CCTGCGCGCG
GGCGGCCAGG CCGTCGACTT CAAGGGCGAG ACCCACCTCA CCGTGCTGAG GCGGATGGGC
ATCCTGGAGG ATGTGCGGCG GCTGCAGACC GGCGGGACCG ACCAGGAGAT CGTCGACGCC
GACGGCCGCA GGCTGGCGGT CATCCCCGGC GAGTTCACCG GCGGTGAGAT CGAGATCAAG
CGCGGCGACC TGTCCAGGCT GCTGTACGAC AGGACGGCCG CCGGCTGCGA ATACGTCTTC
GGTGACTCCA TCACGTCCCT GACCGAGACC GCGGACGGCG TGCATGTCAC CTTCGAGCGT
GCCGCGCCGC GCACGTTCGA CCTGGTGGTG GGCGCGGACG GGATCCACTC CAACGTGCGC
CGCCTGGCCT TCGGCCCGGA GGCCGACCAC GTCGCGTTCC TCGGCCACTA CTACGCCCTG
GCCGAGCTGC CGGGAGACTT CGGGCCCGTG CCGAAGATGT ACAACGAGCC CGGCAGGATG
GTCGCCGTCG GCGGGCCGAA GGCGCCCGCG TTCTTCGTCT TCGCCTCCGA GCAGCTCGAC
TACGACCGCT ACGACGTCGA GCAGCACAAG CGGATCGTGG CCGAGGCCTA CGCGGGCATG
GGCTGGCGGG GCCCCGCGAT CGTCGACGCG GTACGGCGGG CGGACGATCT CTACCTCGAC
TCGATCAGCC AGGTCAGGAT CGACCACTAC GCCAGGGGCC GGGTGGTGCT GCTCGGCGAC
GCCGCCTACG GCAACACCCT GGGCGGCTTC GGCACCGGTC TGGCCGTGGT CGGCGCCTAC
GTCCTGGCGG GGGAGCTGGC CGCGGCCGGT GGTGATCACC GCCTGGCCTT CGAGCGGTAC
GAGGAGGAGT TCCGCGGATA CGCCAAGGTG GCCCGGAGCG GCAACGCCGG CCCCTTCCTC
GCCCCCGGCA GCCCGGCCAG GATCCGCATG CGCAACTGGA CGTTCAAGTA CGGCTTCCTG
CTGGGCCTGA TGCTGAAGAT GACCGACATG TTCGCCACCC GCATCACCCT GAAGGACTAC
GAGCCCGTAC GGCAGGCCCG CTGA
 
Protein sequence
MKNTNVLISG ASIGGPALAY WLDRYGFNVT VVEKAPALRA GGQAVDFKGE THLTVLRRMG 
ILEDVRRLQT GGTDQEIVDA DGRRLAVIPG EFTGGEIEIK RGDLSRLLYD RTAAGCEYVF
GDSITSLTET ADGVHVTFER AAPRTFDLVV GADGIHSNVR RLAFGPEADH VAFLGHYYAL
AELPGDFGPV PKMYNEPGRM VAVGGPKAPA FFVFASEQLD YDRYDVEQHK RIVAEAYAGM
GWRGPAIVDA VRRADDLYLD SISQVRIDHY ARGRVVLLGD AAYGNTLGGF GTGLAVVGAY
VLAGELAAAG GDHRLAFERY EEEFRGYAKV ARSGNAGPFL APGSPARIRM RNWTFKYGFL
LGLMLKMTDM FATRITLKDY EPVRQAR