Gene Sros_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0320 
Symbol 
ID8663588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp306308 
End bp309514 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003336095 
Protein GI271961899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGGCA CTGAGTACGA GGCGGACTTC TTCGTCAGCC ACGCCGACGC CGACGCGCAG 
TGGGCGGAGT GGATCGCGGC GGAGCTGAAG GGCGCCGGCT ACGGAGTGAT CGTCAAGGCG
TGGGACTTCC TGCCCGGGGA GAACCTCCTC GACAGGCTCG ACAGGGCGCT GGCCACCTGC
CGCCACACCA TCGGCGTGCT CTCGCCGGAC TACGTGGCCT CGGAGATGGC CGCCCGTACC
GCCGCCCACT ACCAGGGGCT GGAGGGCAAG GAACGGGCGC TGATCCCCGT CAAGGTGGCC
GATCACCAGG TCCCGCCGTC GATGGGGCCG ATCATCTCGA TCGACCTGTG CGACGTCGGC
GAGGAGGACG AAGCCCGCAG CCGCCTGCTG AACGGGGTGG CCGGCCGCGT CGCGCGCGTC
GCCCGCGGGG GTTTCCCGAA CGCCCCCGCC AACCGGACCC GGTTTCCCGG CGCCGCCCAG
GAGGTGTGGG AGCTCCGCGG GCACCGCCCG GATCCGCACT TCGTGGGACG GGACGACGCG
CTGGCCGGCC TGCACCGTGC CTTCCGCGCC GGGCGCGCGA CCTCGGCCGT CCAGGCGATC
ACCGGCCTGG GCGGCCTGGG CAAGACCCAG CTGGCGGTCG AGTACGCCTC CCGTCACGCC
GCCGCCTATG ACATGGTGTG GTGGATCCGC GCGGAGGACC CCGCGACCCT GCGGGGTGAC
TACGCCGAGC TCGCCACGGT GCTGGGCCTG CCGTTCGACC AGGACGGCCA GGCGGTGGCG
GCGCTCCGCC AGGAGCTGCG CCGGCGTAAG GACTGGCTGC TGGTCTTCGA CAACGCGGAG
GACCCCGGCG AGGTCTTCCC GCTCCTCCCC GACCGGCACT CCGGGCACGT GCTGATCACC
TCTCGTCTCC GGGAATGGCA GCATGCCGAA TCCCGGCACA TCGAGGTCCT CCCGCTGCCG
GCCGCCGTGG AGTATCTGCG GCGGCGGGGG CAGGTGACCG ACGCCGGCAC GGCGCGGGAG
CTCGCGGAGG CCCTGGGCCG CCTGCCTCTG GCGCTGACCC AGGCCGCCGG CGTCATCGCC
GACGGCATGC GCGCGACCGA CTACCTCGGC CTGCTCCGGA GGCAGTCACC CGAACTGTTC
GTGCAGGGCC GTGCCGGGGC GCACGACACG ACGATCGCCT CCACGTGGCG GGTGTCCTTC
GACCGGCTCG CCGACCGGTC CCCGGCCGCC GTGGCCCTGT TCCGCCTCGC CGCGTTCCTC
GGGGCCGAGG CGATCCCGCT GGACCGGCTC ACCCCGGTGC CGGACATGCC CGCCGAGCTC
GCCGAGGCCC TGAACGACCC GTTCCGGCGC CGCGACGCGA CCCGGGCCCT CGGTGAGTAC
TCCCTGGCCG AGACCGGCGA CGGTCTCCTG TCGATCCACC GCATGGTCCA GACGGTCACC
CGCACCGAAC TGGCCGGCGA CGAGCCCTTC TGGGCCGGGC TCGCGCTCGC CGTGACCACC
GCGGCGTTCC CCCGCGACGT CCGTGATCCG CGATCCTGGC CCGCCTGCGA AGCGGCGCTC
GCGCACGCCA TCGCCGCCGC GGAGCACGCC GGGCGGCTCC ACGTCGACAC CGGGGGGACC
GTCGACCTGC TGAATCAGGT CGCGCTCTAC CTCCTGGCAC GGGGGAGGAC GGACCGGGCC
GCGACCGCCG TCGAGAACGC GCTGACCCTG GCCGCGCGGC TTCCGCGGGA CGCGCCGGAG
TGTCTCCGCT GCCGCAACAC GCACGGGCTG CTGCTGCTGG CCCAGGGCGA CCGCGCGGCG
GCGTGCCAAG CCCACGAAGA GGTGTACGAG GCCAGGATCC GGATCCTGGG ACCCGACGAC
GTCGACACCC TGCGGGCCGG CCGGGACCTG GTCGAGGCGC TCTATTTACA GGGGCAATGG
GCGCGGGCCA CCCGGTTGCA GGACCGGCTG GTCCAGGCGT TCACGGCGGT CCTCGGCACC
GACGACCTCG AAACGGTCAC CTCCGTGGCC TACCAGGCGA CCCTCCTGCG CAACGCCGGG
CAGTACCAAC GGGCCCGCAC TCTTGAGGAA GGGGTGCTGG AGGTCCGCCG GCAGCGACTG
GGCGAGGAGC ACCCTGACAC CCTCAATGCG ATGGCCAACC TCGGCGCGAC CCTGCACGCC
CAGGGGAAAT GGGAGAAGGC TCGCACCCTC GTGGAGGGGG TGCTGGAGGT GCGTCGGTGG
CTGCTGGGTG AGGAGCATCC CGACACTCTC GACGCAATGG CCAACCTCGG CTCGATCCTC
CATTCGCAGG GTGATCTGGA CGGAGCCCGC GCTCTCGTGG AGCGGGAGTT GGAGGTGTGT
CGGCGGCTGT TGGGTGAGGA GCACCCTCAC ACCCTTGGCG CGATGGCCAA CCTCGGCTCG
ATCCTCCATG TGCAGGGTGA TCTGGACGGA GCCCGAGCCC TCAAAGAGGG GGTGTTGGAG
GTGCGTCGGT GGTTGCTGGG TGAGGAGCAC CCCGACACCC TCACCGCGAT GGCCAACCTC
GGCGCGACGC TCCATGCGCA GGGTGATCTG GACGAGGCCC GAGCCCTCGA GGAGGGGGTG
TTGGAGGTGC GTCGGTGGTT GCTGGGTGAG GAGCACCCCG ACACCCTCAC CGCGATGGCC
AACCTCGGCG CGACGCTCCA TGCGCAGGGT GATCTGGACG AGGCCCGAGC CCTCGAGGAG
GGGGTGTTGG AGGTGCGTCG GCGGTTGCTG GGTGAGGAGC ACCCCCACAC CCTCACCGCG
ATGGCCAACC TCGGCGCGAC GCTCCATGCG CAGGGTGATC TGGACGGAGC CCGCACCTTC
AAGGAGCGGG TGTTGGAGGG GCGTCGGCGG TTGCTGGGTG AGGAGCACCC CCACACCCTC
ACCGCGATGG CCAACCTCGG CGCGACGCTC CATGCGCAGG GCGAGATGGC GGAAGCCCGC
TCGCTGCTCT TGGAAGCCCT CACCGTCAGC CAGCGGGCCT TCGGCAAGAA GAACACCGTC
ACCTCTGAGA TCGCGTGGCG GATGGTGTCG ACCTACGACA GGCCTCATGA AACAGCCAGG
AGGAAAAACC TCATATTGGA GAATTTGTCC TGGCTCGCCA AGGAGCCGCC CAGCCGGCTG
ACCGGCCAGC AGAAGAGCAT CAAGGACCGT GTCAAAGGCC TCTTCGGTGG CAGGTCCGCC
AAGCGCCAGG GGAAGCGCGG CAAGTGA
 
Protein sequence
MPGTEYEADF FVSHADADAQ WAEWIAAELK GAGYGVIVKA WDFLPGENLL DRLDRALATC 
RHTIGVLSPD YVASEMAART AAHYQGLEGK ERALIPVKVA DHQVPPSMGP IISIDLCDVG
EEDEARSRLL NGVAGRVARV ARGGFPNAPA NRTRFPGAAQ EVWELRGHRP DPHFVGRDDA
LAGLHRAFRA GRATSAVQAI TGLGGLGKTQ LAVEYASRHA AAYDMVWWIR AEDPATLRGD
YAELATVLGL PFDQDGQAVA ALRQELRRRK DWLLVFDNAE DPGEVFPLLP DRHSGHVLIT
SRLREWQHAE SRHIEVLPLP AAVEYLRRRG QVTDAGTARE LAEALGRLPL ALTQAAGVIA
DGMRATDYLG LLRRQSPELF VQGRAGAHDT TIASTWRVSF DRLADRSPAA VALFRLAAFL
GAEAIPLDRL TPVPDMPAEL AEALNDPFRR RDATRALGEY SLAETGDGLL SIHRMVQTVT
RTELAGDEPF WAGLALAVTT AAFPRDVRDP RSWPACEAAL AHAIAAAEHA GRLHVDTGGT
VDLLNQVALY LLARGRTDRA ATAVENALTL AARLPRDAPE CLRCRNTHGL LLLAQGDRAA
ACQAHEEVYE ARIRILGPDD VDTLRAGRDL VEALYLQGQW ARATRLQDRL VQAFTAVLGT
DDLETVTSVA YQATLLRNAG QYQRARTLEE GVLEVRRQRL GEEHPDTLNA MANLGATLHA
QGKWEKARTL VEGVLEVRRW LLGEEHPDTL DAMANLGSIL HSQGDLDGAR ALVERELEVC
RRLLGEEHPH TLGAMANLGS ILHVQGDLDG ARALKEGVLE VRRWLLGEEH PDTLTAMANL
GATLHAQGDL DEARALEEGV LEVRRWLLGE EHPDTLTAMA NLGATLHAQG DLDEARALEE
GVLEVRRRLL GEEHPHTLTA MANLGATLHA QGDLDGARTF KERVLEGRRR LLGEEHPHTL
TAMANLGATL HAQGEMAEAR SLLLEALTVS QRAFGKKNTV TSEIAWRMVS TYDRPHETAR
RKNLILENLS WLAKEPPSRL TGQQKSIKDR VKGLFGGRSA KRQGKRGK