Gene Sros_9106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9106 
Symbol 
ID8672452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10052698 
End bp10054482 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344472 
Protein GI271970276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCACT TCCTCCGGCA TGCCCTGATC ACCTCCGGCG TCGCGGGTCT CGTCCTCGCC 
TCCGGTACGG CGGTGCGGGC AGCGGCCCCC GACGATCCGA ACATCCCCTC CGTCTGCGGC
GACCGCCGCC CGGCCGACCC CCAGGCCGGC ACGTACGGCA TGCAGCTCGA CGGCACCTTC
CGGCACCCCT CCCTGACCCC GGCCGGCGAG GAGCGCTTCT CCTTCCCCGG CGGCGCCCAG
GACCTGTTCG GGAAGACCAA GGGCTACGAC CCGAAGTCCT ACCTAGAGGG TCCGATCAAG
CTGGAGGCCG CCTACCGGGG CGCCGAGTTC ACCCCGGACT ACTTCCACAG CTGGCAGAAC
ATCGTCGACT TCGACGGCCG CCGCTACCTG TTCCAGTACG ACCGGAGCGA GGGCCGCGTC
TACGACGTGA CCGACGTCAA GAAGGTCAAG ATCGTCGAGT CGCTCAGCCG CAACGACATC
AAGGCCGAGT ACGGCGAGAA CGTCGAGCTC AAGGACGGCA AGGACTGGAA GGCCCACGAC
TACTGGGGCG CCTCCACCAT CCAGTGGAAC GCCAGGCTCG ACGCCTACGT CATGGTCCAG
AGCTTCGAGC AGAAGCGCCA GGTGGGCGAG CTGGGCGACT CGGTGGAGCA CTCCAAGTTC
CGCAACGCCG AGGGTGTCAG GAAGCTCCGT GAGACCACGA GCTTCAAGGG CTTCAAGGTC
TACCAGCTCA ACGGCCCGCG CAAGAAGGAC TGGAAGCTGC TGGCCACCGT CACCACCGAC
GGCTCGGCGG ACAACCCGCT GAACGCGCCG GTCGACGGGC CGCAGCAGGG CTCGGGCTCG
CTGGACGTGC CGTACTGGAC CGGCGGCAAG TACATGTTCG TCGCCGCCGC GCCGCTGGAC
ACCTGGAGCA ACACCGAGGT GCCCACCTAC CTGTACTCGG CCGGCTACCA GGCCTACGAC
ATGTCCGACC CGGCCAAGCC GAAGAAGATC GGTGAGTGGC ACAGGAAGGG CCAGCTCGCC
GGCGAGTCCG CCGACTACGC CAAGAACCCG CGCTGCGGCA ACCAGACCTC CTGGATGGGC
GCGCGCATGC CGCTGTTCAT CCCCAAGCCG GTCGAGGAGG GCGGCAAGTA CGGCTTCGCG
GCCCTGGGCG GCTACGGCCT GTCGGTGCTG GACATCTCCA ACCCCGCCAA GATGAACGAG
GTCGCCCACC TGGACCTGCC CATGTCGGTC GGCGGCACCG AGGCGGACAA CGTGGACGTC
TCCCAGTTCG AGAAGACCGG CATGATCTAC GTCTCCGGCT ACCCGCTGGG CGAGGACTGC
CACGAGCCCT ACAAGGACAT CTTCCAGATC GACGTCCGCA ACCCGGCCAA GCCCCGCATC
GTCGGCGCCC TGCCCCGGCC CGAGCCCGCC GCGGCGGCCC CGTTCGGCGA CTACTGCCAG
CGCGGCGGCA GCTTCGGCCC CAAGCGCTCG GGCTACTACA CCTCACCCGG GGAGCCCAAG
CAGGGCCTGC TGCCGTACGC GTTCTACAAC GCGGGCGTGC AGTTCTTCGA CACCCGCGAC
CCGAAGCACC CGAAGATCGT CGCGCAGTTC GTCCCGGCGG GCTTCGCCAA GGGCGTGCCC
GACTACGCCC TGGGCAACCA GACCCACGGC ACCTACGTCG AGTGGGACCG CAAGATCGCC
TGGGTCTTCA CCAACGACGG CATCTACGCG ATCTCCTCGA AGAAGCTGCT CGGCACCCCG
AACCTGGGCA AGCCGGCCAA GCCGTTCCGC ACCAGCGCCC GGTGA
 
Protein sequence
MRHFLRHALI TSGVAGLVLA SGTAVRAAAP DDPNIPSVCG DRRPADPQAG TYGMQLDGTF 
RHPSLTPAGE ERFSFPGGAQ DLFGKTKGYD PKSYLEGPIK LEAAYRGAEF TPDYFHSWQN
IVDFDGRRYL FQYDRSEGRV YDVTDVKKVK IVESLSRNDI KAEYGENVEL KDGKDWKAHD
YWGASTIQWN ARLDAYVMVQ SFEQKRQVGE LGDSVEHSKF RNAEGVRKLR ETTSFKGFKV
YQLNGPRKKD WKLLATVTTD GSADNPLNAP VDGPQQGSGS LDVPYWTGGK YMFVAAAPLD
TWSNTEVPTY LYSAGYQAYD MSDPAKPKKI GEWHRKGQLA GESADYAKNP RCGNQTSWMG
ARMPLFIPKP VEEGGKYGFA ALGGYGLSVL DISNPAKMNE VAHLDLPMSV GGTEADNVDV
SQFEKTGMIY VSGYPLGEDC HEPYKDIFQI DVRNPAKPRI VGALPRPEPA AAAPFGDYCQ
RGGSFGPKRS GYYTSPGEPK QGLLPYAFYN AGVQFFDTRD PKHPKIVAQF VPAGFAKGVP
DYALGNQTHG TYVEWDRKIA WVFTNDGIYA ISSKKLLGTP NLGKPAKPFR TSAR