Gene Sros_8001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8001 
Symbol 
ID8671326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8814893 
End bp8816458 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343399 
Protein GI271969203 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.212108 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAGG TTCTCGACCC GCCCGCGGTC CGGCAGTGGT CGCGGCTGGC CGCGGAGACA 
CTGGGAAAGG CGCGCGAGGA GATCGACGCG CTCAACGTCT TCCCCGTCCC CGACGGCGAC
ACCGGCACCA ACCTGCACCT GACCATGCTC TCCGCGGCCG AGGCGCTCGA CGGGCTGCCG
GGCGACGCCG ACGCCGCGAC CACCTGGCAG GCGCTCGCGC AGGGCGCGCT GCTCGGCGCC
CGGGGCAACT CCGGAGTCAT CGTCAGCCAG GCGCTGCGCG GCCTCGCCGA GGTGCTCAGG
GCGACCGAGG GGCGGGGCGC CGACCTCGGG CTCGGCCTGG TCAGGGCCGC CGAGCTGGCC
CGCGCGGCGG TGGCCAGACC GGTCGAGGGC ACGGTGCTCA GCGTGCTCAC CGCGGTCGCC
GGGGCGGTGC GCGACCTGAC GGGAGACCTC GCCTCGGTGG CCAGGAGGGC GGCCGACGAG
GCGCGCTCCG CGCTCCGCCG CACTCCGGAC CAGCTCGACG TGCTCGCCCG GAGCGGCGTG
GTGGACGCGG GCGGTGCGGG CCTGGCGATC ATCCTGGAGA GCCTCGCCGC GGTGATCACC
GACTCCTACA CCGGGCGGGT CGACATCCCG GCGCCGACCC ACCGGGTCGC CCCGGAGCCG
GAGGAGGGCC CCGGCTACGA GGTCATGTAC CTGCTCGACG CCGGCGAGGC GGCGGTCGGC
GCGCTCCGCC GCGAGCTGGA CGCGCTCGGC GACTCCCTGG TGGTCGTGGG CGGCGACGGC
CTGTGGAACG TGCACGTCCA CGTGGACGAC GCGGGCGCGG CGATCGAGGC GGCCATGCGG
GCGGGGCGGC CGCACCGGAT CAGGGTGACC TACCTGGTCG GCTCCGGCCG GACCCACCCG
GCGGCCCGGG GGCGCGGGGT GGTGGCGGTG GCGGCCGGGC CCGCGCTGGG CGCCGTGTTC
GAGCAGTCGG GAGCCGTGGT GGTCCGCAGG GAGCCCGGCT CCAGCCCGCC CCTGGCGGCG
GTGCTCGCGG CCATCCGCGA GGCGGGGGCG GAGGTCGTGG TGCTGCCCAA CGACAGCGGG
ACCCGCGAGG TCGCGGCGGC CGCCGCCGAG ATCGCCCGCG AGGAGGGCCT GATGGTCAGC
GTGCTGCCCA CCAGAGCCTC GGTGCAGGGC CTGGCGGCGC TGGCGGTCCA CGATCCGCTG
CGGCGCTTCG ACGACGACGT GGTGGCCATG ACCGAGGCCG CCGCGCACAC CCGGCACGGG
CACGTCTGGG TGGCCGACCG CGAGGTGATG ACGAGCGCGG GCCTGACCGC GCCGGGAGAC
GTCCTGGGCG TCATCGACGG CGACGCCGCG GTGATCGGCG CCGACCTCGT GGGCACCGCC
CTGGAGATCA CCCGTCGCAT GGTGTCGTCG AGCAGCGAGC TGGTGACCAT GCTCGAAGGC
GTCAACGCGC CGGAGGGGCT GGCCAGGGCC GTGCAGGACC ATCTGGCCCG GATCCGGCCC
GACGTCGAGG TCGTCCTGTA CGAAGGCGGG CAGGGCGGCT ACCCGCTGCT CATCGGCGTC
GAGTGA
 
Protein sequence
MLQVLDPPAV RQWSRLAAET LGKAREEIDA LNVFPVPDGD TGTNLHLTML SAAEALDGLP 
GDADAATTWQ ALAQGALLGA RGNSGVIVSQ ALRGLAEVLR ATEGRGADLG LGLVRAAELA
RAAVARPVEG TVLSVLTAVA GAVRDLTGDL ASVARRAADE ARSALRRTPD QLDVLARSGV
VDAGGAGLAI ILESLAAVIT DSYTGRVDIP APTHRVAPEP EEGPGYEVMY LLDAGEAAVG
ALRRELDALG DSLVVVGGDG LWNVHVHVDD AGAAIEAAMR AGRPHRIRVT YLVGSGRTHP
AARGRGVVAV AAGPALGAVF EQSGAVVVRR EPGSSPPLAA VLAAIREAGA EVVVLPNDSG
TREVAAAAAE IAREEGLMVS VLPTRASVQG LAALAVHDPL RRFDDDVVAM TEAAAHTRHG
HVWVADREVM TSAGLTAPGD VLGVIDGDAA VIGADLVGTA LEITRRMVSS SSELVTMLEG
VNAPEGLARA VQDHLARIRP DVEVVLYEGG QGGYPLLIGV E