Gene Sros_8990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8990 
Symbol 
ID8672332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9939465 
End bp9941183 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content76% 
IMG OID 
ProductSL44-1; basic proline-rich protein 
Protein accessionYP_003344364 
Protein GI271970168 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.297465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAACAC CTGAGGATCC CAACGAGCCC GGCTCGCGGC GCGAGCCGCG GGACCGGCGA 
TCCGAGCGGG AGGAGGCGGG CCCGGAGCAG TCCTCGGGCG CCTCCGAAGA GGCGGCCGTT
CCGAGCGGAC CGGACGATTC CGACTCCACG GTCTTCCGCC CGGAGGACCC TTCCACGTGG
TCGGCGCCCG ATCCCGCCGA CCCTCTGCGC TCGTCGTCCG AGCCCACCCC TCCGACCGGC
CGGTCCGATC AGCCCCCCGA GCCCCCTTCC GAGCCCTCCG CGAGGCCCGC TGAGCCCGCG
CCCCCGACCG GGTGGCCGTC CGAGCCCACG CCCCCTTCCG AGCCCTCCGC GAGGCCCGCT
GAGCCCGCGC CCCCTTCCGA GCCCTCCGAG CCCCCTTCCG AGCCAGAGGA GCCCGAGAAG
CCCACGCCGG GGTCGCCCGA GCCACGCGGT CCCGGTGAGA CCCCGGGCTG GCCGCCGATC
TCCACGCCCG AGGAGCCGTC GCTGCCGGGG CATCCGGATG TGCCCACCGC GCCGGAGGTC
CCCAGGCCGG TCCAGGCGGA TCCGGACACC ACCAGCGTTT TCGAGTCCCC GGGCACTCCG
GCCGCCGAGG CCCCGCCCCC CTATCCGGTC CCGGGCGGCA CGCCTGCCTA TCCGGGCTGG
GAGAGCGCCT CGGAGACACC GGCCTCAGGC GGCCGGGACG ACAGGGACGG CATGGATGTG
CCGCCCGGCG CCGCCCAGCC CTCGCGCTAC GATCCGCCGA CGTCCCCCGA GGGGTTCCCC
GCGGCCTCGC ACCGGCAGCC CGAGCAGCCC GAGCAGCCCG AGCAGGCCGG GCTCCCCGAG
CAGCCCGGTC GGTTTGAGCA GGCCGGGCCC TCCGAGCAGC CCGGTCGGTT TGAGCAGGCC
GGGCCCTCCG AGCAGGACAG GCCGTCCGAG CAGGCCGGTC GGCCTGAGCA GGCCGGGCCA
CCCGAGCAGC CTGGCGGAGG CTTCCAGGCG GAACCGCCGC CGGGACCGGG CGGTCCGTAC
GGTGGCCCCT CGGCTCCGTA CGGCGGCCCC GCGGCCCCGT ACGGCGGCCC TTCGGCTCCG
TACGGCGAGG CCGCCCCCGG TGCCCACCGC GGGCCGGGCG ACGAGGCTCC GACGGAGAAC
ATCTCCGGCG CGGGCCCCTA CGGAACCCCT CCCTACACCG GTGCCCACGC GAGCCGGCCG
GATGAGCCGC CGCGGCAGCC GCCGTACGCC CCACCGGCCG GCGGTCCGTC CTATCCGTCC
TATCCGTCCG AGGGCCCGCC GCAGGGCGGC CTGTCCTACC CGTCGGGCCA GCCGTACCAG
GCGGCGCCTC CGGGAGGCGC CTATCCGGGA GGTCCCGGCT ATCCGGGCGG CACGCCGTAC
CCGGCCTATC CGGGTGACGA CCGGTCGCGC CAGCAGGGCG GCGGACTCGG CACCACCGCG
CTCGTACTCG GCATCGTGAG CCTTTTCCTG CTCGTCGTGT GCGGCCTGGG GGCGCTGACG
GCCATCATCG GCCTGATCAT CGGCATCGCC GCGGTCGTCA AGAACTCCAA CCGCGGCCGC
GCCTGGGTGG GCATCGCCCT GAGCGTCCTG ACACTGATCA TCGCCGTGGT GGTGCTCAGC
TGGTTCTACA GCAAGGTGGG CGACTGCCTG AACCTGCCGC CCGAGTTCCA GCAGCGCTGC
ATCCAGGAGA AGTTCGGCGG GCAGTTCACC ACGTCGTGA
 
Protein sequence
MTTPEDPNEP GSRREPRDRR SEREEAGPEQ SSGASEEAAV PSGPDDSDST VFRPEDPSTW 
SAPDPADPLR SSSEPTPPTG RSDQPPEPPS EPSARPAEPA PPTGWPSEPT PPSEPSARPA
EPAPPSEPSE PPSEPEEPEK PTPGSPEPRG PGETPGWPPI STPEEPSLPG HPDVPTAPEV
PRPVQADPDT TSVFESPGTP AAEAPPPYPV PGGTPAYPGW ESASETPASG GRDDRDGMDV
PPGAAQPSRY DPPTSPEGFP AASHRQPEQP EQPEQAGLPE QPGRFEQAGP SEQPGRFEQA
GPSEQDRPSE QAGRPEQAGP PEQPGGGFQA EPPPGPGGPY GGPSAPYGGP AAPYGGPSAP
YGEAAPGAHR GPGDEAPTEN ISGAGPYGTP PYTGAHASRP DEPPRQPPYA PPAGGPSYPS
YPSEGPPQGG LSYPSGQPYQ AAPPGGAYPG GPGYPGGTPY PAYPGDDRSR QQGGGLGTTA
LVLGIVSLFL LVVCGLGALT AIIGLIIGIA AVVKNSNRGR AWVGIALSVL TLIIAVVVLS
WFYSKVGDCL NLPPEFQQRC IQEKFGGQFT TS