Gene Sros_8159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8159 
Symbol 
ID8671487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8998001 
End bp8999128 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID 
Productalkaline D-peptidase 
Protein accessionYP_003343553 
Protein GI271969357 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.533807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACA ACATCCACGA GAAGATCCAG CAGATCCTGA ACCGGGCTGT GGCCGAGGAC 
GGCGTTCCCG GCATCGTCGC CGAAGTTCAC GACGCCGACG GAACATGGTT CGGCGCCGCA
GGAGTGGCCG ACCTCGCCGG CGGGCATCGG CGTCAGCCCG GGGAGCACCT GCACATCGGC
AGCTCCGGTA AGGCCTTCAC CGCCGCCACC GTGCTGGCCC TGGCAGCCGA AGGCAGGCTG
AGCCTCGAGG ACCCGGTGAA CACATGGCTG CCCGGCGTCA TGGAGACGGG CGGCTACGAC
GGCGACAAGA TCACCATCCG GCATCTGCTC AACCACACCA GCGGCCTGTT CCTCACCGGC
CTCGCACCAG AACTACAGCG CAGCATCGCC ACGCAGCCGA CCCGCATCTG GACCACCTCC
GAGCTGGTGA GGCTCGCGGT GTCCCAGCCG CCGGCCGGCG AGCCGGGCGA GCAGTTCATC
TACTCCAACG GCGGCTACTA CCTGGCCGGC GCGATCATCG AGAAGGTCAC CGGCAACACC
TACGCCGCCG AAGTCGAACG CACAGTCATC CGGCCGCTCG ACCTGACCCG CACCTACGTA
CGGCCCGCAG ACGCCACAAG CTATCTCCAC CCGCATCCCA CGGCCTACGT TGCCGGCGCC
CTCAAGGATG GCGTCGACCC GGCGACGCTC ACCGCGGAGA ACTGGGCGTC GATGATCGAC
CATGACAAGC CGCCCATCGA CGTCACCGCG CTCAACACCT CATGGGGCTG GGCGGCCGGC
GGCATCGTCT CCACCACCGA AGACCTGACC CGCTTTCTCA GGGCGATCGC GACCGGCGGT
CTGCTGCCAC CGGCTCAGCA CCACGAGATG TGGACCATGG TCACCAACGA CAGCGTCGTC
TGGTTGCCGC ACGCCCGCTA TGGCCTCGGC GTGATCGAGT TCGACAACGC GGGGATGGAC
GGCCTGACCG TGCGTGGCGT CAGCGGCACC CTCCCGGGAT CCTTCACCCT CGCGCTGAGC
ACCGACGACG GCCGGCAGAG CGTCGTCATC CACACCAACA TCGAGCCGAA GACCTTGGAC
ATCCCCATCA AGATCATCAA GGCGATGTAC GGCGTCGCCC TCGGCTGA
 
Protein sequence
MPHNIHEKIQ QILNRAVAED GVPGIVAEVH DADGTWFGAA GVADLAGGHR RQPGEHLHIG 
SSGKAFTAAT VLALAAEGRL SLEDPVNTWL PGVMETGGYD GDKITIRHLL NHTSGLFLTG
LAPELQRSIA TQPTRIWTTS ELVRLAVSQP PAGEPGEQFI YSNGGYYLAG AIIEKVTGNT
YAAEVERTVI RPLDLTRTYV RPADATSYLH PHPTAYVAGA LKDGVDPATL TAENWASMID
HDKPPIDVTA LNTSWGWAAG GIVSTTEDLT RFLRAIATGG LLPPAQHHEM WTMVTNDSVV
WLPHARYGLG VIEFDNAGMD GLTVRGVSGT LPGSFTLALS TDDGRQSVVI HTNIEPKTLD
IPIKIIKAMY GVALG