Gene Sros_4863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4863 
Symbol 
ID8668157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5385881 
End bp5387299 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content72% 
IMG OID 
ProductPeriplasmic protease-like protein 
Protein accessionYP_003340424 
Protein GI271966228 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0643928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA TCAGGATCGC GGCCGCCGGG CTCGCCCTCG CGGCGGCGGC GGCCTGCACG 
GCACCGGCGC CGCCGCGCAG CCAGGCGAGC ACGGCGGCCG GTACGGTCTG CGCAGCGCCC
CGGGGAGTTC CCGGGGCGGA GACGGCCACC ACGATCGACG TGATCGAACA GGCCTACTTC
TGCCTCCTCG GCAACTACTA CAGCGGCGCC ACGCTGGACG CCCGCTCGCT GCTGAGCGCC
GGATTCGTCG CCCTGACCCA AGAGCTCAAC CGCAACGGCC GCGACGTGCC CGAGGCGACC
ATGCCCGCGC TGACCGGCGA CCGCAAGACC GACTGGACCG CCTTCGAGGC CGCCTACCGC
GAGACCACCG ATCAGGTCCC CGACCTCCGC GACAAGCTCG CCGTCGTCAC CCTGGAGGCC
ATCGTGGCCA GCCTCGGCGA CAACCACGCC CGCTGGGCGC ACGACGTCAA GCGGCCGCCC
GACTACTACG ACGGCGACGG CTACGGCCTG GGTTTCCAGG CGAACGTCAA TGGCCCGCAG
GTGGACGGCA ACCCCGGCGT CGCCCTCCCC CCGCTGTTCG TCACCACCGT GCAGGGCGGC
GCGGCGCAAG CGGCCGGGCT GCGCCCGGGC GACATCATCG AATCGGTCAA CGGATCGGCG
CCCTTCATCG ACGGGAAGGC CACTCCCGCG ATCGCCGCCC TCTACCCGGG GTACCCGGAG
GCGCGCCCGG TCCGATTGCG GCTCCTGCGG CAGAGCACCG GCCGCCGCTG GAGCGTGACG
CTCAAGCCCG GCCTCTACCA GCGGGATCTG GCCGCCCTGC AAGTGGTGAC CTCGAAGCTG
CTGGACGACG ACATCGCCTA TGTACGGCTG CGCGGGTTCG CTCCCGACTC CGCGGACAGG
GTCTTCAAGG CGATCTCCAG ACTGCGCGCC GGCCGGACCC TGTCCGGCGT CGTGCTGGAC
CTGCGCGGCA ACGGCGGCGG CAGCCCCGTG GAGGCGACCC GGCTGGTAAG CGCGTTCGGC
CACGGCAAGG TCACCGCCTA CCAGTGCACC GTGGACGGCA AGTGCGAAAC TTCGCGGACC
GACGACACCG TCGAGCTGAT CGACCTGCCG CTGATGGTGC TCACCGACCG CAGTTGCGCC
TCGGCGTGCG AGCACTTCAG CTCCGCGGTC AAGGACCTGC GCCTCGGCCG GCTGGTCGGC
ACCAGAACCG CCGGCGTCAT CTCCGGCCCG GCGCAGCCGT ACCTGCTCGG CAACAACACC
AGCCTGAGCT TCCCCGCCAG GCACCACCTC GGGCCCAAGC GCGAGGTGAT CGACCGGATC
GGCGTGCCGC CCGACCACCA CGTGCCCCTG ACCCCGAAGG ACGCGGCCGC CGGGCGCGAC
CCCGCGCTGG CCAAGGCCCT GACCTTGCTG AACGAGTGA
 
Protein sequence
MTIIRIAAAG LALAAAAACT APAPPRSQAS TAAGTVCAAP RGVPGAETAT TIDVIEQAYF 
CLLGNYYSGA TLDARSLLSA GFVALTQELN RNGRDVPEAT MPALTGDRKT DWTAFEAAYR
ETTDQVPDLR DKLAVVTLEA IVASLGDNHA RWAHDVKRPP DYYDGDGYGL GFQANVNGPQ
VDGNPGVALP PLFVTTVQGG AAQAAGLRPG DIIESVNGSA PFIDGKATPA IAALYPGYPE
ARPVRLRLLR QSTGRRWSVT LKPGLYQRDL AALQVVTSKL LDDDIAYVRL RGFAPDSADR
VFKAISRLRA GRTLSGVVLD LRGNGGGSPV EATRLVSAFG HGKVTAYQCT VDGKCETSRT
DDTVELIDLP LMVLTDRSCA SACEHFSSAV KDLRLGRLVG TRTAGVISGP AQPYLLGNNT
SLSFPARHHL GPKREVIDRI GVPPDHHVPL TPKDAAAGRD PALAKALTLL NE