Gene Sros_5940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5940 
Symbol 
ID8669234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6512982 
End bp6514418 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content72% 
IMG OID 
ProductZn-dependent protease 
Protein accessionYP_003341418 
Protein GI271967222 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.231854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00252026 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGAGC GGACCGGAGA ACAGCACACC TCGGCGGACG TCTCGACCGG CCGTGGGGAA 
GGGATCGGGA TGAGCCCCCA GGAGATGATC GAGAAGGCTC TGGATCTCTC CACGGCCGAC
GACGTCGTCG TGATCGTGGA CGAGGGCTCC AGCGCCAACC TGCGTTTCGC GGGCAACACG
CTCACCACCA ACGGCGTGGG CCGTTCCTCG CAGCTCACCG TGATCTCGAT CGTGGGCCGG
GGGGTGGGTG TGGTGTCGCG GGCGGCGGTC CGCCCCGAGC AGCTCGCCGA CATCGTCGCC
GCCGCCGATC ACGCCGCAGA GGACGCGACG CCGTCCGAGG ACGCCCGGCC GCTGGTCGAG
GGCGGCCGCT CGGCGGACTG GGACCTGCCG GCGGAGTCCA CCTCCATCGG CGTGTTCGGT
GCCTTCGCCC CCGCCCTGGG CGAGGCCTTC GCCACGGCCG AGGCGGGCGG GCGCAGGCTG
TACGGCTTCG CCGAGCACAT CGTCACCACC ACCTTCGTGG GCACCTCGGC CGGGGCCCGG
CTCCGGCACA CCCAGCCCAC CGGCCGGCTG GAGCTCAACG CCAAGTCACC GGACATGAAG
CGCTCGGCGT GGACGGGCGT GGCCACCCGC GACTTCTCCG ACGTGGACGT GGCGGCGCTC
GACGCCTCGC TGGCCCAGCG GCTGGAGTGG GCGAAGAACC GGATCGACCT GCCGGCGGGC
CGATACGAGA CGCTCCTGCC TCCGACGGCG GTGGCCGATC TGATGGTCTA CCTGTACTGG
ACGGCCGGCG CCCGGGACGC GCTTGAGGGC CGGACGGTCT TCTCCAGGCC GGGCGGCGGG
ACCCGCGTCG GGGAGACCCT CTCCCCCCTC CCGATCCGCC TGTACAGCGA TCCGGCCGCG
CCGGGCATCG CGTGCGCGCC GTCCGTGGTG GCGCACGCCT CCAGCAGGCA GAGCTCGGTG
TTCGACAACA CCCTGCCGCT GGCGCCCACC GACTGGATCG CCGACGGCAC CCTGACGAGC
CTGATCCAGA CCCGGCACTC GGCCGAGCTG ACCGAGCTGC CCGTGACTCC GGCGATCGAC
AACCTGATCA TGCGGGGGCC CGAGGGCGGC CGGTCGCTGG AGGAGATGAT CGCCTCCACC
GAGCGCGGCC TGCTGCTCAC CTGCCTGTGG TACATCCGCG AGGTGGACCC GCAGAGCCTG
CTCCTCACCG GGCTGACCCG TGACGGCGTC TACCTGGTGG AGAACGGCGA GGTCGTCGGC
GAGGTGAACA ACTTCCGCTT CAACGAGAGC CCGGTGGACC TGCTCGGCCG CCTCACGGAG
GTCGGCGCCT CGGAGCAGAC CCTGCCGCGC GAGTGGAGCG ACTACTTCAC CCGCACGGTC
ATGCCCGCGA TCCGGGTGCC CGACTTCAAC ATGTCGACGG TCAGCCAGGC GAACTAG
 
Protein sequence
MSERTGEQHT SADVSTGRGE GIGMSPQEMI EKALDLSTAD DVVVIVDEGS SANLRFAGNT 
LTTNGVGRSS QLTVISIVGR GVGVVSRAAV RPEQLADIVA AADHAAEDAT PSEDARPLVE
GGRSADWDLP AESTSIGVFG AFAPALGEAF ATAEAGGRRL YGFAEHIVTT TFVGTSAGAR
LRHTQPTGRL ELNAKSPDMK RSAWTGVATR DFSDVDVAAL DASLAQRLEW AKNRIDLPAG
RYETLLPPTA VADLMVYLYW TAGARDALEG RTVFSRPGGG TRVGETLSPL PIRLYSDPAA
PGIACAPSVV AHASSRQSSV FDNTLPLAPT DWIADGTLTS LIQTRHSAEL TELPVTPAID
NLIMRGPEGG RSLEEMIAST ERGLLLTCLW YIREVDPQSL LLTGLTRDGV YLVENGEVVG
EVNNFRFNES PVDLLGRLTE VGASEQTLPR EWSDYFTRTV MPAIRVPDFN MSTVSQAN