Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4863 |
Symbol | |
ID | 8668157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 5385881 |
End bp | 5387299 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Periplasmic protease-like protein |
Protein accession | YP_003340424 |
Protein GI | 271966228 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0643928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCA TCAGGATCGC GGCCGCCGGG CTCGCCCTCG CGGCGGCGGC GGCCTGCACG GCACCGGCGC CGCCGCGCAG CCAGGCGAGC ACGGCGGCCG GTACGGTCTG CGCAGCGCCC CGGGGAGTTC CCGGGGCGGA GACGGCCACC ACGATCGACG TGATCGAACA GGCCTACTTC TGCCTCCTCG GCAACTACTA CAGCGGCGCC ACGCTGGACG CCCGCTCGCT GCTGAGCGCC GGATTCGTCG CCCTGACCCA AGAGCTCAAC CGCAACGGCC GCGACGTGCC CGAGGCGACC ATGCCCGCGC TGACCGGCGA CCGCAAGACC GACTGGACCG CCTTCGAGGC CGCCTACCGC GAGACCACCG ATCAGGTCCC CGACCTCCGC GACAAGCTCG CCGTCGTCAC CCTGGAGGCC ATCGTGGCCA GCCTCGGCGA CAACCACGCC CGCTGGGCGC ACGACGTCAA GCGGCCGCCC GACTACTACG ACGGCGACGG CTACGGCCTG GGTTTCCAGG CGAACGTCAA TGGCCCGCAG GTGGACGGCA ACCCCGGCGT CGCCCTCCCC CCGCTGTTCG TCACCACCGT GCAGGGCGGC GCGGCGCAAG CGGCCGGGCT GCGCCCGGGC GACATCATCG AATCGGTCAA CGGATCGGCG CCCTTCATCG ACGGGAAGGC CACTCCCGCG ATCGCCGCCC TCTACCCGGG GTACCCGGAG GCGCGCCCGG TCCGATTGCG GCTCCTGCGG CAGAGCACCG GCCGCCGCTG GAGCGTGACG CTCAAGCCCG GCCTCTACCA GCGGGATCTG GCCGCCCTGC AAGTGGTGAC CTCGAAGCTG CTGGACGACG ACATCGCCTA TGTACGGCTG CGCGGGTTCG CTCCCGACTC CGCGGACAGG GTCTTCAAGG CGATCTCCAG ACTGCGCGCC GGCCGGACCC TGTCCGGCGT CGTGCTGGAC CTGCGCGGCA ACGGCGGCGG CAGCCCCGTG GAGGCGACCC GGCTGGTAAG CGCGTTCGGC CACGGCAAGG TCACCGCCTA CCAGTGCACC GTGGACGGCA AGTGCGAAAC TTCGCGGACC GACGACACCG TCGAGCTGAT CGACCTGCCG CTGATGGTGC TCACCGACCG CAGTTGCGCC TCGGCGTGCG AGCACTTCAG CTCCGCGGTC AAGGACCTGC GCCTCGGCCG GCTGGTCGGC ACCAGAACCG CCGGCGTCAT CTCCGGCCCG GCGCAGCCGT ACCTGCTCGG CAACAACACC AGCCTGAGCT TCCCCGCCAG GCACCACCTC GGGCCCAAGC GCGAGGTGAT CGACCGGATC GGCGTGCCGC CCGACCACCA CGTGCCCCTG ACCCCGAAGG ACGCGGCCGC CGGGCGCGAC CCCGCGCTGG CCAAGGCCCT GACCTTGCTG AACGAGTGA
|
Protein sequence | MTIIRIAAAG LALAAAAACT APAPPRSQAS TAAGTVCAAP RGVPGAETAT TIDVIEQAYF CLLGNYYSGA TLDARSLLSA GFVALTQELN RNGRDVPEAT MPALTGDRKT DWTAFEAAYR ETTDQVPDLR DKLAVVTLEA IVASLGDNHA RWAHDVKRPP DYYDGDGYGL GFQANVNGPQ VDGNPGVALP PLFVTTVQGG AAQAAGLRPG DIIESVNGSA PFIDGKATPA IAALYPGYPE ARPVRLRLLR QSTGRRWSVT LKPGLYQRDL AALQVVTSKL LDDDIAYVRL RGFAPDSADR VFKAISRLRA GRTLSGVVLD LRGNGGGSPV EATRLVSAFG HGKVTAYQCT VDGKCETSRT DDTVELIDLP LMVLTDRSCA SACEHFSSAV KDLRLGRLVG TRTAGVISGP AQPYLLGNNT SLSFPARHHL GPKREVIDRI GVPPDHHVPL TPKDAAAGRD PALAKALTLL NE
|
| |