Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2526 |
Symbol | |
ID | 5540008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3258524 |
End bp | 3260209 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894657 |
Product | signal peptide peptidase SppA, 36K type |
Protein accession | YP_001432624 |
Protein GI | 156742495 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0222519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATCCA TATTCTTCAT CTGGCTGATC AACCTGACGC GGCGCGTCCG CAACCTGTGG CGGCGGTTGC TGCGGCGGCA GGTTGCATTT GTGCGCATCC CAATCGGTGG GGCGCTGCCA GAGTTCGCGC CGGCGCCGCC GTGGTGGGCG CACCGCTTCT TTGGGGCAGC GGCGCCACCG AGCCTCAGTG AGTTGCGTCG TCGCTTCGAG TGGTTGGCGT CTGACCCGCA AGTGAAAGGC GTTGTGCTCG ACATTGGGAC TCTGACCTGT GGATGGGCGA CGATCCAGAA CCTGGATCAG GACATCCGCA GGTTTCGCGA GCAGGGCAAA CTGGCTGTTG CGCGTATCAC AAACCCTGAC ACAAAGACCT ATGTTGCAGC ATGCGCTGCC GATCTAATTG TGGCGCCGCC GGTGAGTTTA CTGACCGTTA CCGGTCTGTA CGCCGAAGTG CGGTTTCTCA AAGATGCGCT GGCAAAAGTG GATGTGAGCG TAGAAGTGAC GGCAGTGTCG CCGTACAAAA CGGCGGGCGA CTCGCTGGCG TGCTCGGAAA TGTCGCCAGA AAACCGCGAG CAGATTGAAC GGCTGCTCGA TCAGCGTTAT GCGTTGATCG TCGAGACCAT CGCCAACGCG CGACATAAGA CCGTCGATGA GGTGTGTTCG CTCATCGATA CCGCACCGTG GAGCGCCCGG CGCGCCCAGG AAGCCGGTTT GATCGATGCC GTGCTGTACG AGGACGAACT GCCCGCTTTT CTTGCATCAC GCACTGGCGC GTCTCCGGCA AAACTGCCAG AGATCGCCGA ATGGAGTCAG GCGCGGCGCG CTCTGCGCCT CCCACTGTTG CGCCACCATC GTCGTCTGGT CGGTGTTGTC GCTGTGGAAG GGACGATCGC GCCGGGAACC AGCCGCCAGA TTCCGCTGCC GATCCCGCTG ATCGGTGGTC AAATCGCCGG AAGCGAGAGT ATCGTGCAGG CGCTGCGTCA GGCAGAGCGC AATCCGCGTC TTGCTGCTGT GATTCTTTAT GTCAACTCAC CCGGCGGCAG TGCGTTCGAC TCAGAACTGA TCTGGCGCGA AGTGCGGCGT CTCGACCGGC GCAAGCCGGT GGTGGCGGTG ATGGGGGACG TCGCGGCTTC AGGCGGCTAC TACGTCGCAT CCGGTGCGCG CACTATTCTG GCTCAACGTG GAACAATTAC CGGCAGCATT GGCGTTCTGA TCGTCCGTCC GGTTATTGAC GGTCTCGTGA AGCGCGCTGG CGTGAACACC GTCGCCATCG GGCGCGGCGC AAACAGCGCA TTCTTTATCA GCGATGCGCC AACTGAACAG GAACGCGCGG CGGTGCGCGC ATTGATCGAC GACAGTTACA CCGTCTTCAA ACAGCGCGTG ATGGAAGGGC GATCAATGTC CGAAGAGGCG CTTGAACCGC TGGCAGGCGG GCGCGTCTGG ATGGGGGGAG AAGCGCATGA GTCCCATTTG ATCGATGACG TCGGCGGTAT GCCGGAGGCG CTGTTGAAGG CGCAGGAACT CGCCGGACTG CCCCGCGATC AGACGGCGCC GCTGGTTCTG ATCGGCGGCG GACGCGGACG CCTGGCGCCG CAGACATTCC CTGAAGAGCC ATCGAAGACG TTGCGCGAGG CGCTGGCGCT TTTGCGTCAA CCGCTGATCT GGGCTATATT GCCGTTTTTT GAATAG
|
Protein sequence | MVSIFFIWLI NLTRRVRNLW RRLLRRQVAF VRIPIGGALP EFAPAPPWWA HRFFGAAAPP SLSELRRRFE WLASDPQVKG VVLDIGTLTC GWATIQNLDQ DIRRFREQGK LAVARITNPD TKTYVAACAA DLIVAPPVSL LTVTGLYAEV RFLKDALAKV DVSVEVTAVS PYKTAGDSLA CSEMSPENRE QIERLLDQRY ALIVETIANA RHKTVDEVCS LIDTAPWSAR RAQEAGLIDA VLYEDELPAF LASRTGASPA KLPEIAEWSQ ARRALRLPLL RHHRRLVGVV AVEGTIAPGT SRQIPLPIPL IGGQIAGSES IVQALRQAER NPRLAAVILY VNSPGGSAFD SELIWREVRR LDRRKPVVAV MGDVAASGGY YVASGARTIL AQRGTITGSI GVLIVRPVID GLVKRAGVNT VAIGRGANSA FFISDAPTEQ ERAAVRALID DSYTVFKQRV MEGRSMSEEA LEPLAGGRVW MGGEAHESHL IDDVGGMPEA LLKAQELAGL PRDQTAPLVL IGGGRGRLAP QTFPEEPSKT LREALALLRQ PLIWAILPFF E
|
| |