Gene Rcas_3085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3085 
Symbol 
ID5540581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3996347 
End bp3997393 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content58% 
IMG OID640895204 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_001433157 
Protein GI156743028 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATC AACCGCCTGT TTCCGCTCCG GTTCCGCTGG CGCAACCGGC GCCGCGAAAA 
GACCGGACGT GGATTGTTGT CATTTCGATC ATTGTGGGCA TTGTGCTGGC GTGCGCCATT
TTGCCGCTCG GTGGCATGGC GTTGCTGCTG GCGTTCGATG GCGGCAATGC GGCTGCGACA
GTTCCCGGCA GCAGATGGCA GGAAGAGGTC ATCTCTGGAC GAGGAAATGA TCGCATCGTG
GTTATTACCG TTAGCGGCAC AATTGGGGCC GACGCAGGCG ATGGCTTGTT CACCACAGGG
TTGAGCCACG AGCAGTTGCT GTCGCAGATT CGCACTGCTG CAAACGACTC GCGGGTGAAG
GCGGTGGTGC TGCGGGTGGA TAGCCCTGGC GGCAGCGTCG TTGCCAGCAA TGAGTTGTAT
GTCGAACTCA AGAAATTGCG AGAAAAAGGG AAGCCGCTGG TGATCTCGAT GGGTTCAATT
GCAGCCAGCG GTGGATACTA TATCTCGATG GCGGGCGAAC GCATCTATGC CAACCCGGAT
ACGCTCACCG GCAGCCTTGG GGTCATTGTG TCGCTGCTGA ACTATGATGA AGCCTTTGAA
CGCCTTGGTC TGCGCGAGTA TGTGTACAAA AGCGGTGATT TCAAGGATAT CGGTTCGCCA
CTGCGCCCGC CGCAGCCGGA GGAAGAGGCG ATCTGGAATG CGCTGGTCGA TGAGGCGTAC
CAGGGGTTTA TCGATGTGAT TGTCGAGGGG CGCGGGATGG AACGCACTGA GGTGATCCGG
CTCGCCGACG GGCGAATCTA CACCGGGCGA CAGGCGAAAG CGCTGGGTTT GATCGATGAA
CTTGGCAACC TGGAAGATGC AATCGAAGGT GCAAAAGAAC TGGCGGGGTT GACCGATGCG
TTGATTGTGC GCTATCGTTC GTTCAATACG CTGCGCGAAT TGTTGCAGGC AAATCTGGAA
CAGAACCTGC AACCGTCTGA TCCACTGGGG TTGCGCGCTA TCGCGCAGCC GCGCGCGCCA
CGGTTGGAGT ATCGGTTTGT TCCTTGA
 
Protein sequence
MSDQPPVSAP VPLAQPAPRK DRTWIVVISI IVGIVLACAI LPLGGMALLL AFDGGNAAAT 
VPGSRWQEEV ISGRGNDRIV VITVSGTIGA DAGDGLFTTG LSHEQLLSQI RTAANDSRVK
AVVLRVDSPG GSVVASNELY VELKKLREKG KPLVISMGSI AASGGYYISM AGERIYANPD
TLTGSLGVIV SLLNYDEAFE RLGLREYVYK SGDFKDIGSP LRPPQPEEEA IWNALVDEAY
QGFIDVIVEG RGMERTEVIR LADGRIYTGR QAKALGLIDE LGNLEDAIEG AKELAGLTDA
LIVRYRSFNT LRELLQANLE QNLQPSDPLG LRAIAQPRAP RLEYRFVP