Gene Rcas_2857 details

Gene Information

Locus tag: Rcas_2857
Symbol:
ID: 5540346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3709080
End bp: 3710528
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 62%
IMG OID: 640894984
Product: peptidase S41
Protein accession: YP_001432944
Protein GI: 156742815
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
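
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3709080
end_bp = 3710528

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1449 bp) and Protein Length (482 aa).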


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACTC GTTGTCTCCG CTTCGTGAGT GCGCTGCTCG TCGTCGTGCT GGCGGGATGC 
GGCAACCTGC CCGTTCCGTT GGGTTCGATG TCGACCACAA CGCCCACCCC TCTTCCAACA
GCGACGTCGC AGCCAACGGT CGCGCCAACC GACACACCAC GCCCGACGCC TCCTGCAACG
GCAACGCCAA CCCTTGCCCC CACTGCAACG CCGACAGTCA CGCCATACCC GCTCCTGCCC
ACCCCCACGC TTGCGCCGCT AAGCGCAAAC GAGCGTCAGG CGGTTTTCGA AGATGTCTGG
ACACTGGTGC GCGACCGGTA TGTGTACGAG GACTATGGCG GCGTCGATTG GGACGCGGTG
CGTGCTGAAT TTGAGCCGCG CATTGCCCGC GCCGCTTCGG AAGCGGAGTT CTATGCGCTT
ATCGAAGAAA TGATCGGGCG GCTTGGCGAT GATCATACGC GCTTCGACAC GCCCCAGGAA
GTAGCGGAAG AAGCAGCGCG TTTCGACGGC GGCGTGGCAT ATGCCGGCAT TGGCGCCATG
ATCCGCGATC TGGAAGAAGG GATTCTGATC ACCCGTCTCG CTCCTGGCGG ACCGGCGGAG
CAGGCAGGGC TTCAGCCGCG CGACCTGATC ATTGCCGTCA ATGGCGTGCC GGTCAGCGAC
ACCATCACCT TTGGTCCTGG CGGTCCGGTC TCGGTGGTGC GCGGACAACC CGGAACGCCG
GTGCAACTCC GGATCATCGA CGCCGCCGGC GCCACACGCG ATGTGACGGT CATTCGCCAG
ATCATCCCTC CCGATGCATT CCCGATTGTC GAGGCGCGCC GCGTGCCGGG AACCGATATT
GGCGTGGTGC TGATCGATAC ATTCAATGTC TCAGCGCTCG ATGAGCGCGT AACCGACGCC
ATTATGTCGC TCTATCAATC TGGTCCGCTT GATGGGCTGA TTCTGGATGT GCGCACCAAC
GGCGGGGGAC GCCTCGATAT GCTGCGGCGC ACGCTTGGAT TGTTCCTCGA TGGGGGGACA
ATCGGCAGCA GCAGCGGACG CGAGCGATCA TTCAGCATCG ACGTGCCTTC TGGCAAGACG
TTGCCACTGC TCGAGCAGAT GCCGATTGTG GTATTGACCA GTGATGAAAC CGCCAGCGCC
GCCGAAATGT TTGCAGCCGG CTTGCAATTC CGTGGACGGG CGCGAGTGGT CGGTACGCCA
AGCGCGGGCA ATACCGAGAA TCTGATCGGG TATGATCTCG ATGACGGTTC ACGCTTCTGG
CTGGCAGAAC TGGTCTTCCG CCTGCCTGAT GGATCGTTGC TCGAAGGGCG CGGCGTCCAG
CCGGATCGGA TCGTCGAGAT TGACTGGTGG CGCTACGCAT TCGAGGACGA TCCGCAGATA
CATGCAGCGG TCGAGGAACT CCAGCTTGTC GGTCGATCAA ATGACGTTGC AGAGGTGGTC
GAACGTTGA
 
Protein sequence
MMTRCLRFVS ALLVVVLAGC GNLPVPLGSM STTTPTPLPT ATSQPTVAPT DTPRPTPPAT 
ATPTLAPTAT PTVTPYPLLP TPTLAPLSAN ERQAVFEDVW TLVRDRYVYE DYGGVDWDAV
RAEFEPRIAR AASEAEFYAL IEEMIGRLGD DHTRFDTPQE VAEEAARFDG GVAYAGIGAM
IRDLEEGILI TRLAPGGPAE QAGLQPRDLI IAVNGVPVSD TITFGPGGPV SVVRGQPGTP
VQLRIIDAAG ATRDVTVIRQ IIPPDAFPIV EARRVPGTDI GVVLIDTFNV SALDERVTDA
IMSLYQSGPL DGLILDVRTN GGGRLDMLRR TLGLFLDGGT IGSSSGRERS FSIDVPSGKT
LPLLEQMPIV VLTSDETASA AEMFAAGLQF RGRARVVGTP SAGNTENLIG YDLDDGSRFW
LAELVFRLPD GSLLEGRGVQ PDRIVEIDWW RYAFEDDPQI HAAVEELQLV GRSNDVAEVV
ER
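
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two listings relate, the Python below strips that block formatting and translates nucleotides to amino acids. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons). The helper names `clean_blocks` and `translate` are hypothetical, and only the first printed line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of the gene sequence (60 nucleotides, 20 codons).
first_line = "ATGATGACTC GTTGTCTCCG CTTCGTGAGT GCGCTGCTCG TCGTCGTGCT GGCGGGATGC"
print(translate(clean_blocks(first_line)))
```

The result for this first line corresponds to the first two blocks of the protein sequence, `MMTRCLRFVS ALLVVVLAGC`.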