Gene Rcas_2935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2935 
Symbol 
ID5540425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3809423 
End bp3811498 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content61% 
IMG OID640895056 
Productpeptidase S41 
Protein accessionYP_001433015 
Protein GI156742886 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000791887 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0220512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAC GTCCTTTCTG GTATATCGGA GCGATTGCGG CTTTGCTTCT GGCGCTTGCC 
GCTTGTGGCG GCGGCGCGCC GACCAACCTG CTGCCGGCAC AGGATGCGAC TGCCGTCAAC
CCGTCGAATG TCTCGCCCGT AGCAGAGGCG ACCGCCTCGC CGGAGGTGAT CACGCCGTCG
CCGATCCCGT CCCGTACAGC AACAGGCGGC GTTGAGGTGA TCACCGGCGA GTTCACGTAT
ACCAACGATA TCATTACCAC CTACTATGTC GAACACGCTG TCGGGTTGGT CGATCTCTAC
GGCTTCATCA CTCGTGATGA GGAATGGGAA CTGCCGGTCG AGAGTCAGGC GCTGGGACCA
CTGACCATCG ATCTGGAGCG GCAGCGCGGC GAGTTTCGCC TGGCTCTGCC GGCGCGTCCG
GCAGGAGTGC TGGCTGATGT CGATAACAAC ACCCAACGCG ATACCGGCGT ACAGGTGTTC
GTGGTGGCGT ACTGGCCCAA TTTGTATGGC GGTCCCTTTT CCGAAGGGGA CGACCGCAGT
TTCGGGTGGC CCGCCTATCT GGCATCGACG GTCAACGATC CGGAGAACAA CGATGAGATC
ACCGGCGGCA AACTGGTAGT GTGGGCGCCG GACGAAGCGC AACAGTTCCC AACCGATTTT
GGCGCCGATG GGTTGCTGTT TACCGCCGAT GATCCGGTTG GTCCGCTCGC TGCCGGGTAT
TCGGTCATCG ATCTCGATCA GCGTCCGTTT GGCATCGAGC GCAACCGTGA AGAGCAGGTG
ACGCTGCACG AACCGCCGGA TGCAGCGATC AAAGATTTCT CCGATCTGTC GTATACCAGG
GCGTTCGATG AAATGTTCAA ACGGGTGCGG GTCGAATATG CCTTCAACGG CATTCCCGGA
AAAGCGCCGG ATTGGGATGC ACTTTATGCA AACCTGGCGC CGCGCGTTGC CGAAGCGGAA
CGCCAACAGG ATCGCCGCGC GTTTTTTGAG GTCATGTTCG ATTTTGCGAA TGCCTTCCGC
GATGGGCACG TTGGCGTCAA TTCGCCGCTT TCCGGCGCGC TGTTCCGTGA ACGCGCCGCC
GGCGGGTATG GGTTCGCCAT CCGCGAATTG GATGATGGTC GCGCGCTGGT GGTCTTCGTG
ACGCGCAATG GTCCTGCCGA TCGCGCGGGT GTGCAGGTCG GCGCCGAATT GCTGGCGTTC
AACGGCGCGC CGGTCAAAGA CGCAATTGCT GCCGTCGAGC CATTGGGGGG ACCGTTCTCG
ACCGACTTTG CGCTGCGCTA TCAGCAGGCG CGTTACCTGT TGCGCGCGCC GGTCGGGACG
CAAGCGCAGG TGACGTTCGC CAACCCGCGT GGTGCGCCGC AGACGGTCAC GTTGCGTGCG
GTGGAAGAAC GCGACAGTTT TTTTGCGACA TCGATCTTCC AGGAGAGCAA CCCGGCGGCG
CTGCCGGTCG AGTTCGAGCA GCGCGCCTCT GGCGTCGGGT ATATTCGTAT CAATTCCAAC
TACGATGACC TGAATCTGCT GATCCGTCTG TTCGAGCGGG CGCTCAAGAC GTTCGACGAC
CTGGATGTTC CCGGCATTAT TATCGACATG CGGCAGAATA GCGGCGGTGC GCCGCTGGGA
CTGGCAGGGT TTCTGTCCGA CCGGGAGATC ATCATCGGTC AGGACGAATA CTACAGCGAA
CGTACCGGTC GGTTCGAGCC AGAAGGTCCG CTCGATACGA TTCTGCCGCA CCAGAACCAG
TACCGTTTCG ACAAGATTGT GCTACTGGTC GGGCAGGCGT GTTTCAGTGC ATGTGAATTC
GAGTCGTATG GATTCAGCAA AGTTCCCGGC GTGATTGTGA TTGGTGAAAC GCCGACCGCT
GGGGTGTATG CCGAGGTGTC GCGCGGGCAG TATGTGCTGC CGGACGACAT CTTCCTCCAG
GTCCCGACCG GTCGCACGCT GCTGCCGGAC GGTGCGCCAC TGCTGGAAGG AGTGGGAGTT
GTGCCGACGA TCCGTGTGCC GGTGACTGCC GAAACCGTGC TGTCGAACCG TGACGCAGTG
CTCGAGCGCG CGGAGCGAGA AATTGTCGGG CGTTAA
 
Protein sequence
MKQRPFWYIG AIAALLLALA ACGGGAPTNL LPAQDATAVN PSNVSPVAEA TASPEVITPS 
PIPSRTATGG VEVITGEFTY TNDIITTYYV EHAVGLVDLY GFITRDEEWE LPVESQALGP
LTIDLERQRG EFRLALPARP AGVLADVDNN TQRDTGVQVF VVAYWPNLYG GPFSEGDDRS
FGWPAYLAST VNDPENNDEI TGGKLVVWAP DEAQQFPTDF GADGLLFTAD DPVGPLAAGY
SVIDLDQRPF GIERNREEQV TLHEPPDAAI KDFSDLSYTR AFDEMFKRVR VEYAFNGIPG
KAPDWDALYA NLAPRVAEAE RQQDRRAFFE VMFDFANAFR DGHVGVNSPL SGALFRERAA
GGYGFAIREL DDGRALVVFV TRNGPADRAG VQVGAELLAF NGAPVKDAIA AVEPLGGPFS
TDFALRYQQA RYLLRAPVGT QAQVTFANPR GAPQTVTLRA VEERDSFFAT SIFQESNPAA
LPVEFEQRAS GVGYIRINSN YDDLNLLIRL FERALKTFDD LDVPGIIIDM RQNSGGAPLG
LAGFLSDREI IIGQDEYYSE RTGRFEPEGP LDTILPHQNQ YRFDKIVLLV GQACFSACEF
ESYGFSKVPG VIVIGETPTA GVYAEVSRGQ YVLPDDIFLQ VPTGRTLLPD GAPLLEGVGV
VPTIRVPVTA ETVLSNRDAV LERAEREIVG R