Gene Rcas_3957 details


Gene Information

Locus tag: Rcas_3957
Symbol:
ID: 5541463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5163901
End bp: 5165274
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 61%
IMG OID: 640896065
Product: peptidase S41
Protein accession: YP_001434008
Protein GI: 156743879
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
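
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5163901, 5165274)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```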


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.604883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.196815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCTGG GAAACGTCCT GCTTCGACTA GGGGCTGTCG TCAGCATTGT CACGCTGACC 
GGATGTGGCA TGGCGCCGGT ATCGACGGTG CGTTCCACAA CGGCGGCGAC TGATGCGCTG
CCTGCATGGA CTGAGGGAAT TGCGACATCG GGTGCGTCCA CTCCCGCAAC TGCGACAGTA
ACACCGCTCC CTTCTCCCAC GCTTGTTCCG ACGCCGATTC CAACGCTCAC TCCAACGCCT
GATGTGCCTG CGCTCGATGA ACGACTTCAG ATTTTCGATG AAGTCTGGCG CATTGTGAAC
GAAGAATATC TCTATGATGA TTTTGGCGGC ATTGACTGGG AAGGGCTGTA TCCCGACTAT
CGCGCCCGGA TTATGCGGGC GCAATCGCGC GATGAATTCT ATGCGGCGAT GATCGATATG
GTCGCCCAAC TGAACGATAA CCACTCACGC TTCGTCCCGC CCGCTTCTGT CGAAGCGGAA
GATGCAACCG CCAGTGGACA GGAGACCCGT GTCGGAATTG GTGTGAGCGT TCAACCGAAG
CCGGATGGCG GGTTTATTCA GCAGGTCTTT CCGGAAAGCC CCGCAGCGCG CGCCGGGATT
CGTCCGCGTG ATCGGATTGT GGCAGTGGAC GGACGACCGT ATGCGGTATC CGATGGTGAT
CTCCAGGGCG ACATCGGCAC GTCGGTGCGT CTGACGATTG CGCGCCCCGG CGCAAAATTG
CGCGATGTGC TGCTCACCCG TCAGGAGGTG CGCGCCAGTA TTCTGCCCTA CTATCGCCGT
TTCCCCGGCG ATATTGGATA TGTCGCTGTG CCGACGCTCT GGGTTCACGA TATGGGAGAA
CAGGTGAGCG GCGCCCTCAC CGATCTGGTG GCGGGTGGTT CGTTGCGCGG TCTGATCCTC
GATCTGCGCT CGAACCGTGG TGGTTGGGGT GAGGTGTTGT CGGGGGTGTT GAGCCATTTT
GTGCGCGGGC AGGTCGGTGT CTTTTTCGGG CGCGACTATG TGCGTCCGCT GGTCGTCAAT
CCGCCTGCCG GGCCGGATAT GCGCAATCTC AGCGCCGGTC TGGTCGTCCT GATCGATGAA
GAGACTGCTT CCTACGCCGA GGTGCTGGCA GCGGTGTTGC AGGCAGAAGC GGGGGCGATC
GTCGTCGGGG CGCGATCGGC AGGGAATACG GAAACGATCT ATGCTCACGC GCTGCGCGAT
GGGTCGCGCC TCTGGCTCGC GCAGGAAGGG TTTCGGTTGC GCAATGGCAC AGACCTGGAA
GGGCGCGGCG TCATTCCCGA TGTGTTGCTC GAAGTCGATT GGACGCGCTA TAGCGAAGAT
GATGATCCCC AACTGCTTGA GGCGCTGCGG TTGCTTGGTG GAGGACCAAA GTGA
 
Protein sequence
MVLGNVLLRL GAVVSIVTLT GCGMAPVSTV RSTTAATDAL PAWTEGIATS GASTPATATV 
TPLPSPTLVP TPIPTLTPTP DVPALDERLQ IFDEVWRIVN EEYLYDDFGG IDWEGLYPDY
RARIMRAQSR DEFYAAMIDM VAQLNDNHSR FVPPASVEAE DATASGQETR VGIGVSVQPK
PDGGFIQQVF PESPAARAGI RPRDRIVAVD GRPYAVSDGD LQGDIGTSVR LTIARPGAKL
RDVLLTRQEV RASILPYYRR FPGDIGYVAV PTLWVHDMGE QVSGALTDLV AGGSLRGLIL
DLRSNRGGWG EVLSGVLSHF VRGQVGVFFG RDYVRPLVVN PPAGPDMRNL SAGLVVLIDE
ETASYAEVLA AVLQAEAGAI VVGARSAGNT ETIYAHALRD GSRLWLAQEG FRLRNGTDLE
GRGVIPDVLL EVDWTRYSED DDPQLLEALR LLGGGPK