Gene Rcas_4069 details

Gene Information

Locus tag: Rcas_4069
Symbol:
ID: 5541580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5282028
End bp: 5283299
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 63%
IMG OID: 640896181
Product: carboxyl-terminal protease
Protein accession: YP_001434119
Protein GI: 156743990
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
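
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose last codon is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5282028, 5283299)
print(length, protein_length_aa(length))  # 1272 423
```

Both values agree with the Gene Length and Protein Length fields listed for Rcas_4069.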


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.686351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTC TACGACAATT ATTCCGTCTG CGACTGCCGA TCTGGTTAGT GACGCCGTTG 
CTGGCGTTTG TGCTGACGCT GGGAATCGGC GGCGGCTATC TGCTGGCGTT GCGTGTGACT
ACTCCCTGCC CGCTCCAGGC GCAAGAGTGC GCCGCGTTGA CGAATTTCTG GCGTGTGTGG
CAACTGGCGC GCGACCATTT TGTTGACCCG GCTGCGATTG ATCCGCAGCG CATGAGCGAC
GGTGCGATCA ATGGCATGCT CGATAGCCTG GGAGACCAGG GTCACACGCG CTACCTGAAT
GCAGACGAAG CCCGACGGGA GCGTGAGGCG CTTTCCGGCA GATTCGAGGG CATTGGCGCC
TATATCGACG TGCGCGACGG GCAGCCGCGA ATTGTCGCTC CTATCGAGGG ATCGCCAGCC
GAACGCGCCG GGTTGCGCCC CGACGACCTG ATTCTGCGCG TCGATGGATA CGATGTGCGG
GGGGTGACCG TGGAAGAACT GCGCAACCGG GTGCGTGGTC CAAAGGGGAC GCAGGTGGTA
TTGACCATTC AGCGCGCCGG TGTGGCAGCG CCGTTCGACG TGACGATTAC GCGCGAGGAG
GTGAATGTTC CCAGTGTCAC CTGGCGCATG CTGCCCGACC GTATTGCGCT GATCAAGATC
AATCGTTTCG CCGAGCGCAC CGGAGCGGAG TTGCAACAGG CGCTGCTGGA GGTTCGGGCG
CAGAAGGCGC AGGCGATCAT TCTCGATCTG CGCAACAACC CCGGTGGTCT GGTGACGCAA
CTGGTCGCTG CGGCCAGTCA GTTTATGCCA GAAGGGAGCA CGGTGCTCAT CGAACAGGAC
CGTGACGGCG CCCAACGGCC ATACACAACC ACCGAAGGCG GACTGGCGCT CGATATTCCG
CTGGTTGTGC TGGTGAACAA CAACAGCGCC AGCGCCGCCG AGATCCTGGC AGGCGCGTTG
CAAGAGAACG GACGCGCGCG CGTGATCGGG CAGGCGACGT TTGGCACGGC AACGGTTCTG
CGTCCGTTTG ATCTGGAAGG CGGCGCACAG GTGCGTCTGG GCGCCTCACA GTGGCTGACG
CCGAAGGGCA GGGTGGTGCG CGGTGTGGGC ATTCAGCCCG ATGAATTGAT CGCGCTGGCG
CCAGGGGTTG CGCCACTCAC CCCGACTGAA GCGGCAACTC TCACCCCGGA GGAATTGCAG
CGCAGTCAGG ATATTCAGTT GTTGCGCGGG CTTGAAGTAG TGCGCGAGGC GCTGGCGCAA
AAAACGTCGT AA
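
The gene sequence is printed in space-separated blocks of ten bases, so it must be joined before any analysis. As a minimal sketch, the hypothetical helpers below strip the layout whitespace and compute GC content as a percentage. Only the first line of the sequence is used as sample input, so the printed value describes that 60-base fragment rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGAACATTC TACGACAATT ATTCCGTCTG CGACTGCCGA TCTGGTTAGT GACGCCGTTG"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full 1272 bp sequence through the same two functions gives a value to compare with the GC content field of the record.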
 
Protein sequence
MNILRQLFRL RLPIWLVTPL LAFVLTLGIG GGYLLALRVT TPCPLQAQEC AALTNFWRVW 
QLARDHFVDP AAIDPQRMSD GAINGMLDSL GDQGHTRYLN ADEARREREA LSGRFEGIGA
YIDVRDGQPR IVAPIEGSPA ERAGLRPDDL ILRVDGYDVR GVTVEELRNR VRGPKGTQVV
LTIQRAGVAA PFDVTITREE VNVPSVTWRM LPDRIALIKI NRFAERTGAE LQQALLEVRA
QKAQAIILDL RNNPGGLVTQ LVAAASQFMP EGSTVLIEQD RDGAQRPYTT TEGGLALDIP
LVVLVNNNSA SAAEILAGAL QENGRARVIG QATFGTATVL RPFDLEGGAQ VRLGASQWLT
PKGRVVRGVG IQPDELIALA PGVAPLTPTE AATLTPEELQ RSQDIQLLRG LEVVREALAQ
KTS
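
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
print(translate("ATGAACATTC TACGACAATT ATTCCGTCTG CGACTGCCGA TCTGGTTAGT GACGCCGTTG"))
# MNILRQLFRLRLPIWLVTPL
print(translate("AAAACGTCGT AA"))
# KTS
```

The two outputs match the first twenty and the last three residues of the protein sequence listed above.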