Gene Paes_1010 details

Gene Information

Locus tag: Paes_1010
Symbol:
ID: 6458837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 1112762
End bp: 1114405
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 48%
IMG OID: 642725009
Product: carboxyl-terminal protease
Protein accession: YP_002015696
Protein GI: 194333836
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
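
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon; the function names are hypothetical and are not part of the source database.

```python
# Coordinates copied from the record above.
# Assumption: both positions are 1-based and inclusive.
START_BP = 1112762
END_BP = 1114405


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1644
print(protein_length(length_bp))  # 547
```

Both values agree with the Gene Length and Protein Length fields of the record.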


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGTA TTTTTCAGGC CCTGGTTATG ATTGCAGTCC TCTCTTTCGG TATTCTTCTT 
GGGAGCAGAC TGCAAAACGA TTCTTTGGGT CCTGTTTTTG GTTATCAGAA AAAACTCTTT
GATGTTATTA ATCTGATTTC ATCGAGATAT GTCGATGATG TTGATCCTGA TTCTCTGGCG
GAAAGCGGCA TTAAGGGTAT GGTTGAATCC CTTGATCCTC ATACTGTTTA TCTGAGTCAG
GATAAAGTCT CCTATTCAAA AGCGGAGTTT AAGGGGAACT TTGAGGGGAT AGGCATTGAA
TTCGATATCA TTCGAGATAC GCTTGTCGTT GTTGCTCCAT TGACCAGTGG ACCAAGCCAG
GACGCTGGCA TTATGGCAGG CGACAGGATT CTTGCGATTG ATTCTGTCAG CGCGATAGGT
ATTGCATCGT CAGCAGTTGT TGCTTCATTG CGGGGCGAGA AGGGTTCTTC GGTTCATCTT
GATCTTTACA GGCCTTTCAG TAAACGGTTT CTGCATCTGG ATGTAAAACG AGATAGAATT
CCTACGTATA GCATCGATGC GTCGTTTCTT CTCGATGACA GGACCGGTTA CATCCGTTTG
AGCCGTTTTG TTTCAACGAC ATCTGATGAG TTTGTACAGT CGCTTTCTGA TTTGAAGTCA
AGAGGCATGA AGGGTCTTGT CGTTGATTTG CGGGGTAATC CCGGCGGATA TCTTGAGGAG
GCTGTTGAAA TTGCTGACGA GTTTCTTGCG CCGGACAGTT TGATTGTCTA CACAAAAAGC
CGTCATGGCG GGCCGGACGA AATCAAATAT CGTTCTACAG CTGATGGCGA TTATCAAAGC
GGGCCTCTTG TCATCCTCGT CGACAGGGGA AGCGCTTCTG CTGCTGAAAT TCTGGCCGGA
GCCCTTCAGG ACAGCAGGCG TGCACCGGTT GTCGGGGAGC TGACATTTGG CAAGGGGCTT
GTTCAGCGTC AGTTTGATCT TTATGACGGT TCGGCAATTC GACTGACCAT AGCCCGTTAC
TATACTCCCC TGGGTCGACA GATTCAACGG GATTACGATA ACGGGGCTCG CGGTCGAGAG
GATTATTATG AAGACCATTC TTCGCTCCTT GCAAGTGAGT CTTTCTTCGA TGATAGAGAG
GATCTTGCTG TTGCTACTGA CGTCGATGGC GTTCGTGTCT ACAGAACAGA TGCTGTTTCT
CTCGGAGAGG TTGCCGATAG TGCGGCAGTC AAAGCTTTTG GCGGTGTTGG GGGTATCATT
CCTGATTTCT GGGTTCTGGA TGATAGACCC GGAGAGTATT TTGTGATGCT GCAGGAGAAG
GGGGTGATAG AGGAAACAGC CCTTGCGGTC CTGGATGATT CGGCAAGCAG GGTTCGTGAA
CTGGGCGGCT CTCTTGATCT TTTTCTTGAA AAATATGCTG AGAATCAACG TGTCGAGCGT
TATCTTGAAA TGGTATGCAG GCGTAAGAAT ATGACGATTG ATGCGTTGGA GCTGGAGCGG
GAAAAGACCC GTATTATGAT AGCGGTCAAA TCCCGTATTG CCCGGCAGCT GTTCGGTATC
GGCGGTCAGA TTCGGGTTCT GGTGGAAGAG GCCGACAAGG TTCTTCTGGT TGCCAGAGAG
CAGCTTTATA AAGAGGTGCT GTAA
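
The GC content field can be checked against the gene sequence itself. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base grouped layout used here; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGAGCCGTA TTTTTCAGGC CCTGGTTATG ATTGCAGTCC TCTCTTTCGG TATTCTTCTT"
print(round(gc_percent(first_line), 1))
```

Running it on the complete sequence should give a value close to the rounded figure reported in the GC content field.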
 
Protein sequence
MSRIFQALVM IAVLSFGILL GSRLQNDSLG PVFGYQKKLF DVINLISSRY VDDVDPDSLA 
ESGIKGMVES LDPHTVYLSQ DKVSYSKAEF KGNFEGIGIE FDIIRDTLVV VAPLTSGPSQ
DAGIMAGDRI LAIDSVSAIG IASSAVVASL RGEKGSSVHL DLYRPFSKRF LHLDVKRDRI
PTYSIDASFL LDDRTGYIRL SRFVSTTSDE FVQSLSDLKS RGMKGLVVDL RGNPGGYLEE
AVEIADEFLA PDSLIVYTKS RHGGPDEIKY RSTADGDYQS GPLVILVDRG SASAAEILAG
ALQDSRRAPV VGELTFGKGL VQRQFDLYDG SAIRLTIARY YTPLGRQIQR DYDNGARGRE
DYYEDHSSLL ASESFFDDRE DLAVATDVDG VRVYRTDAVS LGEVADSAAV KAFGGVGGII
PDFWVLDDRP GEYFVMLQEK GVIEETALAV LDDSASRVRE LGGSLDLFLE KYAENQRVER
YLEMVCRRKN MTIDALELER EKTRIMIAVK SRIARQLFGI GGQIRVLVEE ADKVLLVARE
QLYKEVL
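
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this in plain Python. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the start codons it permits; since this gene starts with ATG, the sketch does not treat alternative start codons specially. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids
# ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAGCCGTA TTTTTCAGGC C"))  # MSRIFQA
```

The output matches the first seven residues of the listed protein sequence; passing the full gene sequence translates the complete protein.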