Gene Paes_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0021 
Symbol 
ID6458915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp23505 
End bp25175 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content52% 
IMG OID642724013 
Productcarboxyl-terminal protease 
Protein accessionYP_002014734 
Protein GI194332874 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000229164 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00113331 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGCT GGTATCGTCA GCTCTTGCTT TCTTCTGTCG TTGTCCTGAC GACAGCCTCC 
TTCTTTTTCC ATCCTCTTTT CGCCCGTCAA AGCGGCATGT TCTCCATTGT GCGAAGCATC
GACCTGCTTG GGGATGTTTT TCGTGAGGTG TCACTGAGCT ATGTCGATTC CATCGATGTA
GGGAAATTTA TGTTTGCCGG TATCGACGGT ATGCTCGAAA CCCTTGATCC ATATACGGTC
TTTCTCGATA AAGAGGAGTC TGTTGAGCTT GGCGAGATCA CCAGCGGTCA GTATGGTGGT
ATAGGGGTTA CTATTGCCGG AATCGACAAC GGCGTCTATG TCGTATCGGT TCTCGATGGT
TTTTCGGCAT CCCGGGCAGG TATCAAGGTT GGCGATCAGC TCATCAGCGT TGATGGGATT
TCCATCAGGC AGGATTCTCT TGAAACAGTC AAAAACCTGC TCAAGGGAAC ACCTGGAACC
TCGTTGAATC TCGTTCTGCG TCGTTATGGG GCGCAGGCCA ATCGTAAGCT GACCCTGACT
CGTCAGGAGA TCAGGGTGAA CAGTATCCGC TATGCCGGTC TTATAGGCGA TATCGGTTAT
TTTGAGATGA GTTCGTTCGG TAACAGAAGT GCTGAAGAGC TGAGCGCTGC AATCAGTGAA
CTCAACGCAG AGGCGGAGGG TTCGGGACAA CGGATGAAGG CGGTTATTCT CGATCTGCGA
AATAATCCGG GAGGCCTTCT CGACGTTGCC GTCGATGTCA CCGGTCTGTT CGTGCGTCAG
GGCAGCGAGG TCGTTTCCAT CAGGGGGCGT TTGCCTGAGA GTGAAAATCG ATACGTAACC
AAGCGGGATC CCGTTGCAGG CGACCTTCCT CTCTCGGTTC TCATCAACTC AAAAAGCGCT
TCTGCATCTG AAATTGTGGC AGGTGCGATT CAGGAGCTTG ATCGAGGCGT GGTTATCGGG
GCGCGTTCAT TTGGCAAAGG TCTTGTGCAG TCGATCATTC CTCTTCCTTA TGACTGCAAG
CTCAAGATGA CCTCAGCCCG TTACTATACC CCCTCCGGTC GTCTTATCCA GAAATACCAT
GCCCGGGAGG ATGGTTGGCG CAGCGTTATT CATGGCCCCG GCAGGCAGGA TTCCTCAAGG
GTTTTCTATA CCCTCAACCG CCGCAAGGTC TATGGAGGGG GCGGTATTCT TCCTGATATC
CCTGTCAGTG AGCCCGAGTT TGACGACTAT GAAGCGAAGC TTCAAAAAAA GGGAATGTTT
TTTCGCTATG CTTCGGCATA CCGCTCCCTT CATCCGAAAG CGCCCGGGCT TCCGATGAAG
CGCGAGGGTC TTCTCAAGGG TTTCAACGCA TTTCTGGAAG AGGAACGCTT TTCGTTTGAA
TCCGAGCCGG AGGTGCTTTT GGATCAGATG AAGCTTTCGA CAGCAGAATT AGGCCGTTCA
AGTCGTCTTG ATTCTCTTTT CAATGAGGTG GAGCGTGAGG TTTCCATGCT GACGGAGCGC
TACAGGGTGG ACGACAGGGA GCGGATTGCT CTGGCTGTGG AGAGGGAGAT TCTGCGCCAT
TACGATGAGG ATGCCGCTCG AAGGCTCTCT CTCGAACAGG ACCCGGTCGT CAAAAAAGCG
CTTGATGTTC TCAACTCCCC AAAACAGTAC CGCAGCGTTC TCAAACCCTG A
 
Protein sequence
MKRWYRQLLL SSVVVLTTAS FFFHPLFARQ SGMFSIVRSI DLLGDVFREV SLSYVDSIDV 
GKFMFAGIDG MLETLDPYTV FLDKEESVEL GEITSGQYGG IGVTIAGIDN GVYVVSVLDG
FSASRAGIKV GDQLISVDGI SIRQDSLETV KNLLKGTPGT SLNLVLRRYG AQANRKLTLT
RQEIRVNSIR YAGLIGDIGY FEMSSFGNRS AEELSAAISE LNAEAEGSGQ RMKAVILDLR
NNPGGLLDVA VDVTGLFVRQ GSEVVSIRGR LPESENRYVT KRDPVAGDLP LSVLINSKSA
SASEIVAGAI QELDRGVVIG ARSFGKGLVQ SIIPLPYDCK LKMTSARYYT PSGRLIQKYH
AREDGWRSVI HGPGRQDSSR VFYTLNRRKV YGGGGILPDI PVSEPEFDDY EAKLQKKGMF
FRYASAYRSL HPKAPGLPMK REGLLKGFNA FLEEERFSFE SEPEVLLDQM KLSTAELGRS
SRLDSLFNEV EREVSMLTER YRVDDRERIA LAVEREILRH YDEDAARRLS LEQDPVVKKA
LDVLNSPKQY RSVLKP