Gene Information |
Locus tag | Paes_1829 |
Symbol | |
ID | 6460241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | + |
Start bp | 1994842 |
End bp | 1997796 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642725813 |
Product | peptidase M16 domain protein |
Protein accession | YP_002016488 |
Protein GI | 194334628 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
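The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `lengths_from_coordinates` is hypothetical and is not part of the record.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1   # inclusive span on the replicon
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3             # includes the stop codon
    protein_length_aa = codons - 1           # stop codon encodes no residue
    return gene_length_bp, protein_length_aa


# Values taken from the Start bp and End bp fields of this record.
gene_bp, protein_aa = lengths_from_coordinates(1994842, 1997796)
print(gene_bp, protein_aa)
```

Under these assumptions the result agrees with the Gene Length (2955 bp) and Protein Length (984 aa) fields.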
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.661414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATCAC CTTTTTCCGG AAAAAAGCCC TTTTTAAAAG TATTTTTTAC TATCGGTTGC CTGATATGGC TCTCGGCATT TCATTTAATT TCCTGTAAAA CAATGAGTTC AGTGAAAGAC TATTCCTATA CGACGATTCC CGAAGATTCC CTTCATACAA GAATATACCG GCTTGAAAAC GGCCTGACCG TCTATATGAG TCCCTACCAC AATGAGCCGA GGATCTATAC ATCTATAGCG GTGAGAGCCG GCAGCAAAAA TGATCCGGCT GAGACAACAG GACTGGCGCA CTATCTTGAA CACATGCTCT TCAAGGGCAC TGACTCTATC GGCTCGCTGG ACTATGAACG GGAACATATA GAACTGCAGA AGATTATCGC TCTCTATGAA GAGTACCGTT CCACCGAGGA CCCCGATACA CGGGCGGAGA TCTATCGTCA GATCGACAGT ACATCAAATA TCGCAGCACA GTACGCCGTT CCCAATGAAT ATGACAAGTT GCTCAACTCG ATCGGAGCCC GCGGCACGAA TGCGTATACA TGGGTGGAAC AGACTGTCTA TCTTAACGAC ATTCCTGCCA ATCAGCTGGA CAAGTGGCTC TCAATCGAAT CTGAACGATT CCGTAATCCT GTCATGCGAC TTTTTCATAC CGAACTCGAG ACGGTGTATG AGGAAAAGAA CATGACCATG GACAGCGACA GCAGAAAAAT ATGGGAAGCC CTATACTCCG GCCTGTTTAC CAGGCATACC TATGGAACAC AGACTACCAT CGGTGAAGCG GAACATCTGA AAAATCCGTC CATACAAAAT GTCATCGACT ACTACAGAAA ATGGTATGTC CCCAACAATA TGGCGATCTG TCTTGCGGGC GATTTCGATC CCGACGAGAC CATACGCATG ATTGACGAAA AATTTTCCGT CCTCAAACCC CGCGAACTGC CCGTATTCAA CCCGCCGATC GAGGAAGAGC TCAGCCAGCC TGTTGCCTCG CATGTCTACG GGCCGGAATC TGAAGAGCTT GTCATCGGAT TCCGCTTCGA TGGCGCCGAT AGCCGGGATG CCGATTACCT GACGCTTCTG GATAAAATCC TTCATAACCA GACGGCAGGT CTCATCGACC TGAACCTCAA CCAGGAACAG CAGGTTCTGG AAGCAGGATC GATGGCCATT CTGATGAAAG ATTATTCGAC GCATATTCTG AGCGCCAAAC CTCGTGAGGG CCAAAGTCTG GACGACGTCA GAAACCTGCT GCTGGAACAG CTGGAACTGG TTAAAAAGGG TGAATTTCCC GACTGGCTGC CCGATGCGGT CATCAACGAT CTGAAAATAG AAGAACTCAA GACATGGGAG TCGAACCGTG GACGTACTGA AGGGTTTGTT GATGCGTTTA TTTGGGATAT GGACTGGGCT CGCTATGAGA ACCGTATCGA AAGAATGTCG GCCATTACCA AGGAAGAGAT CATGGCTTTT GCCCGGGAGC ATTACAAAGA GAATTACGTT GTCGTCTACA AACACCATGG GAAAGACAAG GAAAGCCCGA AAATCGCCAA GCCGCCTATC ACTCCTCTTT CGGTAAACCG TGACAGCACA TCGCTCTTTG CAAAAGAGCT CCTGGCAAGA AAAAACACTG CTATCGAACC GGAGTTTCTG GATTTCGACA AGGATATCTC GCGCGAGTCA ATCACCAGCG ACATCTCTCT CTACTCGGTT CCCAACCGCG AAAACGATCT TTTTTCGCTC TACTATGTGT TCGATATCGG CACCAACCAT AGCCGCCGGC TGGATATGGC GCTTGATTAC 
CTTACCTATC TGGGAACATC GACCGCAACG CCTGCAGCAT TCAACCGGGC GCTCTACCGG ATCGGGGCCA GCTTCTCGGT CTATACAGCT GATGACCATC TGTATATCAA ACTGTCGGGT CTTCAGGAGA ACTTCACTGC ATCGATACAA CTGCTTGAAT CGCTTCTTTC CGATGCCAGG CCTAATGATG AAGCCCTTGA GAAACTGAAA CAGGGCTTGT TGAAAGAACG ATCGGACGAC AAACTCTCTA AACGTAAAAT CCTGTTTGAG GCGATGAGCA GTTATGCTAA ATATGGCCCG CAATCGCCAT TTACGAACGT TCTGACCAAC ACTGAACTGC AGCAGATCTC ATCTGATGAA CTCCTGAGTG AAATCAGCAA TCTGATCCGT TATGAACACC GGGTACTCTA CTACGGGCCG CAAGAGCCGA AATCACTTGC AAAAGAGCTT CAAGGGCTTC GGCACATGCA GAAGGAGTTA ATCCCGGTTC CTGCAGAAAC CCCTTTTGAA GAAATCGCTC CGGAAGAGAA TCTCGTGTAT GTGGTCGATT ACGACATGAC ACAGGCCGAA ATCCTCATGC TCTCGCAGGA CAACCGGTAT AGCCCGGAAC AGATTCCTCT GATCACCCTG TTCAATGAAT ATTACGGAGG CGGAATGTCC TCTGTGGTTT TTCAGGAGCT CAGGGAAGCC AAAGCTCTTG CCTATTCGGT CTTTTCGATC TACCGCATCC CTAAAAACAA GGATGAACAT CACTATATTT TCAGCTATAT CGGAACACAG GCAGACAAGC TCCCTGAAGC ACTCTCCGGA CTGGGCGAGC TTATGGAAAA GCTGCCGGAA TCTCCGGAAC TCTTTGCTTC GGCAAAAGCA GGAATTCAGG AGAAAATCCG CACTGAACGG GTCAAAAGAG AAAAAATTCT CTTTACCCGT GAAGAAGCCT GCAAGCTCGG CATCGATTAT GATATCAGGA AAAATATCTA TGACCATGTC GGAAATATCA CCTTCGACGA TATCTCACAG TTCCATAAAG AGCGATTCAA CAGTAAAAAG AGGATCATGA TGGTCCTTGG ACGTATGGAA AACCTCGATA TGGAGACGCT CGGCCGTTAC GGTACCGTCA AAACGCTGAC ACTGGACGAG ATCTTCGGCT ACTGA
|
Protein sequence | MRSPFSGKKP FLKVFFTIGC LIWLSAFHLI SCKTMSSVKD YSYTTIPEDS LHTRIYRLEN GLTVYMSPYH NEPRIYTSIA VRAGSKNDPA ETTGLAHYLE HMLFKGTDSI GSLDYEREHI ELQKIIALYE EYRSTEDPDT RAEIYRQIDS TSNIAAQYAV PNEYDKLLNS IGARGTNAYT WVEQTVYLND IPANQLDKWL SIESERFRNP VMRLFHTELE TVYEEKNMTM DSDSRKIWEA LYSGLFTRHT YGTQTTIGEA EHLKNPSIQN VIDYYRKWYV PNNMAICLAG DFDPDETIRM IDEKFSVLKP RELPVFNPPI EEELSQPVAS HVYGPESEEL VIGFRFDGAD SRDADYLTLL DKILHNQTAG LIDLNLNQEQ QVLEAGSMAI LMKDYSTHIL SAKPREGQSL DDVRNLLLEQ LELVKKGEFP DWLPDAVIND LKIEELKTWE SNRGRTEGFV DAFIWDMDWA RYENRIERMS AITKEEIMAF AREHYKENYV VVYKHHGKDK ESPKIAKPPI TPLSVNRDST SLFAKELLAR KNTAIEPEFL DFDKDISRES ITSDISLYSV PNRENDLFSL YYVFDIGTNH SRRLDMALDY LTYLGTSTAT PAAFNRALYR IGASFSVYTA DDHLYIKLSG LQENFTASIQ LLESLLSDAR PNDEALEKLK QGLLKERSDD KLSKRKILFE AMSSYAKYGP QSPFTNVLTN TELQQISSDE LLSEISNLIR YEHRVLYYGP QEPKSLAKEL QGLRHMQKEL IPVPAETPFE EIAPEENLVY VVDYDMTQAE ILMLSQDNRY SPEQIPLITL FNEYYGGGMS SVVFQELREA KALAYSVFSI YRIPKNKDEH HYIFSYIGTQ ADKLPEALSG LGELMEKLPE SPELFASAKA GIQEKIRTER VKREKILFTR EEACKLGIDY DIRKNIYDHV GNITFDDISQ FHKERFNSKK RIMMVLGRME NLDMETLGRY GTVKTLTLDE IFGY
|
| |
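The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the reading frame starts at the first base and that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of the record or of any library.

```python
# Codon table laid out in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups shown in the record
    can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":          # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCGATCAC CTTTTTCCGG AAAAAAGCCC"))
```

Translating the first 30 bases gives the first ten residues of the listed protein sequence, and the last 15 bases (`ATCTTCGGCT ACTGA`) give its final four residues followed by the stop codon.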