Gene Information |
Locus tag | Haur_2675 |
Symbol | |
ID | 5734540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3429561 |
End bp | 3431936 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279817 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001545441 |
Protein GI | 159899194 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
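
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in Protein Length; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3429561
END_BP = 3431936
GENE_LENGTH_BP = 2376
PROTEIN_LENGTH_AA = 791


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both checks pass for this record under the stated assumptions.
print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the reported gene length (2376 bp) and protein length (791 aa) agree with the coordinates.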
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.300574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGAC GCATCAGCTT ACTGCTGGCT TTGTTGGCGA TCTTGTTTAC GACAATGCCT GCCTCGGCAG AAATTGTCAA TGTCCCTTAC TTTGAAGAAC CGGAATTGCT GCTTGCTGAA GATGAGTTGC TCAAACTCGC CCAACTGCAA GACAACACCT ACCCCAGTGT TGGCACCAAA ATCAGCCCCG ATGACACGAC GGTGGTCATT GGCAACTATC GTTATAGCGA CACTGGTTCA GCTTTCTTGA ATGTTGTTGA TGGCTCGATT GTGCCAATTC AGCCATTACA ACTCCCCGAA GATAGCGATT TCTTCCCTTT GGCAGCCACC GAAATGGTTT GGCTTGATAA TGACAACATT GGTCAAGTCT TATACGACCT GTTTATGGGT GGGATGGTGC TGTCCATTAA TCGCTACAGT GGCCAAATTA GCTTGTATCC AGTTAATTTG CCATTCTTAC CGTTATCGAT TGCGCCCAAT GGCTCGCGCT TGTTGGTGGT CACCTTCGAA GCATCCGAAC TTGAAGCAAT GCGCCAATCG CCCGATTCGG TGAAGTTGCC ATTCAACATC GAAGCCCCAA AAACCACCAT GGAACGCACC ATGCCCAAGG ATCGGATTGC CTATTACAGT CATACCGATT CACGCCGCCA CATGAGCGAA GAAACCCTTG ATTTGGCAAT CTTCGATCTA ACGACTGGCG CATTAACGCC ACTCTACAGC GTGCCTGATC ATACCTTGCT CTACGATTAT GCATGGTCAA AAGATGGCTC AAAATTCGCT TTGATTCGCG ATACGGTGAT TTTGGGCGAA GGTTTTGGTG AAAAGCGCTT GGTTGACGTG ATGACCCAAG ATGCACTTGG CGGTCTTTCA CCCAAGGATA ACCCACTCTT CACCGAAAAT GTGCTCGATA TCTTCGATCT GACGACTGGC AACTTCCAGC CAGAAGCATG GCGAGCGGTT GATGGCGATG GTCGCGTTGT TCGCGACATC GAATGGAGCA CCGACGGCCA ACGCTACATC GTGCGCTTGG AACGTCCAGC TCAAATCGCT GGTCGCCCAC ACCCAACCTA TATCTTCCCA GATATGGCCA GCTATCAATT CCGTAGCGTC GATGGCACGT TGCAACGCGA ATTGTATGCA CCTGAATTGC AAACCCCTGA AGCTTCAGGC TTCTTCTATC TCTCACCAGA TGAAGTCTTG TTCATCACTG CCAATGGCAC CAACCAAGCC TTGTACTACT TCAACCAAGG CTCAGGCGAG TTCCGCAAAG TGTCGAACAT GGATGGCACC TATTTTGGTG TAACCACCAC GAATATGAGC CGTCAGTTGA TCTTCAGCTA CATGTCGTTC AGCCAGCCAG CCGATATTTA TCGTTTGAAC TGGGATGGCC AAGCGCTCAG TCGCTTGACC TGGGCCAACG CCGAGCTTGA AAAAATTAAT AATGTTCGAG TTGATAGCGT TTCGTTTACC GTCAGCAGTG GCGCACAACG CAATGGCTTC TTAATTCAAC CTGCTGGCGC TGAGTTCCCA CCAAAAGATG TGCCAATCGT GATGTGGCAA GAAGGTGGAC CACGCGCTAC AATGACCCAA TTCTTCGCGA CCAACACTGA AAATCCCTAC AACCTGTTGC CAAACTTTGG CATCGCGGTG TTGTATGTGC CACTGCCTGG TCGCTTGGGC TTCGGGCCAG AATTCTTGAA CGCCTTGGCT GATAATGACA ACTTCGGCAA GATCGATATC GACGAAGGTG CCGAAATTAT TGGCCAAGCA ATTTCACGCG GTTGGACCTC ACAAAATAAG 
GTTGGGGTAA CTGGCTGTTC ATACGGCGGC TATTTCAGCG CCCAAAGTAT CACCCGCCAC CCAACTCGCT ACGCTGCTGC CAACCCACAA TGCACCTTGC TCAACAACGC CAATGAATTC CACTTTGGCT TGGGGCCATT AATTGCCTAC CTCGAAGGTG GCACACCAAT GGATAAGCCC GCTGAATATG CCGCTGATTC GCCATTGAAT CGCGCTGATC GCGTGCGCAC TCCAACCCTG TTGTTCCATG GCGAATACGA CTTCTTGCCA GTCAAGTATG CCGTTGACTT CCATGACCAA ATCGAAATTC AAAAGCACCG CGTCAAGTTG GTGACCTATG AACTCGAAGG CCATGGTTTG AGCGACCCTG CCAACCAATA TCGCGCTGCC CAAGAGCAAA TCTTGTGGTT CCGCCAATAT TTGAGCGGTA GCCCAAGTGT CGCTGCCGAG CCAGTCGTGA CCGATGCCGC AACCATGACA GTGCCTGAAA CTACTGATGT AATCGTGTTT ACGGAAACCG CTACGTTTGC AGCACCAAGC TTGCAATTCG GCAAGAATCT GATTACTGCT GAATAA
Protein sequence | MRRRISLLLA LLAILFTTMP ASAEIVNVPY FEEPELLLAE DELLKLAQLQ DNTYPSVGTK ISPDDTTVVI GNYRYSDTGS AFLNVVDGSI VPIQPLQLPE DSDFFPLAAT EMVWLDNDNI GQVLYDLFMG GMVLSINRYS GQISLYPVNL PFLPLSIAPN GSRLLVVTFE ASELEAMRQS PDSVKLPFNI EAPKTTMERT MPKDRIAYYS HTDSRRHMSE ETLDLAIFDL TTGALTPLYS VPDHTLLYDY AWSKDGSKFA LIRDTVILGE GFGEKRLVDV MTQDALGGLS PKDNPLFTEN VLDIFDLTTG NFQPEAWRAV DGDGRVVRDI EWSTDGQRYI VRLERPAQIA GRPHPTYIFP DMASYQFRSV DGTLQRELYA PELQTPEASG FFYLSPDEVL FITANGTNQA LYYFNQGSGE FRKVSNMDGT YFGVTTTNMS RQLIFSYMSF SQPADIYRLN WDGQALSRLT WANAELEKIN NVRVDSVSFT VSSGAQRNGF LIQPAGAEFP PKDVPIVMWQ EGGPRATMTQ FFATNTENPY NLLPNFGIAV LYVPLPGRLG FGPEFLNALA DNDNFGKIDI DEGAEIIGQA ISRGWTSQNK VGVTGCSYGG YFSAQSITRH PTRYAAANPQ CTLLNNANEF HFGLGPLIAY LEGGTPMDKP AEYAADSPLN RADRVRTPTL LFHGEYDFLP VKYAVDFHDQ IEIQKHRVKL VTYELEGHGL SDPANQYRAA QEQILWFRQY LSGSPSVAAE PVVTDAATMT VPETTDVIVF TETATFAAPS LQFGKNLITA E
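
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first three blocks of the gene sequence are used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a cleaned nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGCGACGAC GCATCAGCTT ACTGCTGGCT"
fragment = clean_sequence(first_blocks)

print(len(fragment))   # 30
print(fragment[:3])    # ATG
print(round(gc_fraction(fragment), 2))
```

Pasting the full Gene sequence field into `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` of that string can be compared with the reported GC content.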