Gene Information |
Locus tag | Haur_1427 |
Symbol | |
ID | 5733335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1646623 |
End bp | 1648671 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278565 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001544199 |
Protein GI | 159897952 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
| |
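The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon. Both are common conventions, but the record does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1646623, 1648671)
print(length)                     # 2049
print(protein_length_aa(length))  # 682
```

Both results agree with the Gene Length (2049 bp) and Protein Length (682 aa) fields. The strand field does not affect these lengths.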
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGT TGCGCAACTG GCAAGCCACG AGTCTGTTAG AGCTACGTTT TATCTCCGAT GCCCAAATTT CGCCCAACGG CGAGCAAATT GCCTTTGTGG AAACCTGGAT CGAGGAAACC ACCAACAAGA GTGGCACTCG CAAGCCCGAT TATCGTTCAG CAATTATGTT AATTCAGGCC GATGGTGGCA TACCGCAACG CATGACGTTC AGCGTTTCAG GTCGCGATAG CTCGCCACGC TGGTCGCCCG ATGGCAGTAA ATTGGCCTTT ATCTCAACTC GTGATGCAGG CGTGGCTCAA CTGTTTGTGC TCGATTTGGC TCGTGGTGGC GAGGCCCAGC AACTCACCAG CCTTGGTTAT GGTGTGGCCG AAATCAACTG GCGCCCTGAT AGCCAAGCCT TGGCGTTTAT CTCGCGCGGA GCCAAAGCCA AAGCGCAAAC CCATGTCGAA AGCCTGCGCG ACGAAAAAAT TATCGAACGC TTGCCATTCA AATTCGATGG CGTGGGCTAT TTACAGCCAG AATACGCGCA AATTTGGCTG GTTGAATTAG GGCAAGAACC AAACCAAATC ACCGATCAAG CCTTTGATCA CGCTGATCCG GCTTGGTCGC CTGATGGTAG CGAGTTGGCT TTTATCACGG TTTCGCGGGC CGAATTAGAA CACACCCGCC AAGCTGATAT GTACCTACTA AATGTTGCAG CTGGTACATC GCGCTGTTTA ACTAATGCGC TGGGTCCGGT CTATCACCCA ACCTGGTCGC CCGATGGCAG CCAAATCGCC TATATCGGCC ATGATCAACA TACGGGCAAT GCATCAAATG AGGCTTTGTG GTGTGTTAGT CGAGCGGGCG GTGATGCCCG TTTGCTCAGT GTTGGCTTTG AATATGGGCT TGAAAATAGC GTGATCAGCG ATGCACGCAT TGGGCGCTTT CCTTATCGGC CTTATTGGCG CGATAATGGC ATTTACTGTT TGGCAACCCG TGCCGCCCGC ACCCGTGCTT ATCGCTACAG CGACGGCGCG ATGCATGAAC TTACGCCTGA GGATAACCCA AGCATTTCGG GCTACAGCCA ATCGTTGAAC AAGCGCACTG CCTTCACCGC TGGCACTGCC ACCCAACTCG AAGCGCTGTA TATGGGCGAT GCTGATGGCA GTATTCATCT GCTCTACGAC CCCAATGCTG CATTACTCAC CAGCATCCAA ACCATCGAGC CAGAACGCTT TACCTACAAT AGCTTCGATG GCTTGGAGAT CGAAGGCTGG GTGATCAAGC CTGTCGGATT CAGCCAAGGC CAGCAATATC CCTCGTTGTT GTATATTCAT GGTGGCCCGC ATAGCGCCTA TGGTCACAAT TTTATGCACG AATTTCAGGT GCTGGCGGCG GCTGGCTATG GCGTGATTTA CACCAATCCA CGCGGTGGCA CAGGCTATGG TCAGCGCTTC CGCGCGTTGG TACGCCAAGA TTTTGGCGGC GATGATTATC GCGATCTCAT GGCTGCTGCC GATTTGGCCG AAACCTGGGA TTGGATCGAC AGCAAACGGA TGGGTGTGCT TGGTGGCTCG TATGGCGGCT ATATGACCAA CTGGATCATC AGCCATACCG AGCGCTTTGC CGCCGCCAAT ACCCAACGCT GTATCTCCAA TTTAATGAGC TTTTTCGGCA CATCGGATAT TGGGCCATAT TTCGGCGAAG ATGAATTCGG TGGCAAGCCT TGGGCTGATA TCGATAAATT TATGGAACGC TCACCGATTC GTTATGTCAA TTCAATTAAC ACCCCATTGT TAATTTTGCA TTCCGATGAG 
GACCATCGCT GCCCAGTCGA GCAAGCCGAG CAACTGTATA CCGCGCTCAA AGTGCTGGAT AAACCTGTGC GTTTTGTCCG CTTCCCGCGC GAAGGCCACG AACTTTCACG TAGCGGCGAG CCATTGCATC GCATCGCACG AATCGAATAT ATTCTGGATT GGTTCGGCCA TTATTTGCAA GGCCACGAAC TCAAGCCCGC TGATCACTTC CGGCGCAGCG TTGCTGGCGA ATGGCAAAGC CGCCAATAA
|
Protein sequence | MAELRNWQAT SLLELRFISD AQISPNGEQI AFVETWIEET TNKSGTRKPD YRSAIMLIQA DGGIPQRMTF SVSGRDSSPR WSPDGSKLAF ISTRDAGVAQ LFVLDLARGG EAQQLTSLGY GVAEINWRPD SQALAFISRG AKAKAQTHVE SLRDEKIIER LPFKFDGVGY LQPEYAQIWL VELGQEPNQI TDQAFDHADP AWSPDGSELA FITVSRAELE HTRQADMYLL NVAAGTSRCL TNALGPVYHP TWSPDGSQIA YIGHDQHTGN ASNEALWCVS RAGGDARLLS VGFEYGLENS VISDARIGRF PYRPYWRDNG IYCLATRAAR TRAYRYSDGA MHELTPEDNP SISGYSQSLN KRTAFTAGTA TQLEALYMGD ADGSIHLLYD PNAALLTSIQ TIEPERFTYN SFDGLEIEGW VIKPVGFSQG QQYPSLLYIH GGPHSAYGHN FMHEFQVLAA AGYGVIYTNP RGGTGYGQRF RALVRQDFGG DDYRDLMAAA DLAETWDWID SKRMGVLGGS YGGYMTNWII SHTERFAAAN TQRCISNLMS FFGTSDIGPY FGEDEFGGKP WADIDKFMER SPIRYVNSIN TPLLILHSDE DHRCPVEQAE QLYTALKVLD KPVRFVRFPR EGHELSRSGE PLHRIARIEY ILDWFGHYLQ GHELKPADHF RRSVAGEWQS RQ
|
| |
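The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to turn such a field into a contiguous string and compute its GC percentage, which can then be compared with the GC content field. The function names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(field_value: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence field."""
    return "".join(field_value.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two blocks of the gene sequence, used here as a short illustration.
fragment = clean_sequence("ATGGCCGAGT TGCGCAACTG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 60.0
```

Applying the same two functions to the full Gene sequence field should give a length equal to the Gene Length field if the field was copied completely.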