Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2468 |
Symbol | |
ID | 5734349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3155562 |
End bp | 3157448 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279608 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001545234 |
Protein GI | 159898987 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACGCA TCGAAGCCTT GCTTGCCGCT CGTCAATTTG TTGTTCCACA ACGCGCTGGC GATTATCTTT ATTTTATTAG CGATCTGAAT GGCCGCCTAA GTTTATATCG GATGCTGCTG ACTGGCAGTG TGCCTGAGCC GTTGCTACCG CCCGATATTG CCTTGCAAAC GCCGCACCAT ATGGGCGGAA AATCGTTTGT AGTGCTGGCC GAATACAATC AAATTGTGGT TATGATCGAT AAAGATGGCG ATGAAAACTA CCAACCGCTG CGCATTCCCC TGACTGGGGG CTTCCCTGAG CCAGTTTTTG GCGATCAATT TGCTGATGCC CAAACCAACC TCTCCAAGCT TGACCCAAGC ACAGGGATTG GCTATTTGAA TGTTGCTTCA CGCGTGCGGC CCGAACTTAG TTGCTATCAA ATTAATGTGT TGACTGGCAC GAGCACTTTA CTGCATACTG GCCCAGATGG CCCATTTTAT GCGACCTCAG CTCCCGATCA ACAAACAATT ATCACGGTCG ATGGCTATGG CATTGGTGAT AGCGTGATTT ATCGCCAGCA GCTTGGTAGC ACCGAGCGCT CGGTAGTTTT CGGTACGCCG ATGGATCAAC GTACAACGCC AGTTGAGCCA AATGGTATGG GCTTTGGCGA ATGGGTCAAT GATCAGGTGG CCTTGGTGAG CACGAGCCTC TTTGATGATT GTTACAGTTT AGCCTTGTTG CGCCTTGATG GCGCGCAAAG CTTGGATTCC GTGACGATCG AGGGCTTGGT GCATAGCGGC CAAGGCGAAT TTGATCGCTT GTTACATCTG ACTGAGCAGC GCTTTTTGAT TGGCTACAAT ATCGATGGCT GTTCGTGGTG CTACGAAGCA GAGTTTGATC TAGCTGGCAA ACGTATGTTG GTTACCAAGG TTTTGGTTGG CCAAGCACCG CTCGACAATG GCGTTTTAGA GTCGATTGAC TATGATCAAG CGAGTGATAG CTTTGCGCTT TCGTTCTCGA CTGCGATTGC TCCAACCCAA ATCTACACGA TTAAGTCCAG CCAAGAGCTG CAACAGCACA CCACCGAACG AGTTTTGGGC ATCCCCGTTG AGCATTTAGC GGCTGGCGAA GATGCCTCAT TCAACTCACA TGATGGCCTG CGCATTTCGG CACGACTTTA TCGCCCAGCT CCAGCTTTGG GCTATGAAGG CCCACGCCCC TTGGTGTATT ACATCCATGG TGGCCCGCAA GGCCAAGAAC GCCCCGATTT TGCCTGGTTC TCGATGCCCT TGATTCAATT TTTGACCTTG AAGGGCTTTG CAGTCTTTGT GCCTAATGTG CGTGGCAGCA GTGGCTATGG CTTTAAGTAT ATGAACCACG TTACCCACGA TTGGGGTGGC CAAGATCGGC TTGATCATGT GCATGCCATG ACTAAGGTTT TAGTCAATGA CCCGTTGATC GATATCAAAC GAACTGGGGT GATGGGGCGT TCGTATGGCG GGTTTATGAC CCTGACCTTG CTGGGCCGTC ACCCTGAGCT TTGGCGAGCA GGCATCGATA TGTTTGGCCC CTACGATTTG CACACCTTTT CGGCGCGAGT GCCTGAAACT TGGAAGAGTT ACATGGCAAC CCAAGTTGGC GATCCTGTAA CTGAGCATGA TTTCCTAGTC GAGCGCTCGC CCAAAACCTA TATGCACAAC TTAGCTTGCC CATTATTGGT GACTCAAGGA GCCAACGATC CACGGGTGAT TGAGCGTGAA TCGAGCGAAG TGGTGCACGA ATTGCAAGCC TTGGGCAAAA ATGTTGATTA TCTGTTGTTC AGTGATGAAG GCCACGATGT TTTGAAGTAT GCCAACAAAG TGACATGCTA TAACCGGATC ACCGACTTTT TCAGCCAGCA TCTCTAG
|
Protein sequence | MPRIEALLAA RQFVVPQRAG DYLYFISDLN GRLSLYRMLL TGSVPEPLLP PDIALQTPHH MGGKSFVVLA EYNQIVVMID KDGDENYQPL RIPLTGGFPE PVFGDQFADA QTNLSKLDPS TGIGYLNVAS RVRPELSCYQ INVLTGTSTL LHTGPDGPFY ATSAPDQQTI ITVDGYGIGD SVIYRQQLGS TERSVVFGTP MDQRTTPVEP NGMGFGEWVN DQVALVSTSL FDDCYSLALL RLDGAQSLDS VTIEGLVHSG QGEFDRLLHL TEQRFLIGYN IDGCSWCYEA EFDLAGKRML VTKVLVGQAP LDNGVLESID YDQASDSFAL SFSTAIAPTQ IYTIKSSQEL QQHTTERVLG IPVEHLAAGE DASFNSHDGL RISARLYRPA PALGYEGPRP LVYYIHGGPQ GQERPDFAWF SMPLIQFLTL KGFAVFVPNV RGSSGYGFKY MNHVTHDWGG QDRLDHVHAM TKVLVNDPLI DIKRTGVMGR SYGGFMTLTL LGRHPELWRA GIDMFGPYDL HTFSARVPET WKSYMATQVG DPVTEHDFLV ERSPKTYMHN LACPLLVTQG ANDPRVIERE SSEVVHELQA LGKNVDYLLF SDEGHDVLKY ANKVTCYNRI TDFFSQHL
|
| |