Gene Information |
Locus tag | Hoch_0129 |
Symbol | |
ID | 8542506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 198659 |
End bp | 200839 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646384923 |
Product | Prolyl oligopeptidase |
Protein accession | YP_003264663 |
Protein GI | 262193454 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
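The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp / End bp rows of this record.
gene_bp, protein_aa = feature_lengths(198659, 200839)
print(gene_bp, protein_aa)  # 2181 726
```

The result agrees with the Gene Length (2181 bp) and Protein Length (726 aa) rows. The strand field does not affect the length, only which genomic strand is read.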
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTGC CGCCCTCGCC TTCGGAGCGC GCCTTCCCCC CCGAGCAACA CGCCACCCAG GACGAGGAGG TCGAGTTGGT GTATCCCGAA ACCCGCCGCG AGGACACGCG CGAGACCATC CACGGCGTCG AAGTCGCCGA TCCCTATCGC TGGCTCGAAA ACGCCGACGA CCGCATGGTC GCGTCGTGGA TGTTGGCCCA GGACGGCCTG GCGCGCTCGT ATCTCGAGGC GCTGCCGGCG CGTGACGGCC TGGCCGCGCG GCTGCGCGAG CTCAACTACT ACGACGCCAT CTCGGTGCCG GCCAAGCGCG GCGAGCGCTA CTTCTTCACC CGCCGCCACG CCGACAAGGA GAAGTCGATC CTGTACTGGC GCCAGGGCCA GGGCCAGGAG CAGGTGCTCA TCGATCCCAA CACCTTGAGC GACGACGGCT CGACCTCGCT CGGCGGCTGG TTTCCCAACC GCGACGGCAC CAAGCTGGCG TACAAGCTCA ATCCCAACAA CGCCGACGCG GCCACGATGT ACGTGATGGA CGTCGCCAGC GGCGAGACCT CGACGGTCGA CGTCATCGAC GGCGCCAAGT ACGCGAGCGC GGCCTGGAAG CCCGACGGCA GCGGCTTCTA CTACACCCGC CTGCCGAGCG ATCCCGACAT CCCGATCGCC GACCTGCCGG CGCGCGCCGA GATCCGCTAC CACGAGCTGG GCAGCGATCC CGCCGGCGAC GAGCTGGTGT ACCCGGCCAC CGGCGATCCC GGCACCTTCC TCAGCGTGTC GCTGTCCCGC GACGGCCGCT ACCTGATGGT GAGCGTGCAG CACGGCTGGA ACTCGAGCGA CGTGTACTTC AAGGACCTGC GCCGCGGCCG CGACGCCGGC TTCGAGCCGC TGGTCACCGG CGAGAAGGCG CACTTCAGCG TGCGCGCCTG GCGCGGTGAC TTCTACGTGC TCACCAACCA CGAGGCCCCG CGCTACCGCA TCTTCAAGGT CGATCCGCGG CGCCCGCGCA TGTCGCGCTG GCGCGAGATC GTGCCCGAGA GCGAGGCAGT GATCGACAGC TTCAACATCG TCGGCAACCG CCTCGTGGTC ACCTACCTGA GCAACGCCTA CAGCCGCATG GAGGTGCGTT CGCTCAGCGG CCAGCGCATC CGCGAGGTCA CCCTGCCGGA AGTCGGCAGC GTGTCCAACA TGGCCGGCAA CGAGGACGCG GACGAGGCCT TCTACGCCTT CACCTCGTTC ACCTCACCGC CGCAGATCTA CCGCACCTCG GTGGCCACCG GCGAGAGTGA GCTGTGGTTT GAATTCGACC TGCCGGTCGA CACCAGCCAG TTCACGGCCG AGCAGGTCTG GTACCCGTCG CGCGACGGCA CGCAGATCTC GATGTTCCTC ATCCGCCGCA AGGACCTCAG CAGCGACCAG GCCCATCCCA CCATCCTCTA CGGCTACGGC GGCTTCAACG TCAACCTCAC GCCCGCGTTC TCGACCAACA TCGTCGCCTG GGTCGAGCGC GGCGGCATCT ACGCCATCCC CAACCTGCGC GGCGGCGGCG AGTACGGCGA GGAGTGGCAC AAAGCCGGGA TGCGGCTCAA CAAGCAGAAC ACCTTCGACG ACTTCCTGGC CGCGGCCGAT TTCCTCATCG AGACCGGCTG GACCTCGCCG CAGCGGCTGG CGATCTGGGG CGGCTCCAAC GGCGGCCTGC TGGTCGGCGC GGCCATGACC CAGGCGCCCG AGAAGTTCGC GGCCGTGGTG TGCGCGGTGC CGCTGCTCGA CATGCTCCGC TACCACCTCT TCGGCAGCGG CAAGACCTGG 
ATCCCCGAGT ACGGCTCGGC CGACGACGCC GCCGAGTTCT CGGTGCTCAG CGGCTTCTCG CCGTATCACC GCGTGGTCGA GGGCACCGCG TACCCGGCGC TGCTGATGCT CAGCGCCGAC AGCGACGACC GCGTCGATCC CATGCACGCG CGCAAGTTCA CGGCCGCGGT GCAGTGGGCC AGCAGCAGCG ACGAGCCGGC GATCATGCGC ATCGAGCACA ACTCCGGCCA CGGCGGCGCC GACATGGTGC GGCAACTGGT CGAGCGCAAC GCCGACAGCT TCGCCTTCGT CGCCGACGAG CTGGGCATGG CGGCCGCGCC GCCGCCCGCG CCGGCCGAAA CCGACCTGGT GTCCGACGGC GCCGAAGGAG CGGCGCAATG A
|
Protein sequence | MTVPPSPSER AFPPEQHATQ DEEVELVYPE TRREDTRETI HGVEVADPYR WLENADDRMV ASWMLAQDGL ARSYLEALPA RDGLAARLRE LNYYDAISVP AKRGERYFFT RRHADKEKSI LYWRQGQGQE QVLIDPNTLS DDGSTSLGGW FPNRDGTKLA YKLNPNNADA ATMYVMDVAS GETSTVDVID GAKYASAAWK PDGSGFYYTR LPSDPDIPIA DLPARAEIRY HELGSDPAGD ELVYPATGDP GTFLSVSLSR DGRYLMVSVQ HGWNSSDVYF KDLRRGRDAG FEPLVTGEKA HFSVRAWRGD FYVLTNHEAP RYRIFKVDPR RPRMSRWREI VPESEAVIDS FNIVGNRLVV TYLSNAYSRM EVRSLSGQRI REVTLPEVGS VSNMAGNEDA DEAFYAFTSF TSPPQIYRTS VATGESELWF EFDLPVDTSQ FTAEQVWYPS RDGTQISMFL IRRKDLSSDQ AHPTILYGYG GFNVNLTPAF STNIVAWVER GGIYAIPNLR GGGEYGEEWH KAGMRLNKQN TFDDFLAAAD FLIETGWTSP QRLAIWGGSN GGLLVGAAMT QAPEKFAAVV CAVPLLDMLR YHLFGSGKTW IPEYGSADDA AEFSVLSGFS PYHRVVEGTA YPALLMLSAD SDDRVDPMHA RKFTAAVQWA SSSDEPAIMR IEHNSGHGGA DMVRQLVERN ADSFAFVADE LGMAAAPPPA PAETDLVSDG AEGAAQ
|
| |
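As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates a nucleotide string and computes its GC fraction using only the standard library. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons; because this gene begins with ATG, no special handling of alternative starts is included here, which is a simplifying assumption.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces/newlines used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGACGGTGC CGCCCTCGCC TTCGGAGCGC"
print(translate(prefix))  # MTVPPSPSER
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value that the GC content row reports as a rounded percentage.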