Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3201 |
Symbol | |
ID | 8545589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4411333 |
End bp | 4414107 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646387868 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003267596 |
Protein GI | 262196387 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.243991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCT TCCATCGTTT CGACGCTTTT CTCTTCGCAG CCGCTGTGTG TGCAGCGGCG TGCCTGGCGC TGCCCGGCTG CGGCGGAGCC CAGCACGCCG ATGTGCCGCC GGCCGCGGGT GAGCAGGCCG CCGCGCCGCT CATCGACCGC GAGCTGTTCT TCGGCGACCC GCTGATCACG GCCTCGCAGA TCTCGCCCGA CGGCCGCTTC ATCGCGTTCA TCAAGCCCTA CCGCGGGGTG CGCAACCTGT GGGTGGTGGC GACCGGCGAG GACTTCGACT CGGCGCGGCC GCTGAGCGCC AACGAGCGCG CGCTCGACGG CTACTTCTGG TCGTGCGACG GCCAGCATCT GCTGTTCGTC AAGGACAGCG GCGGCGACGA GAATTACCAC GTCTACGCGG TCGCGCCCGA GGCCGAGGCC GAGGCCGAGA CCGGCGTGCC CGCGGCCCGC GATCTCACCG ATATCGAGGG CGTGCGCGCG ATCATCCTCG ACGTGCCCAA GAACCACCCG GGCGCGCTCA TCGTCGGCCT CAACGACCGC GACCCGTCGC TGCACGACGT CTACCGGGTC GATATCGCCA CCGGCGAGCG CACCCTGGTG TACGAAAACC GCGCGGGCGC GTTCGCGATC GGCTTCGACA GCGACGGCGG GCTGCGCCTG GCCATGCGCC AGCTCCCCGA TGGCAGTCAG GAGCTGCTGC GCGTCGACGG CGACACGCTC ACGCCCATCT ATCACACCAA GGTGGAGGAG ACGGCCACGC CGCTGCACGT GCTCGGCGAC GGCAGCCGCG CGTACATCTC GACCAACCGC GGCGACGACG TCGACCTGCA GCGCCTGATG CTGCTCGACC TGGCCACCGG CGAGACCGAG CTGGTCGAAG AAGACCCCGA GGCCGAGGTC GACCTCGGCG GCGCCATCTT CCATCCGCAG ACCGAGGAGC TGCTGGCGAC CTACTACGTC GGCGACTACG AGCGCATCTA CGCCGCCGAC GACGCCGTGG CCGCGGATCT GGCGTTTCTG CGCGAGGAGC TGCCGCGCGG CACCCTCGCC ATCTCCTCGA CCACGCTCGA CCTCAGCACC TGGCTGGTCA CGGTGTCGAG CGACGTCGAT CCCGGCTCGG TGTACGTCTA CCAGCGCGCC GCGCGCTCGG TCGAGCTGTT GTATCGCTCG CGTCCCGACC TGCCGTCCGA ACACCTGGCC GAGATGCAGC CGGTGCGCTA CCGGGCCCGC GACGGCCTGG CGATTCCCGG CTACCTGACG CTGCCGCGCG GGGTCGAGGC CAAGGGCCTA CCCGTGGTCA TCCACCCGCA CGGCGGCCCC TGGGCGCGTG ACGTGTGGGG CTACGACCCC TACGCGCAGT TTCTGGCCAA CCGCGGCTAC GCGGTGCTGC AGCCCAACTT TCGCAGCTCG ACCGGCTACG GCAAGGCCTT TCTGCACGCC GGCGATCGCA GCTTCGGCAC CGGCGCCATG CAGCACGACA TCAGCGACGG CGTGCAGTGG CTGATCGATG AGGGCATCGC CGATCCCGAG CGGGTGTGCA TCTTCGGCGG CTCCTACGGC GGCTACGCCA CGCTGGCGGG CGTGACCTTT ACGCCCGATC TGTACACCTG CGGCGTGCCG TATGTGGCGC CCTCGAACCT GATCACGCTG ATCGAGTCGT TCCCCGCCTA CTGGCGGCCC TTCATGCAGG GGACCTGGTA CGCGCGCGTC GGCGACCCGG CCATCGAGGC CGATCGCGCC GACCTGCTGG CGCGCTCGCC GCTCGCGTTC GTCGACCGCA TCGAGGTGCC GCTGCTGGTG GTCCACGGCG CCAACGACCC GCGGGTCAAG CAGCACGAAT CCGACCAGCT CGTGGTCGCG CTGCGCGAAA AAGGACACGA GGTCGAATAC ATCGTGGCGC CGGACGAGGG CCACGGCTTC CGCGGCAGCG AGAACCGCCT GGCCCTGGCC GTGGCGCTCG AGCGCTTCCT GGGCAAGCAC CTCGGCGGTC GCGTCCAGGG CGAGGTCAAC CCGACCATCG CCGAGCGTCT GGCCGCGATC ACGGTCGACG TCGCCGCGGT CGAGATGCCC GATCTCAGCG GCCTCGAGGC GGCCATGACG GCGCCGCTGC CGGCGCTGCA CAGCGAGCGC ATCCGGCCGG CCCGCCTGGT CTACGCGGTG GCCCGCGAGA TGGGCGAGCA GACCATGCGC ATGGAGGTCG TGCGCAGCAT CGCCGCGTGC AAGGGCAAGA ACGCGCGCTG CTGGCGGGTC GCCGATCAGG CCAGCTCGCC GATGGGCGCG CTCGAGGAGG TGCTGCTGCT CGATCGCGAA ACCCTGCTGC CCATCGAGCG CACCAGCAAG AGCGCCAACC ACGAGCTGAG CCTGCGCTAC AGCGACAGCG CGGTCACCGG CTCGATGACC GTGATGGGCA ATCGCCAGGA TCTCTCGGCG GAGCTCGCGG CGCCCGTGTT CGGCGACAGC GCCGGCCTGG CCCTGGCACT GGCCGCGATG CCGCTGGCGG CCGATTACCG CACGCAGACG CGCGAATTCG ATCCCATGAC GCAGAAGGTG TTGGCGTACG CGTTGGCCGT GAGCGGCAGC GAGTCGATCG AGGTACCGGC CGGCAGCTTC GACACCTGGC GGGTGGAGAT GCGGCCCATC GGGCGCAGCT CGGGCAAGCG CACGCTACAT GTGACCAAGG ATGCGCCGCA TCATCTGGTG CGCGCGCTCT TCGAGGTGCC GGCCGAGATG GGCGGCGGCT CCCTGCGCAT CGAGCTGCAG TCGAACCAGC GCTGA
|
Protein sequence | MKIFHRFDAF LFAAAVCAAA CLALPGCGGA QHADVPPAAG EQAAAPLIDR ELFFGDPLIT ASQISPDGRF IAFIKPYRGV RNLWVVATGE DFDSARPLSA NERALDGYFW SCDGQHLLFV KDSGGDENYH VYAVAPEAEA EAETGVPAAR DLTDIEGVRA IILDVPKNHP GALIVGLNDR DPSLHDVYRV DIATGERTLV YENRAGAFAI GFDSDGGLRL AMRQLPDGSQ ELLRVDGDTL TPIYHTKVEE TATPLHVLGD GSRAYISTNR GDDVDLQRLM LLDLATGETE LVEEDPEAEV DLGGAIFHPQ TEELLATYYV GDYERIYAAD DAVAADLAFL REELPRGTLA ISSTTLDLST WLVTVSSDVD PGSVYVYQRA ARSVELLYRS RPDLPSEHLA EMQPVRYRAR DGLAIPGYLT LPRGVEAKGL PVVIHPHGGP WARDVWGYDP YAQFLANRGY AVLQPNFRSS TGYGKAFLHA GDRSFGTGAM QHDISDGVQW LIDEGIADPE RVCIFGGSYG GYATLAGVTF TPDLYTCGVP YVAPSNLITL IESFPAYWRP FMQGTWYARV GDPAIEADRA DLLARSPLAF VDRIEVPLLV VHGANDPRVK QHESDQLVVA LREKGHEVEY IVAPDEGHGF RGSENRLALA VALERFLGKH LGGRVQGEVN PTIAERLAAI TVDVAAVEMP DLSGLEAAMT APLPALHSER IRPARLVYAV AREMGEQTMR MEVVRSIAAC KGKNARCWRV ADQASSPMGA LEEVLLLDRE TLLPIERTSK SANHELSLRY SDSAVTGSMT VMGNRQDLSA ELAAPVFGDS AGLALALAAM PLAADYRTQT REFDPMTQKV LAYALAVSGS ESIEVPAGSF DTWRVEMRPI GRSSGKRTLH VTKDAPHHLV RALFEVPAEM GGGSLRIELQ SNQR
|
| |