Gene Hoch_3201 details


Gene Information

Locus tag: Hoch_3201
Symbol:
ID: 8545589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4411333
End bp: 4414107
Gene Length: 2775 bp
Protein Length: 924 aa
Translation table: 11
GC content: 70%
IMG OID: 646387868
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003267596
Protein GI: 262196387
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
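
The coordinate and length fields in this record agree with one another, and a short sketch makes the arithmetic explicit. It assumes that the start and end coordinates are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4411333, 4414107)
print(length)                  # 2775
print(protein_length(length))  # 924
```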


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.243991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCT TCCATCGTTT CGACGCTTTT CTCTTCGCAG CCGCTGTGTG TGCAGCGGCG 
TGCCTGGCGC TGCCCGGCTG CGGCGGAGCC CAGCACGCCG ATGTGCCGCC GGCCGCGGGT
GAGCAGGCCG CCGCGCCGCT CATCGACCGC GAGCTGTTCT TCGGCGACCC GCTGATCACG
GCCTCGCAGA TCTCGCCCGA CGGCCGCTTC ATCGCGTTCA TCAAGCCCTA CCGCGGGGTG
CGCAACCTGT GGGTGGTGGC GACCGGCGAG GACTTCGACT CGGCGCGGCC GCTGAGCGCC
AACGAGCGCG CGCTCGACGG CTACTTCTGG TCGTGCGACG GCCAGCATCT GCTGTTCGTC
AAGGACAGCG GCGGCGACGA GAATTACCAC GTCTACGCGG TCGCGCCCGA GGCCGAGGCC
GAGGCCGAGA CCGGCGTGCC CGCGGCCCGC GATCTCACCG ATATCGAGGG CGTGCGCGCG
ATCATCCTCG ACGTGCCCAA GAACCACCCG GGCGCGCTCA TCGTCGGCCT CAACGACCGC
GACCCGTCGC TGCACGACGT CTACCGGGTC GATATCGCCA CCGGCGAGCG CACCCTGGTG
TACGAAAACC GCGCGGGCGC GTTCGCGATC GGCTTCGACA GCGACGGCGG GCTGCGCCTG
GCCATGCGCC AGCTCCCCGA TGGCAGTCAG GAGCTGCTGC GCGTCGACGG CGACACGCTC
ACGCCCATCT ATCACACCAA GGTGGAGGAG ACGGCCACGC CGCTGCACGT GCTCGGCGAC
GGCAGCCGCG CGTACATCTC GACCAACCGC GGCGACGACG TCGACCTGCA GCGCCTGATG
CTGCTCGACC TGGCCACCGG CGAGACCGAG CTGGTCGAAG AAGACCCCGA GGCCGAGGTC
GACCTCGGCG GCGCCATCTT CCATCCGCAG ACCGAGGAGC TGCTGGCGAC CTACTACGTC
GGCGACTACG AGCGCATCTA CGCCGCCGAC GACGCCGTGG CCGCGGATCT GGCGTTTCTG
CGCGAGGAGC TGCCGCGCGG CACCCTCGCC ATCTCCTCGA CCACGCTCGA CCTCAGCACC
TGGCTGGTCA CGGTGTCGAG CGACGTCGAT CCCGGCTCGG TGTACGTCTA CCAGCGCGCC
GCGCGCTCGG TCGAGCTGTT GTATCGCTCG CGTCCCGACC TGCCGTCCGA ACACCTGGCC
GAGATGCAGC CGGTGCGCTA CCGGGCCCGC GACGGCCTGG CGATTCCCGG CTACCTGACG
CTGCCGCGCG GGGTCGAGGC CAAGGGCCTA CCCGTGGTCA TCCACCCGCA CGGCGGCCCC
TGGGCGCGTG ACGTGTGGGG CTACGACCCC TACGCGCAGT TTCTGGCCAA CCGCGGCTAC
GCGGTGCTGC AGCCCAACTT TCGCAGCTCG ACCGGCTACG GCAAGGCCTT TCTGCACGCC
GGCGATCGCA GCTTCGGCAC CGGCGCCATG CAGCACGACA TCAGCGACGG CGTGCAGTGG
CTGATCGATG AGGGCATCGC CGATCCCGAG CGGGTGTGCA TCTTCGGCGG CTCCTACGGC
GGCTACGCCA CGCTGGCGGG CGTGACCTTT ACGCCCGATC TGTACACCTG CGGCGTGCCG
TATGTGGCGC CCTCGAACCT GATCACGCTG ATCGAGTCGT TCCCCGCCTA CTGGCGGCCC
TTCATGCAGG GGACCTGGTA CGCGCGCGTC GGCGACCCGG CCATCGAGGC CGATCGCGCC
GACCTGCTGG CGCGCTCGCC GCTCGCGTTC GTCGACCGCA TCGAGGTGCC GCTGCTGGTG
GTCCACGGCG CCAACGACCC GCGGGTCAAG CAGCACGAAT CCGACCAGCT CGTGGTCGCG
CTGCGCGAAA AAGGACACGA GGTCGAATAC ATCGTGGCGC CGGACGAGGG CCACGGCTTC
CGCGGCAGCG AGAACCGCCT GGCCCTGGCC GTGGCGCTCG AGCGCTTCCT GGGCAAGCAC
CTCGGCGGTC GCGTCCAGGG CGAGGTCAAC CCGACCATCG CCGAGCGTCT GGCCGCGATC
ACGGTCGACG TCGCCGCGGT CGAGATGCCC GATCTCAGCG GCCTCGAGGC GGCCATGACG
GCGCCGCTGC CGGCGCTGCA CAGCGAGCGC ATCCGGCCGG CCCGCCTGGT CTACGCGGTG
GCCCGCGAGA TGGGCGAGCA GACCATGCGC ATGGAGGTCG TGCGCAGCAT CGCCGCGTGC
AAGGGCAAGA ACGCGCGCTG CTGGCGGGTC GCCGATCAGG CCAGCTCGCC GATGGGCGCG
CTCGAGGAGG TGCTGCTGCT CGATCGCGAA ACCCTGCTGC CCATCGAGCG CACCAGCAAG
AGCGCCAACC ACGAGCTGAG CCTGCGCTAC AGCGACAGCG CGGTCACCGG CTCGATGACC
GTGATGGGCA ATCGCCAGGA TCTCTCGGCG GAGCTCGCGG CGCCCGTGTT CGGCGACAGC
GCCGGCCTGG CCCTGGCACT GGCCGCGATG CCGCTGGCGG CCGATTACCG CACGCAGACG
CGCGAATTCG ATCCCATGAC GCAGAAGGTG TTGGCGTACG CGTTGGCCGT GAGCGGCAGC
GAGTCGATCG AGGTACCGGC CGGCAGCTTC GACACCTGGC GGGTGGAGAT GCGGCCCATC
GGGCGCAGCT CGGGCAAGCG CACGCTACAT GTGACCAAGG ATGCGCCGCA TCATCTGGTG
CGCGCGCTCT TCGAGGTGCC GGCCGAGATG GGCGGCGGCT CCCTGCGCAT CGAGCTGCAG
TCGAACCAGC GCTGA
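
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before it can be analysed. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates with the amino-acid assignments of NCBI translation table 11. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The sketch assumes an uppercase A/C/G/T sequence that begins at the first codon position, and it does not handle the alternative start codons that table 11 permits.

```python
# Amino acids for NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "ATGAAGATCT TCCATCGTTT CGACGCTTTT CTCTTCGCAG CCGCTGTGTG TGCAGCGGCG"
seq = join_blocks(first_row)
print(len(seq))        # 60
print(translate(seq))  # MKIFHRFDAFLFAAAVCAAA
```

Applied to the full gene sequence, the same helpers could be used to compare the result against the listed GC content and protein sequence.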
 
Protein sequence
MKIFHRFDAF LFAAAVCAAA CLALPGCGGA QHADVPPAAG EQAAAPLIDR ELFFGDPLIT 
ASQISPDGRF IAFIKPYRGV RNLWVVATGE DFDSARPLSA NERALDGYFW SCDGQHLLFV
KDSGGDENYH VYAVAPEAEA EAETGVPAAR DLTDIEGVRA IILDVPKNHP GALIVGLNDR
DPSLHDVYRV DIATGERTLV YENRAGAFAI GFDSDGGLRL AMRQLPDGSQ ELLRVDGDTL
TPIYHTKVEE TATPLHVLGD GSRAYISTNR GDDVDLQRLM LLDLATGETE LVEEDPEAEV
DLGGAIFHPQ TEELLATYYV GDYERIYAAD DAVAADLAFL REELPRGTLA ISSTTLDLST
WLVTVSSDVD PGSVYVYQRA ARSVELLYRS RPDLPSEHLA EMQPVRYRAR DGLAIPGYLT
LPRGVEAKGL PVVIHPHGGP WARDVWGYDP YAQFLANRGY AVLQPNFRSS TGYGKAFLHA
GDRSFGTGAM QHDISDGVQW LIDEGIADPE RVCIFGGSYG GYATLAGVTF TPDLYTCGVP
YVAPSNLITL IESFPAYWRP FMQGTWYARV GDPAIEADRA DLLARSPLAF VDRIEVPLLV
VHGANDPRVK QHESDQLVVA LREKGHEVEY IVAPDEGHGF RGSENRLALA VALERFLGKH
LGGRVQGEVN PTIAERLAAI TVDVAAVEMP DLSGLEAAMT APLPALHSER IRPARLVYAV
AREMGEQTMR MEVVRSIAAC KGKNARCWRV ADQASSPMGA LEEVLLLDRE TLLPIERTSK
SANHELSLRY SDSAVTGSMT VMGNRQDLSA ELAAPVFGDS AGLALALAAM PLAADYRTQT
REFDPMTQKV LAYALAVSGS ESIEVPAGSF DTWRVEMRPI GRSSGKRTLH VTKDAPHHLV
RALFEVPAEM GGGSLRIELQ SNQR