Gene Hoch_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0129 
Symbol 
ID8542506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp198659 
End bp200839 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content69% 
IMG OID646384923 
ProductProlyl oligopeptidase 
Protein accessionYP_003264663 
Protein GI262193454 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGC CGCCCTCGCC TTCGGAGCGC GCCTTCCCCC CCGAGCAACA CGCCACCCAG 
GACGAGGAGG TCGAGTTGGT GTATCCCGAA ACCCGCCGCG AGGACACGCG CGAGACCATC
CACGGCGTCG AAGTCGCCGA TCCCTATCGC TGGCTCGAAA ACGCCGACGA CCGCATGGTC
GCGTCGTGGA TGTTGGCCCA GGACGGCCTG GCGCGCTCGT ATCTCGAGGC GCTGCCGGCG
CGTGACGGCC TGGCCGCGCG GCTGCGCGAG CTCAACTACT ACGACGCCAT CTCGGTGCCG
GCCAAGCGCG GCGAGCGCTA CTTCTTCACC CGCCGCCACG CCGACAAGGA GAAGTCGATC
CTGTACTGGC GCCAGGGCCA GGGCCAGGAG CAGGTGCTCA TCGATCCCAA CACCTTGAGC
GACGACGGCT CGACCTCGCT CGGCGGCTGG TTTCCCAACC GCGACGGCAC CAAGCTGGCG
TACAAGCTCA ATCCCAACAA CGCCGACGCG GCCACGATGT ACGTGATGGA CGTCGCCAGC
GGCGAGACCT CGACGGTCGA CGTCATCGAC GGCGCCAAGT ACGCGAGCGC GGCCTGGAAG
CCCGACGGCA GCGGCTTCTA CTACACCCGC CTGCCGAGCG ATCCCGACAT CCCGATCGCC
GACCTGCCGG CGCGCGCCGA GATCCGCTAC CACGAGCTGG GCAGCGATCC CGCCGGCGAC
GAGCTGGTGT ACCCGGCCAC CGGCGATCCC GGCACCTTCC TCAGCGTGTC GCTGTCCCGC
GACGGCCGCT ACCTGATGGT GAGCGTGCAG CACGGCTGGA ACTCGAGCGA CGTGTACTTC
AAGGACCTGC GCCGCGGCCG CGACGCCGGC TTCGAGCCGC TGGTCACCGG CGAGAAGGCG
CACTTCAGCG TGCGCGCCTG GCGCGGTGAC TTCTACGTGC TCACCAACCA CGAGGCCCCG
CGCTACCGCA TCTTCAAGGT CGATCCGCGG CGCCCGCGCA TGTCGCGCTG GCGCGAGATC
GTGCCCGAGA GCGAGGCAGT GATCGACAGC TTCAACATCG TCGGCAACCG CCTCGTGGTC
ACCTACCTGA GCAACGCCTA CAGCCGCATG GAGGTGCGTT CGCTCAGCGG CCAGCGCATC
CGCGAGGTCA CCCTGCCGGA AGTCGGCAGC GTGTCCAACA TGGCCGGCAA CGAGGACGCG
GACGAGGCCT TCTACGCCTT CACCTCGTTC ACCTCACCGC CGCAGATCTA CCGCACCTCG
GTGGCCACCG GCGAGAGTGA GCTGTGGTTT GAATTCGACC TGCCGGTCGA CACCAGCCAG
TTCACGGCCG AGCAGGTCTG GTACCCGTCG CGCGACGGCA CGCAGATCTC GATGTTCCTC
ATCCGCCGCA AGGACCTCAG CAGCGACCAG GCCCATCCCA CCATCCTCTA CGGCTACGGC
GGCTTCAACG TCAACCTCAC GCCCGCGTTC TCGACCAACA TCGTCGCCTG GGTCGAGCGC
GGCGGCATCT ACGCCATCCC CAACCTGCGC GGCGGCGGCG AGTACGGCGA GGAGTGGCAC
AAAGCCGGGA TGCGGCTCAA CAAGCAGAAC ACCTTCGACG ACTTCCTGGC CGCGGCCGAT
TTCCTCATCG AGACCGGCTG GACCTCGCCG CAGCGGCTGG CGATCTGGGG CGGCTCCAAC
GGCGGCCTGC TGGTCGGCGC GGCCATGACC CAGGCGCCCG AGAAGTTCGC GGCCGTGGTG
TGCGCGGTGC CGCTGCTCGA CATGCTCCGC TACCACCTCT TCGGCAGCGG CAAGACCTGG
ATCCCCGAGT ACGGCTCGGC CGACGACGCC GCCGAGTTCT CGGTGCTCAG CGGCTTCTCG
CCGTATCACC GCGTGGTCGA GGGCACCGCG TACCCGGCGC TGCTGATGCT CAGCGCCGAC
AGCGACGACC GCGTCGATCC CATGCACGCG CGCAAGTTCA CGGCCGCGGT GCAGTGGGCC
AGCAGCAGCG ACGAGCCGGC GATCATGCGC ATCGAGCACA ACTCCGGCCA CGGCGGCGCC
GACATGGTGC GGCAACTGGT CGAGCGCAAC GCCGACAGCT TCGCCTTCGT CGCCGACGAG
CTGGGCATGG CGGCCGCGCC GCCGCCCGCG CCGGCCGAAA CCGACCTGGT GTCCGACGGC
GCCGAAGGAG CGGCGCAATG A
 
Protein sequence
MTVPPSPSER AFPPEQHATQ DEEVELVYPE TRREDTRETI HGVEVADPYR WLENADDRMV 
ASWMLAQDGL ARSYLEALPA RDGLAARLRE LNYYDAISVP AKRGERYFFT RRHADKEKSI
LYWRQGQGQE QVLIDPNTLS DDGSTSLGGW FPNRDGTKLA YKLNPNNADA ATMYVMDVAS
GETSTVDVID GAKYASAAWK PDGSGFYYTR LPSDPDIPIA DLPARAEIRY HELGSDPAGD
ELVYPATGDP GTFLSVSLSR DGRYLMVSVQ HGWNSSDVYF KDLRRGRDAG FEPLVTGEKA
HFSVRAWRGD FYVLTNHEAP RYRIFKVDPR RPRMSRWREI VPESEAVIDS FNIVGNRLVV
TYLSNAYSRM EVRSLSGQRI REVTLPEVGS VSNMAGNEDA DEAFYAFTSF TSPPQIYRTS
VATGESELWF EFDLPVDTSQ FTAEQVWYPS RDGTQISMFL IRRKDLSSDQ AHPTILYGYG
GFNVNLTPAF STNIVAWVER GGIYAIPNLR GGGEYGEEWH KAGMRLNKQN TFDDFLAAAD
FLIETGWTSP QRLAIWGGSN GGLLVGAAMT QAPEKFAAVV CAVPLLDMLR YHLFGSGKTW
IPEYGSADDA AEFSVLSGFS PYHRVVEGTA YPALLMLSAD SDDRVDPMHA RKFTAAVQWA
SSSDEPAIMR IEHNSGHGGA DMVRQLVERN ADSFAFVADE LGMAAAPPPA PAETDLVSDG
AEGAAQ