Gene Hoch_0091 details

Gene Information

Locus tag: Hoch_0091
Symbol:
ID: 8542462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 142325
End bp: 144628
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 72%
IMG OID: 646384879
Product: peptidase M16 domain protein
Protein accession: YP_003264625
Protein GI: 262193416
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
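
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 142325
end_bp = 144628

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2304 767
```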


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGCA TTCCGCCCAG CGCGCCCGCA CGCGCCGCGT TCGCCCTGCT GCTCTCGGTC 
TCCCTGTCCG CGCTCGCGCT CAGCGCCTGC GGCGCGCCCA GCGGCACCGT CGATCTCGCG
GGCGTGACCC CGCGCCTGCC CGGCGATGGC AGCGCCAACG TCAGCGATCC CGTCGCCAGC
GCGCGCCCGG TGGCCAGCGC GAGCGAGCCC AGCGCCGAGC CCGATCCCTG GGCCGGCCGT
GACGACCTCA TCGAGGCCCC TGCGCCGCAG CCGCCCGCGC CCCTCGAGAT GCCGGCGATC
CAGCGTTTCA CGCTGCCCAA CGGCCTGCCC GTGGTGGTCG TGAGCAAGCG CGACGTGCCC
GTGGTCGGCG TGCAGCTCAT GGTCCGCGCC GGCAACGGCG CCGTGCCGGT CGATCAATCC
GGCCTGGCCC AGTTCGTCGG CGCCATGCTG CCCAAGGGCA CGCGCACGCG CAACGCCACC
GCCATCGCCG AGGCGATCGA GTCGGTCGGC GGTCGCCTGG CGGTCGAGCC CGGCTACGAG
GCCACGCTGC TCTCGTGTCA GGTGCTGGCG GCCGAGCAGA ACACCTGCCT GAGTATCATC
GCCGACATCG CCGCCCAGCC GACCTTCCCT GAGGACGAGC TCGGCCGCGT GCGCCGCGAG
CTGCTGGCCG GCGTGCGCCA GCGCCTCGAC AGCGCCTCGC TGCTGGCCAA CGCGCACTTC
CAGAACCTGC TCTGGGGCGA CGAGCACCCG CGCGGCCGCC CGACCAGCGA GCGCAGCATC
GAGGCGCTGT CGCGCGCCGA CCTCGTGGCC TGGCACAAGC GCTGGTTCGT GCCGCAGAAC
GCGGTCCTCG TGATCGCGGG TGACGTCGAT CCCAAGGGCC TGCGCTTCCG CCTGGGCCGC
GCCTTCAACA CCTGGCGGCG CACCGGCAAG GCGCCCGCGC AGCCCAGCGT GCCCGCGCCC
GCGCCCGACA GCCCGCGCAT CCGCCTGGTC GACAAACCCG GCCAGACCCA GACCCACATC
CGCGTCGGCC ACATGGGCAT CGCCCACGAC GCGCCCGACT ACTTCGCCAC CCTGATCTTC
AACCACGTGC TCGGCAGCGG CGGCTTCTCC TCGCGCCTGA TGCAGGTGAT CCGCAGCCAG
GCCGGCAAGA CCTACGGCGC GTCCTCGCGC TTCGAGCGCA GCCGCCAGCC GGGCGCGTTC
GTGGTCCGCA CCTTCTCGCG CAACGCCGAG GCCCTGGCCA CGGTCGAGCT GCTGCTCGCC
GAGGTGGCGC GCATGCAGCA GGAAGGGCCC CGCGAGGCCG AGGTGGCCAG CGCCATCGCC
AACCTCGCCG GCCAGTACGC GATCTCGATG CAGAGCGCGG CCGACATCGC CGGCGCGCTG
CTGGCCGCCG AGCTGTACGG CTTCGACCAG AGCTACGTGC GCGACTACCC GATGAAGCTC
GCCGAGGTGA ACAAGGAGTC GGCGACCCAG GCCGCGGCCG CGCATCTGCG CCCCGACCGC
GTCGCCATCG TGCTGGTCGG CGACGCCCGC GCCCTCGAGC CGCAGCTCGA GAGCCGCGGA
CTGCCCTACG ACAAGGTGAA CTATCTGACC CCGGTGGCCG CCGCCGACCG CGAGACCGCC
CAGGTCTCGG CCGAATCCGC GGCCGCGGGT AAAGCCCTGC TGGCCAAGGC GCTGGCGGCC
AAGGGCGGCG CCAAACGCCT GGCCGCGGTG CGCACCATGC ACATCGAGGC CACCGGCGTG
ATCCACTCCG GCGGGCAGAA GATCGACGCC ACGCTCGAGC GCCGCTTTTT GGCGCCCGAC
AAGCTGCGCC TCGACCTCAA CCTCACCGTG CCCGGCGGCA CCGCCGAGCT GCTCACCGTG
CTCAACGGCA AGCGCGCCTG GACCAAGCAG CCGAGCGGCG TCGTCGAGCT GCCGCCCGAG
GGCGTGGCCG AGCTGCACAA GCAGGTGTGG CGCGACCAGG AGTTTTTGCT GCTGCACGCG
TCCGAGCCCG GCGTCCAGGT CCAGGCCGCG GGCCAGAGCA ACCGCGACGG CAAGACCTAC
GAGCTGCTGC GCGTGATCCG CGACGACGGC GTGAGCGTCG ATATCCTGCT CGATCCCAAG
ACCCACCTCA TCGCCGGACT GACGTACGAC GAAGCGCCGG GCCGCAGCGT GTTCGAACAG
CTCGACGACT ACCGCACGGT CGAGGGCATC CAGATCGCCC ACCAGCGGCG CACCAAGAGC
GTCGAGGCCG ATCTCGAGGT GCGCATCGAC AGCGTCGCCA TCAACGAGAA GCTCACCAGC
GACGTCTTCG AGCAGCCCAA GTGA
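
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_fraction`, neither of which comes from the source database) strip the layout and compute the fraction of G and C bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 20 bases of the gene sequence as printed (two blocks of ten).
fragment = clean_sequence("ATGCGACGCA TTCCGCCCAG")
print(len(fragment), round(gc_fraction(fragment), 2))  # 20 0.65
```

Applied to the full 2304-base sequence, the result can be compared against the GC content listed in the record.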
 
Protein sequence
MRRIPPSAPA RAAFALLLSV SLSALALSAC GAPSGTVDLA GVTPRLPGDG SANVSDPVAS 
ARPVASASEP SAEPDPWAGR DDLIEAPAPQ PPAPLEMPAI QRFTLPNGLP VVVVSKRDVP
VVGVQLMVRA GNGAVPVDQS GLAQFVGAML PKGTRTRNAT AIAEAIESVG GRLAVEPGYE
ATLLSCQVLA AEQNTCLSII ADIAAQPTFP EDELGRVRRE LLAGVRQRLD SASLLANAHF
QNLLWGDEHP RGRPTSERSI EALSRADLVA WHKRWFVPQN AVLVIAGDVD PKGLRFRLGR
AFNTWRRTGK APAQPSVPAP APDSPRIRLV DKPGQTQTHI RVGHMGIAHD APDYFATLIF
NHVLGSGGFS SRLMQVIRSQ AGKTYGASSR FERSRQPGAF VVRTFSRNAE ALATVELLLA
EVARMQQEGP REAEVASAIA NLAGQYAISM QSAADIAGAL LAAELYGFDQ SYVRDYPMKL
AEVNKESATQ AAAAHLRPDR VAIVLVGDAR ALEPQLESRG LPYDKVNYLT PVAAADRETA
QVSAESAAAG KALLAKALAA KGGAKRLAAV RTMHIEATGV IHSGGQKIDA TLERRFLAPD
KLRLDLNLTV PGGTAELLTV LNGKRAWTKQ PSGVVELPPE GVAELHKQVW RDQEFLLLHA
SEPGVQVQAA GQSNRDGKTY ELLRVIRDDG VSVDILLDPK THLIAGLTYD EAPGRSVFEQ
LDDYRTVEGI QIAHQRRTKS VEADLEVRID SVAINEKLTS DVFEQPK
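
The protein sequence should follow from the gene sequence by codon-by-codon translation. The sketch below is a hypothetical helper, not the database's own method. It builds the codon table from the standard amino acid assignments, which, to the best of my knowledge, translation table 11 shares for internal codons; table 11 differs in its alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence, as printed in the record.
print(translate("ATGCGACGCA TTCCGCCCAG C"))  # MRRIPPS
```

The output matches the first seven residues of the protein sequence, and the last two codons of the gene (AAG TGA) translate to a final K followed by a stop.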