Gene Hoch_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3679 
Symbol 
ID8546069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5059054 
End bp5061942 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content73% 
IMG OID646388347 
Productmetal dependent phosphohydrolase 
Protein accessionYP_003268073 
Protein GI262196864 
COG category[R] General function prediction only 
COG ID[COG1480] Predicted membrane-associated HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.3383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000942474 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAAA CCGAACGCAA CGACGAGATG CGCGCCACGC GCTCGCAGCT CGCCAAGGTC 
ATCCGCCGCC GTCACACGGT GGGTCTGGCT CTGGCCATCT TCATGAGCCT CGCGTTCGCC
GCCGTCACCG CCCCGCTGGT CGCCATCGAT CTCCTGCTGC CGACCACCGG GGCGGTCTCG
TTCGAGGTCG GCAAACCGGC GCCGATCACC GTGCGCGTGC CCCGCTTCTC GGGCTTCTCC
GACGGCAGCG TGGAGCTGAG CCCGGGCGTG CTGGTGTCGC GCGGGACCAT CGTCGACCGC
GAGGACTATC AAAACCTGCA GGTGCTGCGC GCCAACGGGC CGGACTCGTG GACCGCGGTC
GGCGGCTACT TCGTGCTGCT GCTCGCGGTG GCGCTGATGT TCACCATCCA CCTGCGGCGC
TCGCACCGCG GCCGGCTGCT GGCCACGCAG GCCTACACCA TGCTGCTCTT GCTCGGCTGC
ACCATCCTGG CCGAGATCGC GCTGTTATTT TCATCGATGT CGGTGTTCCT GGTGCCGGTG
GCGTGTCTGG CCATCGTCGC CACCGTGGTC GTCGACGTCT CCGCCGGCAT CGCCTCGGGG
TTCCTCGCCA GCGTGCTCAT CGGCCTGCTG GTGCCCTTCG ACCTGGGCGT GGTGCTGGTG
CTGGTGCTGC AGACCACGAC CGCCTCGCTG GTGGTCGGCG AGGGCCGGCC GCGCAACCGC
CGCATCTTCG CCGCCGGCCT CATCGGCGGT GTGTGCGCGG CCATCGGCTA CATCGTGCTG
TGTTACCTGA CCACCAAGCA CTCGCCCTTC GCCGAGCTGG CCTCGCCCAC GCGCTCGCCG
CTGGCGGCGA CCGTGGCCGG CGGCGTGCTC AGCGGCCTGC TGGCCATCCC GCTCAAGCCG
CTCTACCAGT ACCTGCGCGG CGATATCACG CAGTCCAAGC TGGTCGAGCT CGAGGACCTG
TCCAATCCGC TGCTGCGCCA GATCGCGACC AACTCGCCCG GCACCTGGCA GCACAGCCTG
GCCATGGCCA ACATGGCCGA GATCGCGGCC AACGCCATCG GCGCCGACGG CCGCCTGGTG
CGCGTGGGCG CCTACTACCA CGACCTCGGC AAGTCGCTGC AACCCAAGTA CTTCATCGAG
AACCTCGAGG CCGGCGAGAC CAGCCCGCAC GATCGCCTGC CGCCCGACGT CTCGTGCGAC
GCGATCTTCG CCCACGTCAC CGAGGGCATC CGGGTGGCCC GCAAGAACCG CCTGCCCGAG
CGCATCATCG ACTTCATGTA CATGCACCAC GGCGACGGGC TGCTCGAGTA CTTCTGGGCC
AAGTGTCGCG AGAGCGGCAA CCCCAAGGGG CTCGTCGAGG ACGATTTCCG CTATCCCGGG
GTGCCGCCGC AGAGCCGCGA GACCGCGATC CTGGCCATCG TCGACGCGGT CGAGGCGGCC
TCGCGCACGC TCAAGAAGCC CGACGAGCGC GCCATCGAGA GCCTGGTGCA GCGCATCGTC
TACGGCAAGC TGCACCTCGG CCAGCTCGAC CAGTCGGGGC TGAGCATGTC CGACCTGCGC
AAGATCTCGG ACTCGCTGCG CGAGACCATC AAGCACGCCC ACCACGGCCG CATCGAGTAC
CCGTGGCAGC GCGAGGAGCG CAAGAAGAAG GCCGCTGAGG CGGCCGCGGC CAAAGGTCTC
GCCGCCCCTG CCGACACCGA CACGGACGTC GCCGCCGAGC CCGCGGCCGC CGCGCCGCCA
GCGGCCGCGA GCGCGCCGCC GCCGGTGTCG GCCACCCAGC GCATCATCCA AGAGCCGCGG
CTCGACTCGC TCGACGTGCC GCGCCCCTAC TGGCAGGGTC GCCGGCGCAG CAGTCAGGAG
CCGGTGCTGG CCACCGCGCC CACCGAGGAG CTGGCGCCGC CGCCGGCCAA GCCCCAGCGC
GCGCGCGCCG ACAGTGACGA TATCGGCCAC TCGGCCACGC TCGACATCGA GATCGTGGCC
GCCGACGCCG ACGCCGACGC CAGCCCGGCC GCGGCCGCGA ACAACGGCGC CAAGGCGGCA
GCGGCGGGTG CTGGGGCGGA CGAAGCCGAG ACGGCGTCCG ACTACCCGTA CCTGGAGTCG
GCCAGTCAGT CGATGTCGGC GCTGCCGATG GCGGCTGAGC CCGACGACGA AGGCGACGAC
AACGCGGTGG ACGAAGGCGG CGACACCACC GGCGCGGCCC CGCCGAGCCA GGCGCCGACG
CTGTCGCTGC TCACGGCCGA GCCCGATATT GACACCGCGG CCGCGGCCGC GACCCCTGCT
CCGGCCCCGA GCCAGCCCGC GGCCCCCGAG GAGGTGCGCG CGCCGCTGCC GGCCTCGGTG
ACCCCGCGCG CGCCCGCCGA TATCGAGGCC GCGAGCGGCG ACGACGAAGC CCTGGCCCAG
GTCCACGCCG CCGAGCGCGC CGAGCGCGCC GCCGTGCTGG TCGCCGCGGC GTTGGCGCAT
GGGGCGCCCG ACGACGACGA CGACGACCCC GCCGCGGGCG ACGGCAACCG CGACGGAGCC
GAGGGCGCAG CCAGCAACGC CACGCCCATC CCCATGGGCA CCAGCGTCAC CGGGCCGCCG
CCGGCCACGC GCACGCGGCC GGTCGCGCGT CTGCGCGCCC GCGACAGCGG CATTCGCGAC
AGCGGCATTC GCGACAGCGG CGCTCGCGAC AAGCCGCTGG GCAGCACCAA GCTGGGCTTC
CCGGGCGCCG CGCAGGCGAT CGAGGAGGTC GCCGCCGGGC GTCCGAGCGG CGAGGCCGGA
GCTGCCGACG AGCCCGCCGA GGGCGCGGCT GCCGCGAGGC CAGAGACGTC CGAGGGGGCC
TCCGAGGGCG CCGCTGAGGC GGGCGCCGAG GAGAGGATCG ACCTGCCGCC CTCGGCGCGC
GCCAAGTGA
 
Protein sequence
MSETERNDEM RATRSQLAKV IRRRHTVGLA LAIFMSLAFA AVTAPLVAID LLLPTTGAVS 
FEVGKPAPIT VRVPRFSGFS DGSVELSPGV LVSRGTIVDR EDYQNLQVLR ANGPDSWTAV
GGYFVLLLAV ALMFTIHLRR SHRGRLLATQ AYTMLLLLGC TILAEIALLF SSMSVFLVPV
ACLAIVATVV VDVSAGIASG FLASVLIGLL VPFDLGVVLV LVLQTTTASL VVGEGRPRNR
RIFAAGLIGG VCAAIGYIVL CYLTTKHSPF AELASPTRSP LAATVAGGVL SGLLAIPLKP
LYQYLRGDIT QSKLVELEDL SNPLLRQIAT NSPGTWQHSL AMANMAEIAA NAIGADGRLV
RVGAYYHDLG KSLQPKYFIE NLEAGETSPH DRLPPDVSCD AIFAHVTEGI RVARKNRLPE
RIIDFMYMHH GDGLLEYFWA KCRESGNPKG LVEDDFRYPG VPPQSRETAI LAIVDAVEAA
SRTLKKPDER AIESLVQRIV YGKLHLGQLD QSGLSMSDLR KISDSLRETI KHAHHGRIEY
PWQREERKKK AAEAAAAKGL AAPADTDTDV AAEPAAAAPP AAASAPPPVS ATQRIIQEPR
LDSLDVPRPY WQGRRRSSQE PVLATAPTEE LAPPPAKPQR ARADSDDIGH SATLDIEIVA
ADADADASPA AAANNGAKAA AAGAGADEAE TASDYPYLES ASQSMSALPM AAEPDDEGDD
NAVDEGGDTT GAAPPSQAPT LSLLTAEPDI DTAAAAATPA PAPSQPAAPE EVRAPLPASV
TPRAPADIEA ASGDDEALAQ VHAAERAERA AVLVAAALAH GAPDDDDDDP AAGDGNRDGA
EGAASNATPI PMGTSVTGPP PATRTRPVAR LRARDSGIRD SGIRDSGARD KPLGSTKLGF
PGAAQAIEEV AAGRPSGEAG AADEPAEGAA AARPETSEGA SEGAAEAGAE ERIDLPPSAR
AK