Gene Hoch_3436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3436 
Symbol 
ID8545824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4748194 
End bp4749708 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content65% 
IMG OID646388103 
ProductHAD superfamily (subfamily IG) hydrolase, 5'- nucleotidase 
Protein accessionYP_003267831 
Protein GI262196622 
COG category 
COG ID 
TIGRFAM ID[TIGR02244] HAD superfamily (subfamily IG) hydrolase, 5'-nucleotidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0157387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG AATCATCTGA TGCGCGGGCG CGCCTTACGG CGCTGCTCGC CGATATACGC 
AGCGAGAGCG GTGCGGTCAC ACGCGGCCGC GTGTACTGCA ACCGCAGCAT CAAGCTGTCG
TCGATCGACT ACATCGGCTT CGACATGGAC TACACGCTGG CCCGCTACCA GCAGGAGCGC
CTCGAGAAGC TGTCGATCGA CCTCACGCTC AAGTACCTGG TGCAGAACCA CGGCTACCCC
GACGCCATCC GCGAGCTGCG CTACGACCCG CGCTTCGCCA TCCGCGGGCT GGTGGTCGAC
CGCGCGCTCG GCAACGTGCT CAAGATGGAC CGCCACGGCT ACGTGCGCCG CGCGTATCAC
GGCTTCCGCC TGCTCGACAA AGAGGAGCGC CGCGAGCACT ACCTGAGCAA GCGCATCAAC
CTCTCGGACA AGCACTACGT GTGGATCGAC ACGCTGTTCT CGCTGCCCGA GGCGGTGATG
TTCGTGACCC TGGTCGACTA CTTCGACAAC CGCCAGGACG AGGTCGATTA CGCCGCGCTG
TACGATGATA TCCGCGCCTC GATCGACCTC GCGCACCGCG ACGATTCGCT CAAGTCGATC
ATCAAGGCCG ATCTGGCCGC GTACATCGTC GCCGATCCGG CGCTGTCCGA GACGCTGCAC
AAGCTGCGCT CCTCGGGCAA GAAACTGTTT CTGCTCACCA ACTCCTATTA CGACTACACG
CGCGCGGTCA TGAGCTATCT GCTCGACGGC ATCAGCGCCG CGTATCCCTC GTGGCGCGAG
TACTTCGACA TCGTCATCGT CGGCGGTGAG AAGCCGGGCT TCTTCACCGG GCAGAAACCC
TTCGTGCCTA TCGATCCCGA CAGCGGCGAG GTCATCCCCG GCGAGGTCGC GGAGCTGGCC
CCGGGCCGCA TCTACCAGGG CGGCAATATC CTGGATTTCG AACGCATGTC CGGCGCCATG
GGCGCGCAGG TGCTGTACAT CGGCGACCAT ATCTATGGCG ACATCCTGCG GCTCAAGAAG
TCGCACGTGT GGCGCACGGC CATGGTGCTC CAGGAGCTCG AGGACGAATA TTCGGTGGGC
GCGCGCGCCG AACAGCGCAT CCGCGACCTC ACCGTGCTCG ACCGTCGCCG GCGCAACATC
GAGTCGGAGA TCGACTTCCA GATGCTGGTG CTCAAGCAGC TCCAGAATCT GCTCGAGGAG
GTCGGTGCCG AAGCCGATGA GGCGCTGCGC CACGAGGCCG CCGAAGCCGT GCGCCAGGCC
GAGGAGAGTC TGGCGTCGCT ACAACTGCGC GCCCGCCTCA TGCAGGAAGA GGTCGACGCG
CTCGAGGACA GCATCGACCA TATGTACAAT CCGTACTGGG GCAGCAGCTT CCGCGCCGGC
CACGAGAGCA GCCGCTTCGG CGAGCAGGTG TCGGATTACG CCGATCTGTA CACCAGCCGG
GTGTCGAACT TCCTCGCCTA TTCGCCGCTG CGCTACTTCC GCGCGCCGCG GCGGCTGATG
CCGCACGATC TGTAG
 
Protein sequence
MSGESSDARA RLTALLADIR SESGAVTRGR VYCNRSIKLS SIDYIGFDMD YTLARYQQER 
LEKLSIDLTL KYLVQNHGYP DAIRELRYDP RFAIRGLVVD RALGNVLKMD RHGYVRRAYH
GFRLLDKEER REHYLSKRIN LSDKHYVWID TLFSLPEAVM FVTLVDYFDN RQDEVDYAAL
YDDIRASIDL AHRDDSLKSI IKADLAAYIV ADPALSETLH KLRSSGKKLF LLTNSYYDYT
RAVMSYLLDG ISAAYPSWRE YFDIVIVGGE KPGFFTGQKP FVPIDPDSGE VIPGEVAELA
PGRIYQGGNI LDFERMSGAM GAQVLYIGDH IYGDILRLKK SHVWRTAMVL QELEDEYSVG
ARAEQRIRDL TVLDRRRRNI ESEIDFQMLV LKQLQNLLEE VGAEADEALR HEAAEAVRQA
EESLASLQLR ARLMQEEVDA LEDSIDHMYN PYWGSSFRAG HESSRFGEQV SDYADLYTSR
VSNFLAYSPL RYFRAPRRLM PHDL