Gene Hoch_1436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1436 
Symbol 
ID8543818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1946605 
End bp1948863 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content73% 
IMG OID646386148 
Producthypothetical protein 
Protein accessionYP_003265883 
Protein GI262194674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAAC CCCGAACGGC TCCGGCCATC GAATCCTCTG CCGAGCGCGC GCCGCGCGGA 
CGCCGCATCC TGTGGCGCAC GACCCAGGTG GTGCTGGGCT TGTTGGCCGG GCTCGCGCTG
GCCGAGCTCG GGTTCTGGTG GCGCGACCAA GGTGCGTTTC CCCACGTCAA CGTGTACCTG
CCCGACGCCG AGCTGGGCGC GCGCCTCGAG CCCGGCGCCG AGCAGGGCTT CAAGCTGCGC
GACAACCCGC TCACCCACAT CCGCATCAAC GCGGACGGGT ATCGCGGCGC CGAGCTGCCG
CCGCCGGCCG AGGACGAGAT CCTGGTGGTC GGCGATTCGC AGGTCTTCGG CCTGGGCGTC
GAACAAGACG AGACCTTCAG CGCGCAGCTC GCCAAGCTCA GCGGACGGCC GGTGGTCAAC
GGCGGCGTGC CCACCTACGG GCCCGGCGAG TACACTGCCG TGGCCCGCGA GATGCTCGAG
AAGCGCTCGC CGAGCACCGT GGTGTACGTG GTCAACATGG CCAACGATCT GTTCGAGACC
AAGCGCCCCA ACCGCGAGCG CCACGCGATC TGGGACGGCT GGGCGGTGCG CATCGAGACC
GCGCCCGCGG ACACCGTCGA GTTTCCCGGA CGCCGCTGGC TGATGAGCCG CTCACACGCT
GTCTACGCGC TGCGGCGCTG GAATCACAGC GCCGATCCGA CGGTCGACCT CGGCTTTGCC
TCCGAGGGCA CGTGGAACGA CCTGGTCGAT TGGGGCGCGC AGGCCGGCGA GCTGCACGCC
GACGCGCGCG CCGAGGCCGA CAAAGCACGC AGCGAGCGCA GCGACAAGCT GCGTGCGCTC
GAGGCCGACA TCGACGCGGC CGAGGGCGAG GTCGAGCGGC TGCTGGTGCT GAGCAATCCC
GACGCCGAGT ACGGGGAGGA CAACCTGCGG CTGCAGGCGG CGCGGGCGTC GCCCGGCGAC
ATCGTGATCG ACGATCTCGC CGAAGAGGGG CGCTCGGTGG TGGTCACCGC CGGGCTGCTC
CAGGCCGGCG TGCTGTACCG CCATCAGCTC CTGCGGCGCG CGGCCCGGGG GCCGCAGAAT
CAGCACACGC GCGACCTGCT GAGCACCGCG GCGAACCGCG ACGAGCTGTT GCAGCAGCGC
CTGGCCGTGC ACTCGCAGAC CGCGGCCGAG ACCCGGGTGC CCTCGGTGCT GGAGCCGCAG
CTCCGCGAGC TCGAGGCGCT GTGCGAGCAG CACGGCGCCG AGCTGGTGGT GGTGGCGCTG
CCCATCGACG TGCAGGTGTC GGCGGACGAG TGGGCGAAAT ACGGCGTCGA TGAGCCGCTG
GACATGGAGC CGACCCGCGT GCTGCTCGCC GACCTGGTCG CCAGCGCCGA GGGTATGGGC
GTACGCGCGC TCGATGTCAC CGCGCCGCTG GCCGAGGTCG CGGCCCGCCA GCCGGCCTTT
CTCGACGGCG ACATTCACTT GACCCCGGCC GGTCACCGCG CGGTGGCCGA GGCCCTGGCC
GCCAAGCTGA GCGAGCCGGC GCCGCTGCCG CAGCCCGAGC CCGGCTTGCC CGAGGGGCGG
ACGCGGGTGC CGCCGCCGGC CGCGTGGCGC GGCATCCTCG AGGCCACGGT GCGCGGCTCG
AGCGCGCTGC GCTGTCAGAC CTATATGGTC GCCGAGTGGC TGCGCGTGTC GTGTCTGCGC
GAGGGTCGGC GACACGTGCC CTCGGGCATC GCGGTCGAGA GCGGCGGCCA CGGCGAGGCC
ATGACCCTGG TGACGGGCGA GGCCGCGACG CTGGTTGCGC CGCTGCTGCG CGGCGACGAG
CTGGTGGCGA GCTTCCGCTG GAGCGATCGC GCGCGCACCC TGGTGGCGCG CTGGCCCGAG
GACGCCGAGC GGCCGCGCAT GTGGTTCGAG GATCGCGGCC AGGAGGGCGC GCCCTACCAG
GAGGACGAGG CCGCGACGAT GCTGTGCGAC TGCTACAAAG AGCTGTACAG CGAGCGGGAT
TGCGCGGTCG ACGAGTACGG CTATCCCAAC ACCTCGCAGT GCGAGCCCAT CTGCGTGGGC
GCCTACGGCG AGATTTCGGA CGCCTGTCTG GCCGCGTACG AGGTCGATTG CGCCAAGCTA
GAGGCCTGCG CGCGCGGCGA ACTCGAGGCC CAGCCGCCGT GTCCGGCCGG CGAGGTCAAC
CTGGCCACGA CCGGGCAGTG CGTGGCCCTG TGCAGCGACG AGCGACCGTG CGCCGAGGGC
ACCTGCACGC CGTATCGCGG CGCCCAGGTG TGCCGCTAG
 
Protein sequence
MDQPRTAPAI ESSAERAPRG RRILWRTTQV VLGLLAGLAL AELGFWWRDQ GAFPHVNVYL 
PDAELGARLE PGAEQGFKLR DNPLTHIRIN ADGYRGAELP PPAEDEILVV GDSQVFGLGV
EQDETFSAQL AKLSGRPVVN GGVPTYGPGE YTAVAREMLE KRSPSTVVYV VNMANDLFET
KRPNRERHAI WDGWAVRIET APADTVEFPG RRWLMSRSHA VYALRRWNHS ADPTVDLGFA
SEGTWNDLVD WGAQAGELHA DARAEADKAR SERSDKLRAL EADIDAAEGE VERLLVLSNP
DAEYGEDNLR LQAARASPGD IVIDDLAEEG RSVVVTAGLL QAGVLYRHQL LRRAARGPQN
QHTRDLLSTA ANRDELLQQR LAVHSQTAAE TRVPSVLEPQ LRELEALCEQ HGAELVVVAL
PIDVQVSADE WAKYGVDEPL DMEPTRVLLA DLVASAEGMG VRALDVTAPL AEVAARQPAF
LDGDIHLTPA GHRAVAEALA AKLSEPAPLP QPEPGLPEGR TRVPPPAAWR GILEATVRGS
SALRCQTYMV AEWLRVSCLR EGRRHVPSGI AVESGGHGEA MTLVTGEAAT LVAPLLRGDE
LVASFRWSDR ARTLVARWPE DAERPRMWFE DRGQEGAPYQ EDEAATMLCD CYKELYSERD
CAVDEYGYPN TSQCEPICVG AYGEISDACL AAYEVDCAKL EACARGELEA QPPCPAGEVN
LATTGQCVAL CSDERPCAEG TCTPYRGAQV CR