Gene Hoch_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4014 
Symbol 
ID8546410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5507604 
End bp5509646 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content71% 
IMG OID646388686 
Productoxidoreductase domain protein 
Protein accessionYP_003268406 
Protein GI262197197 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.876961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTG ACGGCACCAA ATTCAAGGTC GGCCTCGTGG GGGCCGGCGG CATCAGCGAG 
TTTCACATTT ACGCGCTCCG AGCGCTGCCC CAGTGCGAGC TGCTCGGCAT CCACGACATC
GATGAGGCGC GCGCGCAGGC CACCGGCCAG CGCTTCGACG TGCCGACCTT TCCCTCGCTC
GAGGCGATGC GCGAGGCCGG CGCCAACGTC ATCCACGTGC TGACGCCGCC GCACGTGCAC
GCGCCGGTCG CCATCAAGGC GATGGAGCTG GGCTGCGACG TGTTCATCGA GAAGCCGCTG
GCCGAGGACG TGGACGAGGC CCGCAAGGTG CTCGAGGTCG CCGAGCGCAC CGGCCGCGTG
GCCTCGGTCA ACCACTCGCT GCTCTACGAC CCGCAGGTCA AGCGCGCGCT CGAGCTGGCC
CGCAGCGGCG CGCTCGGCAA GGTGGTGTCA GTCGACATCC TGCGCGGCTC GGACTATCCG
CCGTACGAAG GTGGCCCGCT GCCGCCGCAC TACCGCACCG CGGGCTACCC CTTCCGCGAC
ATCGGCGTGC ACTGCTACTA CCTGATCCAG GCCTTCCTGG GCAGCATCGA GCACGTCGAC
GCCGAGTGGG ACAGCCTGGG CGGCGATCCC AACCTGGCCT TCGATGAGTG GCGCGCGCTG
GTCAAGTGCA AGGGCGGCAT CGGCCAGTTC CAGATCACCT ACAACACCAA GCCGGCGCAG
AGCCAGCTCA TCATTCACGG CACCCGCGCG GTGCTGCGCG TGGACCTGTT CACCATGTTC
CACGCCAAGC GCGCGACCAC GCCGCTGCCC AAGGCGGCCG AGCGGCTGAT CAACGCGATG
ACCGACTCGA TTCAGCCGCT GATCGACGTG CCGGTCAACG TCGTGCGCTT CGTGTCCAAG
CAGGTGCAGC CCTACCAGGG CCTGCGCGAC CACGTAAAAG CCTTCTACGA GGCCCTGGAG
GCGGGGACGC CGCCGCCGGT GTCGCTCGAG GAGTCGATCG AGGTGCTGCA CTGGACCGAG
TCCGTGGCCC GCGCCGCCGA GGCCGAGCAC GCCGAGCGCC TGGCCCAGTA CACGCTGTCG
AAGACCGTGC CCTTCGTGGT CACCGGCGCC TCGGGCTCCC TGGGCAGCGC GGTGGTGCAG
CGCCTGCTCG ACGACGGTCA CAAGGTGCGC ATCTTCGTGC GCCGTCCGCC CGCCGAGGTG
CCCGAGGGCG TCGAGGTGGC GATCGGCAAC CTGGGCGATC CCAAGGCCGT GGATCGCGCT
ATCGCCGGCG CCGAGACGGT CATCCACGTG GGCGCGGCCA TGAAGGGCAG CGCCATCGAC
TTCGAGTGCG GCACCGTGGT CGGCACCCGC AACGTGCTCG ACGCCTGCAA GGCGCACGGG
GTCGCGAAGC TGGTGCACAT CAGCTCGATG TCGGTCGTGG ACTGGGCGGG CTCGTCCGAG
GGCCAGCCGG TGTCCGAGGC CACGCCGCTG GAGCCGCGCG CCGATGAGCG CGGCGCGTAC
ACGCAGACCA AGCTGGCCGC CGAGAAGCTG GTGGTCGAGG CCGCGCAGGC CGGCGAGGTG
CCGTCCGTGG TGCTGCGCCC GGGGCAGATC TTCGGCGGCA AGATCCCGGT GCTCACCGGC
GCGGTGGCGC GTCGCGCCGG CGGCCGCTGG CTGGTGCTGG GCGACGGCGA GCTGCTCTTG
CCGCTGATCT ACATCGACGA CGTGGTCGAC GCGGTGATGG CGGCTGCCGA CAGCGAGCTG
AGCGGCGGCG AGATCATCCA GCTCATCGAT CCCGAGCCGC TCACGCAGAA TCAGGTGCTC
GAGACCGTGG GCGGCGACGC GCCCGTCATC CGGGTGCCGC GCGCGGCCGT GTTCTTTGCC
GGACGCATGT CCGAACCCGT GTTCGGCGCG CTCAAGCGGC AGTCGCCGGT GGCCGTCTAC
CGCCTGCAGT CGGCCATGGC GCGGCTCAGC TTCGACAGCG ACCGCGCCCA GGAGCTGCTG
TCGTGGACGC CGCGGGTCGG CGTGCGCGAG GGCCTGCGCC GCGAGCTGGC GCAGCAGCGC
TAG
 
Protein sequence
MTTDGTKFKV GLVGAGGISE FHIYALRALP QCELLGIHDI DEARAQATGQ RFDVPTFPSL 
EAMREAGANV IHVLTPPHVH APVAIKAMEL GCDVFIEKPL AEDVDEARKV LEVAERTGRV
ASVNHSLLYD PQVKRALELA RSGALGKVVS VDILRGSDYP PYEGGPLPPH YRTAGYPFRD
IGVHCYYLIQ AFLGSIEHVD AEWDSLGGDP NLAFDEWRAL VKCKGGIGQF QITYNTKPAQ
SQLIIHGTRA VLRVDLFTMF HAKRATTPLP KAAERLINAM TDSIQPLIDV PVNVVRFVSK
QVQPYQGLRD HVKAFYEALE AGTPPPVSLE ESIEVLHWTE SVARAAEAEH AERLAQYTLS
KTVPFVVTGA SGSLGSAVVQ RLLDDGHKVR IFVRRPPAEV PEGVEVAIGN LGDPKAVDRA
IAGAETVIHV GAAMKGSAID FECGTVVGTR NVLDACKAHG VAKLVHISSM SVVDWAGSSE
GQPVSEATPL EPRADERGAY TQTKLAAEKL VVEAAQAGEV PSVVLRPGQI FGGKIPVLTG
AVARRAGGRW LVLGDGELLL PLIYIDDVVD AVMAAADSEL SGGEIIQLID PEPLTQNQVL
ETVGGDAPVI RVPRAAVFFA GRMSEPVFGA LKRQSPVAVY RLQSAMARLS FDSDRAQELL
SWTPRVGVRE GLRRELAQQR