Gene Hoch_0260 details

Gene Information

Locus tag: Hoch_0260
Symbol:
ID: 8542639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 385377
End bp: 386756
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 68%
IMG OID: 646385056
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_003264794
Protein GI: 262193585
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
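
The start, end, gene length, and protein length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length = gene_length_bp(385377, 386756)
print(length)                     # 1380
print(protein_length_aa(length))  # 459
```

Both results match the Gene Length and Protein Length fields of this record.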


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.755813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCAA ACTCGACCCC CTCCGCAGCG GCCTGGAGCC CGACTTCTTG GCAGGAAAAG 
ACTGCCACCC AACAAGCCAC CTACCCCGAC CAGGAAGCGC TCGACCGCGT GGTCGCCCAG
ATCGCGCGCT TGCCGCCGCT GGTGACCTCG TGGGAGATCG AGGCGCTCAA AGAGAAGCTC
GGTCGCGCCG CCCGCGGTGA GGCCTTCTTG CTGCAAGGCG GCGACTGCGC CGAATCCTTC
GACAACTGCG ACTCGCAGAC CATCGCCGGT AAACTCAAGG TCCTGCTGCA GATGAGCCTG
GTGCTCACGC ACGGGATCCG GCGCCCCATC ATCCGCGTGG GCCGCATCGC CGGGCAGTAC
GCCAAGCCGC GCTCGGCCGA TACCGAGAGC CGCGGCGAGG TCACCCTGCC GAGCTACCGC
GGCGACCTCG TCAACCGCCT GGCCTTCACC GACGAAGATC GCACGCCCAA CCCCGATCTG
ATGCTGCGCG GCTACGAGCG CGCCGCGCTC ACGCTCAACT TCATCCGCGC GCTGGCCGAC
GGCGGCTTCG CCGACCTGCA CCACCCGGAA TACTGGGACC TGTCCTTCGC CCAGCACCAC
GACACCGAGG GTCACTACGA GCGCATCGTC GCCTCGATTC GCGACGCCAT CGCCTTCATG
GAGTCGGTGG GCGAGATGCG GCTCACCGGC CTGGGCCGCG TGGACTTCTA CACCAGCCAC
GAGGGCCTGA TGCTGCACTA CGAACAGGCC CAGACCCGGC GCGTCCCCCG CCGTGACGGC
TGGTACAACC TGTCGACGCA TATGCCCTGG ATCGGCATGC GCACGGCCAC CATGGATAGC
GCCCACATCG AGTACTTCCG CGGCATCCGC AACCCGGTGG CCGTGAAGCT GGGGCCCGCG
GTCACGCCGG AGTGGATCAC CCAGCTCCTC GACGTGCTGC ATCCCGACGA CGAGCCCGGC
CGGCTCACCT TCATCCATCG CCTGGGCGCT GAGAAGGTCT CCGACCTCCT GCCCAGAATG
ATCGATACCG TCCAGCGCAG CGGCAAGACC GTGCTGTGGA CCGTGGACCC CATGCACGGG
AACACCGAGT CCACCGAGCG CGGCGTCAAG ACCCGCCACT TCGACAAGAT CCTGGCCGAG
GTCGAAGCCT CGTTCGAGAT CCACGAGAGC ATGGGCAGCA CGCTCGGCGG CGTCCACCTC
GAGCTCACCG GCGACAACGT CACCGAGTGC GTGGGTGGCG CCCGCGGTCT CAGCGAGGCC
GACCTCGAAC GCGCCTACCG CAGCACCGTG GACCCGCGCC TCAACGCCGA GCAAGCGCTC
GAGCTGGCGC TGCGCGTGAC CCAACACCTT CATACATCGA ACCAATCGTC CCGCCGCTGA
 
Protein sequence
MKPNSTPSAA AWSPTSWQEK TATQQATYPD QEALDRVVAQ IARLPPLVTS WEIEALKEKL 
GRAARGEAFL LQGGDCAESF DNCDSQTIAG KLKVLLQMSL VLTHGIRRPI IRVGRIAGQY
AKPRSADTES RGEVTLPSYR GDLVNRLAFT DEDRTPNPDL MLRGYERAAL TLNFIRALAD
GGFADLHHPE YWDLSFAQHH DTEGHYERIV ASIRDAIAFM ESVGEMRLTG LGRVDFYTSH
EGLMLHYEQA QTRRVPRRDG WYNLSTHMPW IGMRTATMDS AHIEYFRGIR NPVAVKLGPA
VTPEWITQLL DVLHPDDEPG RLTFIHRLGA EKVSDLLPRM IDTVQRSGKT VLWTVDPMHG
NTESTERGVK TRHFDKILAE VEASFEIHES MGSTLGGVHL ELTGDNVTEC VGGARGLSEA
DLERAYRSTV DPRLNAEQAL ELALRVTQHL HTSNQSSRR
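
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. It builds the codon table from the usual TCAG-ordered amino-acid string, which gives the elongation assignments shared by translation tables 1 and 11. It does not handle the alternative start codons of table 11, which is not an issue here because the gene starts with ATG. The helper names `clean` and `translate_cds` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGAAACCAA ACTCGACCCC CTCCGCAGCG GCCTGGAGCC CGACTTCTTG GCAGGAAAAG"
print(translate_cds(first_row))  # MKPNSTPSAAAWSPTSWQEK
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate_cds` and comparing the result with `clean()` of the protein sequence extends the check to the whole record.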