Gene Hoch_6437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6437 
Symbol 
ID8548852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp8831842 
End bp8833473 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content68% 
IMG OID646391098 
Productmalate synthase A 
Protein accessionYP_003270799 
Protein GI262199590 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.588827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCAA CCCCTCAAAC CGAAGCCAGC AGCGGCCTGC AGGTCACCGG CTCGCTCAGC 
GATCGCTACG CCGAGCTCCT CACGCCCGAG GCGCTGGGCT TCGTCGAGTC GCTGGCCCGC
GCCTTCGAGG AGCGGCGCCG CGCACTGCTG GCCAGTCGCC GCGAGCGCCA GGCACGCCTG
GACGCCGGCG AGCGACCGAG CTTCGTCACC GAACTGGAGT CCGCGCCTGA CGCCGAGACC
CGCGCACAGT TTTCGTCCGC GTGGACCGTG GCGCCCATCC GGCCCGACTT GCTCGACCGC
CGCGTCGAGA TCACCGGGCC GACCGATCGC AAGATGGTCA TCAACGCGCT CAACTCGGGC
GCCAACGTGT TCATGGCCGA CCTCGAGGAC TCGTCGTCGC CGACCTGGGA CAACGCGATC
CAGGGCCAGA TCAACCTCTT CGATGCGGTC CGCCGCACCA TCAGCTTCAC CAACCCGGCC
GGCAAGCAGT ACGCGCTCAA CGAGTCCATC GCCACGCTGA TGGTGCGCCC GCGCGGCTGG
CACCTCGACG AGAAGCACCT GCTGCTCGAC GGCCAGCCGA TGTCGGCCTC GCTCGTGGAC
TTCGGCCTGT ACTTCTTCCA CAACGCGCGC GAGCTCATCG ATCGCGGCAC TGGGCCCTAC
TTCTACCTGC CCAAGCTCGA GAGCCATCTC GAGGCCCGGC TGTGGAACGA CGTGTTCGTG
CTCGCGCAGG ATCGCCTCGG CATCCCGCAC GGCACCATCC GCGCCACCGT GCTCATCGAG
ACCATCCTGG CCGCGTTCGA GATGGACGCC ATCCTGTTCG AGCTGCGCCA GCACTCGGCC
GGGCTCAATT GCGGACGCTG GGACTACATC TTCAGCATCA TCAAGAAGTT CCGCGCCGAC
CCCGGCTTCG TCATGCCCGA CCGCGCGCAG GTCACCATGA CCTCGCACGC CATGCACTCG
TACTCGCTGC TGGCCATCCA GACCTGCCAC CGCCGCGGCG CCCACGCCAT CGGCGGCATG
GCGGCACAGA TTCCGGTCAA AGGCGACCCC GAGGCCAACG AGCGCGCGCT CGGCAAAGTG
CGCGACGATA AAGAGCGCGA GGCGCGCGAC GGCCACGACG GCACCTGGGT CGCGCACCCG
GGCCTGGTCG AGCTGGCGCG CAAAGCCTTC GACGCGCACA TGGAGGGTCC GCACCAGATC
CAGCGCAAGC GCGAGGACGT CACCATCACG GCGGACGACC TGCTCACCGT GCCCGAGGGC
TCGATCACCG AGAAGGGCCT GCGCCACAAC CTCGACGTCG GCATTCGCTA CCTCGGCGCC
TGGCTGGCCG GCACCGGCTG CGTGCCCATC TACAACCTGA TGGAGGACGC GGCCACGGCC
GAGATCTCGC GCGCCCAGGT GTGGCAGTGG GTGCACCACG ACAGCGCCAA GCTCTCGGAC
GGTCGTCCCA TCACGGCCGA GATGGTCCGC GCCTGGACCC GCGAAGAGCT CGAGCGCATC
CGCGCCGACC TCGGCGACAC CGCCTTCGCG CAGGCCCGCT ACGAACAGGC CGCGGCCATG
ATCGAAGACA TCGCCACCCG CCCGCAACTC GACGATTTCA TGACTTCGGT CGCCTACGAC
CAGCTCGACT AA
 
Protein sequence
MSSTPQTEAS SGLQVTGSLS DRYAELLTPE ALGFVESLAR AFEERRRALL ASRRERQARL 
DAGERPSFVT ELESAPDAET RAQFSSAWTV APIRPDLLDR RVEITGPTDR KMVINALNSG
ANVFMADLED SSSPTWDNAI QGQINLFDAV RRTISFTNPA GKQYALNESI ATLMVRPRGW
HLDEKHLLLD GQPMSASLVD FGLYFFHNAR ELIDRGTGPY FYLPKLESHL EARLWNDVFV
LAQDRLGIPH GTIRATVLIE TILAAFEMDA ILFELRQHSA GLNCGRWDYI FSIIKKFRAD
PGFVMPDRAQ VTMTSHAMHS YSLLAIQTCH RRGAHAIGGM AAQIPVKGDP EANERALGKV
RDDKEREARD GHDGTWVAHP GLVELARKAF DAHMEGPHQI QRKREDVTIT ADDLLTVPEG
SITEKGLRHN LDVGIRYLGA WLAGTGCVPI YNLMEDAATA EISRAQVWQW VHHDSAKLSD
GRPITAEMVR AWTREELERI RADLGDTAFA QARYEQAAAM IEDIATRPQL DDFMTSVAYD
QLD