Gene Hoch_0253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0253 
Symbol 
ID8542632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp375938 
End bp377593 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content74% 
IMG OID646385049 
Product3-dehydroquinate synthase 
Protein accessionYP_003264787 
Protein GI262193578 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCC ACGTCTTCCT GACCGGCTTC ATGGCCACCG GCAAGAGCAC GGTTGGCCGC 
CAGCTCGCCG CCCGCCTGCG GCGGCCCTTT CTCGATCTCG ACGACGCGGT CGAAGCCGAG
GCCGGCCACA CCGTGGCCGA TATCTTCGCC AGCGAGGGCG AGATCGGCTT TCGTCGCCGC
GAGCGCGCCG CCCTGCAGCG CATCGCCGAC GGACCCGCGG CCGTCATCGC CACCGGCGGG
GGCGCTGCCT GTCACGGCGA CAACCTGGCG CAGATGCGCC GCAGCGGCTT GACCATCGCG
CTCACGGCGC CGCTGGCGAC CGCCCGCGCG CGCGCCGACG CCGGCGAGCG CGAGCGGCCG
CTCCTGCGCG CCACCGAGGC CGAGCTCGAG GCCCTGTACC GCTCCCGCGA GCCGATGTAC
CGCCAGGCCC ACGCCTGCGT GCGCACCGAG GACAGCGAGC CCGCGCTGTT GGCCCGCGAG
ATCGCGGCCC TGGTAGCGCG CGCCGAAACC CTGCCCGACG ACGCCCAGGA GCAGGCGAGC
TGGGTGGCGC TGCGCGAGGG CGCGTATCCC GTGGTCGTCG CCGAGGGCGG CAGCGATCGC
GTGGGCACCT GGCTGCGCAG CGTGCTCGGC GAGCGGCGAC CGAGCCGAGT TGCCGTGGTC
TCGGACGACA ACGTGGCCCC GCTGCACGGC GAGCGCGTGC GCCGGGCGAT CGACGGCGCC
GACCTGTGCG AGAGGCCGTG CTCGCTGCAC ACCGTGGCAG CCGGCGAGCG CTCCAAGCGC
TTCGAGGTAT TGGGCCGACT GGTCGACGAG CTGGTCGCCG AGGGCCTCGA CCGCAGCTCG
CTGGTGGTCG CCCTGGGCGG CGGCGTGGTC GGCGATCTGG CCGGCTTCAC GGCCGCGTGT
CTGTATCGCG GCGTGCCCGT GGTGCAGGTG CCGAGTACGC TGCTGGCCAT GACCGACGCC
GCCATCGGCG GCAAGACCGG CATCGACATC GCGGCCGGCA AGAATCTGGT CGGCGCCTTC
TGGCAGCCGC GCATGGTGGT CGTCGATCCC GCGCTGCTGG CGACCCTGCC CGCGCGCGAG
CTGCGCGCCG CCTTCGGCGA GCTGATCAAG TACGGCCTGC TCGACGGCGA GGAGCTGTAC
GCGCGCATCG AGGCGCTCGC CGACGCGCTG GCGGCCGCCG GGGACGAGCC CGGCGCGGCG
CTGTCGCCCG CGTTCACCGA GATCATCCGC CGCTGCGCCG CGATCAAATG CTGGATCGTC
ACCCGCGATC AGCGCGAGCA GACCGGCGAG CGCGCGCTGC TCAACCTCGG CCACACCGTG
GGCCACGCCA TCGAGGCCGC CTGCGCCTAC GAGGGCATGC TGCACGGCGA GGCCGTCGCG
CTCGGGCTGG TGGCCGCGTG CCGGGTCTCG GCGCGGCTCG GACAGTGCGC GGACGGGCTC
GAGGAGCGGG TGCGCGCGAC CGTCGAGCGC GCCGGCCTCG ACGCTGATCT CGACCGCTGG
CTGCGCGGCG ACGACGCCGA GCGGGTGTTG GGATTTCTGG CCACAGACAA GAAGCGCGCT
GGCAAGCGCA TCGGTTTCGT CACCATCGGC GCGATGGGCG ATTGCGGCAT CACGCCCATC
GAACTCGCAG AACTGGTGAG AATTTTGCGC CCCTAG
 
Protein sequence
MPRHVFLTGF MATGKSTVGR QLAARLRRPF LDLDDAVEAE AGHTVADIFA SEGEIGFRRR 
ERAALQRIAD GPAAVIATGG GAACHGDNLA QMRRSGLTIA LTAPLATARA RADAGERERP
LLRATEAELE ALYRSREPMY RQAHACVRTE DSEPALLARE IAALVARAET LPDDAQEQAS
WVALREGAYP VVVAEGGSDR VGTWLRSVLG ERRPSRVAVV SDDNVAPLHG ERVRRAIDGA
DLCERPCSLH TVAAGERSKR FEVLGRLVDE LVAEGLDRSS LVVALGGGVV GDLAGFTAAC
LYRGVPVVQV PSTLLAMTDA AIGGKTGIDI AAGKNLVGAF WQPRMVVVDP ALLATLPARE
LRAAFGELIK YGLLDGEELY ARIEALADAL AAAGDEPGAA LSPAFTEIIR RCAAIKCWIV
TRDQREQTGE RALLNLGHTV GHAIEAACAY EGMLHGEAVA LGLVAACRVS ARLGQCADGL
EERVRATVER AGLDADLDRW LRGDDAERVL GFLATDKKRA GKRIGFVTIG AMGDCGITPI
ELAELVRILR P