Gene Hoch_2018 details

Gene Information

Locus tag: Hoch_2018
Symbol:
ID: 8544400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2786791
End bp: 2788068
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 72%
IMG OID: 646386721
Product: isochorismate synthase
Protein accession: YP_003266456
Protein GI: 262195247
COG category: [H] Coenzyme transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1169] Isochorismate synthase
TIGRFAM ID: [TIGR00543] isochorismate synthases
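
The coordinates and lengths in this record are mutually consistent, which the short sketch below checks. It assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2786791
end_bp = 2788068

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1278 425
```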


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.577572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAACA TGAACAACGT GTTTCCGAGC CAAAGCGCGC AGCCTGATGC CGCGCTGGCG 
ATGCCGCATG CGCTGACGCC TCCGACCCGG TCCGCGGCCG AGTTGTTGGC CGCCTACGAG
ACCTCGTCGG CATTCTTCTT CGCCTCGCCC ACGCGCACCC TGCACGCGCC CGATGTCTAC
GCCGTGATGG TGCCGCAGGC GCCCACGGCC GTGCGCAGTT TGGCCGACCG GGTGGACGAT
CTCCTGGCGG AGTCGCAGCG CTTTGGCCGC GAGAGCGCGA TGGTGGTCGG CGCGGTGCCC
TTCGACCCGC AGGCGCCCGC GCACCTGGTG GTGCCGATGA CGCTCGACAG CGCCGGGCCG
TTGAGGCTCG ACCAGGCCGC GCCCACCCAG CGCGCGCTGA CCGCGGGCTT TCGCATGCAC
CCCTGTCCCG AGCCCGAGGG CTACGCCCGC GGCGTGGCGC AGGCGCTCGG CCTGCTCGCG
CGCGAAGAAC TGCGCAAGGT GGTACTCGCG CGCATGCTCG AGCTCGAGCT GTCGCGGCCG
CTGGACGTGC CCGCGCTGCT GCGGCGCCTG GCCGGGCAGA ACCCGGGCGG CTATGTGTTT
GCGCTCGACC TGCCGGCGCT CGAGGACGCG CCCGCGGGCA CGCGCCGCAG CCTGGTTGGC
GCCAGCCCCG AGCTGCTGGT GTCGCGCAGC GGCAACACCG TGGTCGCCAA TCCGCTGGCG
GGTTCGGCTG CGCGCAGCCT CGATCCCGAG GAAGACCAGC GCCGCGCCAA GGCGCTGCTG
GCGTCCGAAA AAGACCGCGC CGAGCACGCC TTCGTGATCG AGGACGTGGT CGCGCGCCTG
CGCCCCTTCT GCCGTTTCAT CGAGGTGCCG GCCGCGCCCT CGCTGATCCA CACCCAGACC
ATGTGGCATC TCTCGACCCG CATCACGGCC GAGCTGCGCG AGCCCGCCGC GTCCTCGCTG
CGGCTGGCCT CGGCGCTGCA TCCCACGCCG GCCGTGTGCG GCTCGCCCAC CGAGCTCGCG
CGCTCCACCA TCGCGGCCAT CGAGCCCTTC GAGCGCGGCT TCTTCACCGG CATGGTCGGC
TGGTGCGACG CCAGCGGCGA CGGCGAGTGG GTGGTGACCA TCCGCTGCGC CGACGTTACC
GACCAGGCGG TCAAACTCTA TGCCGGTGCT GGAATCGTCG CCGCCTCCGA GCCCGAGCGC
GAGCTCGCCG AGACCTCGGC CAAGCTGCGC ACGATGCTCA ACGCGCTCGG TATCGACGAC
ATTCCCGAGG TGATGTAA
 
Protein sequence
MDNMNNVFPS QSAQPDAALA MPHALTPPTR SAAELLAAYE TSSAFFFASP TRTLHAPDVY 
AVMVPQAPTA VRSLADRVDD LLAESQRFGR ESAMVVGAVP FDPQAPAHLV VPMTLDSAGP
LRLDQAAPTQ RALTAGFRMH PCPEPEGYAR GVAQALGLLA REELRKVVLA RMLELELSRP
LDVPALLRRL AGQNPGGYVF ALDLPALEDA PAGTRRSLVG ASPELLVSRS GNTVVANPLA
GSAARSLDPE EDQRRAKALL ASEKDRAEHA FVIEDVVARL RPFCRFIEVP AAPSLIHTQT
MWHLSTRITA ELREPAASSL RLASALHPTP AVCGSPTELA RSTIAAIEPF ERGFFTGMVG
WCDASGDGEW VVTIRCADVT DQAVKLYAGA GIVAASEPER ELAETSAKLR TMLNALGIDD
IPEVM
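
The sequences above are printed in space-separated blocks of ten residues. A minimal sketch of how such a listing could be cleaned and its GC content computed is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene; pasting in the complete gene sequence would give a value that can be compared against the 72% GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
example = "ATGGACAACA TGAACAACGT"
print(len(clean_sequence(example)), gc_percent(example))  # prints: 20 40.0
```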