Gene Hoch_5286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5286 
Symbol 
ID8547698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7265708 
End bp7267087 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content69% 
IMG OID646389960 
Productcystathionine beta-synthase 
Protein accessionYP_003269664 
Protein GI262198455 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01137] cystathionine beta-synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATGG ATCAGATTCA CAATTCGATC TGCGACGCAG TGGGGCGGAC CCCGGTCGTA 
CGTTTGTCCC GTTTCGGACG CGATCTGCCG TGTGAGCTGC TCGCCAAGTG CGAGTTCATG
AACCCCGGCG GTTCCGTCAA AGACCGCATC GGCGTGCGCA TGATCGAGGA CGCCGAGGCC
GCGGGGCGCA TCAAGCCGGG CGACACGCTC ATCGAGCCGA CCTCGGGCAA CACCGGCATC
GGTCTGGCGA TGGCGGCCGC GGTCAAGGGC TACCGGGTGA TCATCACCAT GCCCGAGAAG
ATGAGCCAGG AGAAGCAGGT GGTGCTCGAG GCGCTGGGCG CCGAGATCAT CCGCACGCCC
ACGGAGGCGG CCTGGGATTC GCCCGAGAGC CACATCGGCG TGGCCAAGCA GCTCAAAGAG
GTGATTCCCA ACGCCCACAT CCTCGACCAG TACGCCAACC CGAGCAATCC GCTGGCGCAC
GAGGAGGGCA CCGCGCGCGA GATCCTCGAG CAGTGCGGCG GCAAGCTCGA CGTGGTGGTG
ATGACGGCCG GTACCGGCGG CACCATCTCG GGTGTGGCGC GCGCGCTCAA GGCCGCGCTG
CCGACGATCG AGATCGTCGG CGTGGACCCG GAGGGGTCGA TCCTGGCCGG TCCCGGCGAG
ATCAAGAGCT ACAAGGTCGA GGGCATCGGC TACGACTTCA TCCCCGATGT GCTCGATCGC
GGTCTGGTCG ACCGCTGGAT CAAGAGCAAC GACCGCGACT CGTTCCGCAC CGCGCGCCAG
CTCATCCGCC AGGAGGGCCT GCTGTGCGGC GGCTCGTGCG GCGCGGCGGC GTGGGCGGCG
GCCAAGGTGT GCCGCGAGCG CCAGCCCGGC GAGCGCGTGC TGGTGATCCT GCCCGACTCG
ATCCGCAACT ACCTCACCAA GTTCGCCGAC GAGCACTGGA TGCGCCAGCA CGGCTTTGCC
CAGTCCGAGT GGGAGATGGG CAGCATCGCC GATATCGTGC GCTGTCTGCC GCCGCGCGAG
GTGCTGTCGG TGAGCACCTC GGCGACCCTG GGCGATGCGC TCGAGCGCTT CCGCGACAGC
GGCGTATCGC AGATGCCGGT GCTCGACGGC GAGCAGGCGC TGGCCGGCAT CGTCACCGAG
ACCGACATGC TGCATCACCT GGTGAGCGGG CGCGCCAGCC ACGACACCTC GGTGGTCGAG
ATCATGGAGC GCCGGGTGAC GACCGTGGGC ATGCACGCGC CCGCCAGCGA GCTGCCGCGC
ATCTTCGACC GGGGCCAGGT CGCCGTGGTC ATCGACGCCG AGCGCCGGGT CGAGGCCATC
GTGACCAAGC TCGACCTCAT CGACATCCTG GCCGCGCGGC GCGCGCCGAG CGCGTCCTGA
 
Protein sequence
MSMDQIHNSI CDAVGRTPVV RLSRFGRDLP CELLAKCEFM NPGGSVKDRI GVRMIEDAEA 
AGRIKPGDTL IEPTSGNTGI GLAMAAAVKG YRVIITMPEK MSQEKQVVLE ALGAEIIRTP
TEAAWDSPES HIGVAKQLKE VIPNAHILDQ YANPSNPLAH EEGTAREILE QCGGKLDVVV
MTAGTGGTIS GVARALKAAL PTIEIVGVDP EGSILAGPGE IKSYKVEGIG YDFIPDVLDR
GLVDRWIKSN DRDSFRTARQ LIRQEGLLCG GSCGAAAWAA AKVCRERQPG ERVLVILPDS
IRNYLTKFAD EHWMRQHGFA QSEWEMGSIA DIVRCLPPRE VLSVSTSATL GDALERFRDS
GVSQMPVLDG EQALAGIVTE TDMLHHLVSG RASHDTSVVE IMERRVTTVG MHAPASELPR
IFDRGQVAVV IDAERRVEAI VTKLDLIDIL AARRAPSAS