Gene Hoch_0551 details

Gene Information

Locus tag: Hoch_0551
Symbol:
ID: 8542931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 743607
End bp: 744740
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 69%
IMG OID: 646385345
Product: Peptidoglycan-binding lysin domain protein
Protein accession: YP_003265082
Protein GI: 262193873
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1388] FOG: LysM repeat
TIGRFAM ID:
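
The coordinate and length fields above can be cross-checked against each other. The sketch below is not part of the database record. It assumes 1-based, inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 743607
END_BP = 744740
GENE_LENGTH_BP = 1134
PROTEIN_LENGTH_AA = 377


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid count, assuming exactly one terminal stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1134
print(coding_codons(GENE_LENGTH_BP))   # 377
```

Under these assumptions both derived values agree with the listed Gene Length and Protein Length.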


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.161209
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGA TACGAGCAAG CAGCGGTCAT CCTCCTCTCG TCCCCTCGAC GGAGTCGAAC 
GCCCCGCCGC GACGCTCGGC GCCCGCGCAG CGCCGCACGC TCCGCATCGG CTGGCTGCGC
GCGACCCTGG TGGCCGCGTG TGCGCTGCTG GCCTCGAGCA GCGACGCCCG CGCCGAGGGC
AAAGGCCGAG CGCACGTGGT CGAGGCCGGA CAGACGCTGT GGGAGATCGC CCAGTCCTAC
GGCTGCAAGG TCGATGAGCT GCAGGCGGTC AACGAGATCG ATGGTCTGCT GATCCAGCCG
GGCCAGCGCC TGCGGGTGCC GCGGTGCAAG CAACAGGGCG GCAGCAGCAG CAAGGGGCTG
CACGTGCTCA GCCACGAGGT GAGCAGCGGG GACACGCTGT ACGAAATCGC CCGCCGCTAC
GACACCAGCC TCGATGACCT GCGTCGCCGC AACGACATCA AGGGCAACGT CATCCACCCC
GGCCAGACCC TGCGCGTGGC CGTGGGCAAG GACGGCGAGG GTCGCCCGGT GGCCGGCCAG
AGCGTGGGCT CGGTGGTCCG CGGTCGGCTG GTCAACGGCA TGCAACTGCC CGAGGGCCGC
GGCTACTACC GCCGCCGTCC GCATCGCGCC TGGGGCGCCA GTCACACGAT TCATCACATC
CGGCGCGCGA TCGCGTCGGT GCGCAATCGC CTGCCCAAGG TCCACGAGGT CGCCATCGGC
GACATCTCGA CGCACGATGG CGGCCAGCTC GCCGACCACC GCTCGCACCA GTCGGGGCGC
GACGTCGATA TCGGCCTGTA CTACAAGAAG CGGCCGCGCG GGTATCCCAA GAGCTTCATC
CGCGGCGACC AGCAGAACCT CGACCTGCAG GCCACCTGGG CGCTCATCGA GGCGCTGGCC
AACACATCCA AGGTCGCGGG CGGGGTCGAC ATCATGTTCC TCGACTACGA CCTGCAGGGA
CATCTGTACA AGTGGGCGAA AAAGCACGGC GTGTCCAAGC GCAAGCTGGG CGAGATCCTG
CAGTACCCGC GCGGCGAGTC GTCGGCCAGC GGTCTGGTGC GGCACGAGCC CGGACACGCC
ACCCACGTGC ACGTCCGCTT CAAGTGCCCC AAGAGCGACG AGAAGTGCTG GTGA
 
Protein sequence
MSTIRASSGH PPLVPSTESN APPRRSAPAQ RRTLRIGWLR ATLVAACALL ASSSDARAEG 
KGRAHVVEAG QTLWEIAQSY GCKVDELQAV NEIDGLLIQP GQRLRVPRCK QQGGSSSKGL
HVLSHEVSSG DTLYEIARRY DTSLDDLRRR NDIKGNVIHP GQTLRVAVGK DGEGRPVAGQ
SVGSVVRGRL VNGMQLPEGR GYYRRRPHRA WGASHTIHHI RRAIASVRNR LPKVHEVAIG
DISTHDGGQL ADHRSHQSGR DVDIGLYYKK RPRGYPKSFI RGDQQNLDLQ ATWALIEALA
NTSKVAGGVD IMFLDYDLQG HLYKWAKKHG VSKRKLGEIL QYPRGESSAS GLVRHEPGHA
THVHVRFKCP KSDEKCW
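
The relationship between the gene and protein sequences above can be illustrated with a short translation sketch. It is a minimal example, not the database's own method. It builds the codon table in the conventional TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first row and the final five codons of the gene sequence are used, to keep the example short.

```python
# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row and last 15 bases of the gene sequence listed above.
FIRST_ROW = "ATGAGCACGA TACGAGCAAG CAGCGGTCAT CCTCCTCTCG TCCCCTCGAC GGAGTCGAAC"
LAST_CODONS = "GAGAAGTGCTGGTGA"

print(translate(FIRST_ROW))    # MSTIRASSGHPPLVPSTESN
print(translate(LAST_CODONS))  # EKCW*
```

The two outputs match the first 20 and last 4 residues of the protein sequence, followed by the stop codon. Applying `gc_fraction` to the full 1134 bp sequence gives a value to compare against the reported GC content.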