Gene Hoch_0501 details

Gene Information

Locus tag: Hoch_0501
Symbol:
ID: 8542881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 681488
End bp: 682777
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 70%
IMG OID: 646385296
Product: Peptidoglycan-binding lysin domain protein
Protein accession: YP_003265033
Protein GI: 262193824
COG category:
COG ID:
TIGRFAM ID:
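
The coordinates and lengths listed above are consistent with one another, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 681488
end_bp = 682777

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.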


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCGGA TTTCCACCTT AGTCATCGCG GCGCCGCTGC TGCTCGTCGG AGCGCGCGGC 
GCCTTCGCCC AGGATGTGCC CCAGACCCTC GAGCCCGGCG TGACCGCGCC GGCTCGGTCC
GCGCCCGCAC CGGCGCCCGC GCCGGCGCCC GCGCCGCAGC CGCAGACCGT CGTCATCCCC
GTGCCTGAAG GCACCACGCT GCCGCCGCCG GCCGAGGATG AGAGCGGGTT CTTCTACCTG
CCCGACGACT TCGTCACCGA TCCTCAGCAG GAGGGCTTTC AGCCCGGCCC CGCGCCCGAG
GTTCACGAAG TGCGCAGCGG CGACACGCTC TGGGATATCT GCTTTTTGTA CTTCAACAAT
CCCTGGGATT GGCCGCGGAT CTGGTCGTAC AACCCCGAGA TCACCAATCC CCACTGGATT
TATCCCGGCG ATCTCGTGCG CCTCTACGCC GAGGGCGAAG GCCCCCGGGT CAACGACCTC
ACGCCGCTCG ACAACCTGCC CGTGGATCCC GAGGGCGACT TCGAGGACCC CGAGGCGCCG
CTGATCGTCG GCAATCAGAA CCCGCGGCCC TCGCGCGACG AGGGTGTGCG CCTGCGCCAG
CTCACCTTCG TGGGCCAGGA GGTGCGCGAG AACTCGTTCA CCATCGTCGG CGCCATCGAG
GAGCGCAGCC TGCTCTCGGC CGGTGACTCG GTGTATCTCG AATACCCCGA GGACCGCCCG
CCCGAGCAGG GCAAGCGCTA CGCCATCTAC ACCGAGACCG TGCCGGTGCG GCATCCCGAC
AACAAGAGCG CGGTCACCGA CGTCGGCAGC TACGCGCGCA TTCTCGGCGA GCTGCAGGTG
GTGAGCGTGC GCGAGGGCAA ACGCGCGCGC GCCTTCATCA CCGACTCCTT CGACGTCATC
GAGCGCGGCG ACAAGGTCGG CCCGCTGCGC AGCACCTTCC GCACGGTCGA GCCCCAGCCC
AACGAGGTCG AGCTGCAGGG CACCCTCGTC GCCCTGGTCG GCGGCGAGCA GCTCATCGGC
GAGAACCAGG TGGTGTTTCT CGACCGCGGG TCCGAAGACG GCCTGCGCGT CGGCAATCGC
CTCTACGTCG TGCGCCGCGG CGACGCCCGC GGCACCACGA CCTCGTACTT CGAGGGCATC
GGCCAGAACG ACCAGCGCTT CCCGGCGCGC GCCATCGGCG AGGTCATGGT GGTCGAGACC
GGCAAGAAGG TGGCCACCGC CCTGGTCACC CTGTCGCTGC AGGAATTCGG CGTCGGCGAC
CGCGTGCTGA TGCGCAAGAC CGCGCCCTGA
 
Protein sequence
MGRISTLVIA APLLLVGARG AFAQDVPQTL EPGVTAPARS APAPAPAPAP APQPQTVVIP 
VPEGTTLPPP AEDESGFFYL PDDFVTDPQQ EGFQPGPAPE VHEVRSGDTL WDICFLYFNN
PWDWPRIWSY NPEITNPHWI YPGDLVRLYA EGEGPRVNDL TPLDNLPVDP EGDFEDPEAP
LIVGNQNPRP SRDEGVRLRQ LTFVGQEVRE NSFTIVGAIE ERSLLSAGDS VYLEYPEDRP
PEQGKRYAIY TETVPVRHPD NKSAVTDVGS YARILGELQV VSVREGKRAR AFITDSFDVI
ERGDKVGPLR STFRTVEPQP NEVELQGTLV ALVGGEQLIG ENQVVFLDRG SEDGLRVGNR
LYVVRRGDAR GTTTSYFEGI GQNDQRFPAR AIGEVMVVET GKKVATALVT LSLQEFGVGD
RVLMRKTAP
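
As a sanity check, the gene sequence can be translated and compared with the protein sequence. The sketch below is illustrative only. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code, and it uses only the first 60 bases of the gene sequence as printed above. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced here and are not part of any database API.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene and first 20 residues of the protein, as printed above.
gene_prefix = "ATGGGCCGGA TTTCCACCTT AGTCATCGCG GCGCCGCTGC TGCTCGTCGG AGCGCGCGGC"
protein_prefix = "MGRISTLVIA APLLLVGARG"

print("Prefix translation matches:", translate(gene_prefix) == clean(protein_prefix))
print(f"GC fraction of prefix: {gc_fraction(gene_prefix):.2f}")
```

Passing the full gene sequence to the same functions would extend the check to the whole record. The stop codon at the end of the gene is dropped by `translate`, so the result should be one residue shorter than the number of codons.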