Gene Hoch_0534 details

Gene Information

Locus tag: Hoch_0534
Symbol:
ID: 8542914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 721236
End bp: 722600
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 75%
IMG OID: 646385328
Product: protein of unknown function DUF472
Protein accession: YP_003265065
Protein GI: 262193856
COG category: [S] Function unknown
COG ID: [COG2898] Uncharacterized conserved protein
TIGRFAM ID:
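
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 721236
end_bp = 722600
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: a 1365 bp span, 455 codons, and a 454 aa protein.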


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGC GCGCCCACCC GCCCGATGCC GTGGTCGCGC TGCTCCGCCG CTACGGGCGC 
GAGACCACCT CATTTCAGAC CCTCGAGCCC GGGCTCTCGT ACTGGCTCGA CCCCGACGGC
GACGCCTGCG TGGCCTATGC CGACACCGGC GCCGCCTGGG TCGCCGTGGG CGCGCCGCTG
GCCGAACCCG CGCGCCGCGA TGAGGTCGCC GCCCGCTTCC TCGAGGCCGC TCGCGCCGCC
GGCCGCCGGG CGCGCTTCTT CGGCGTCGAG GACGGCGACT TCGGCGGCGC CGCCTTCACC
TGCCAGCACG TCGGCGAGCA GCCGTGCTGG GACCCGCGCG CCTGGCCCGA GGTGCTGCGC
AAGAAGCGCA GCCTGCGCGA ACAGCTCCGG CGCGCGCGGG CCAAAGGCGT GCGCGTGCGC
CGCCTCGCCG CCGCCGAGCT GAGCGATCGC GGCAACCCCA TCCGCCGCGA GCTCGACGCG
CTGGTGGCCG AATGGCAAGG CGCGCGCGAG ATGGCGCCCA TGGGCTTCGT GGTGCAGATC
GCGCTCGACC TGCTGCCCGA GGAGCGGCGC GTGTTCGTCG CCGAATTTTC CGGACAGGTG
GTCGCCTTCC TCGGCGCGGT GCCCGTCTAC GCGCGCGGCG GCTGGTTCTT CGAGGACGTC
TTGCGGCGCG GCGGCGCGCC CAACGGCACC GTCGAACTGC TCATCGACCA CGCCATGCGC
GCGCTGGCCG AGGACGGCTG CGACTACGTC ACCTACGGCC TGGCGCCGCT GGCGCGCACG
CCCTCGCCGG TGCTCGGCTG GATCCGCGAC CACACCCGCT GGCTGTATCA CTTCGACGGC
CTGCGCGCGT TCAAGGACAA GTTCCAGCCC GCCGCCTGGC AGCCCGTGTA CCTGGCCTAC
CCGCGCCGCG AGCGCGGCCT GCGCGCCACC GTGGATCTGC TCGCGGCCTT TGCCTGCGGC
AGCTTCGCGC GCTTCGGCTG GGCCACCCTG GTGCACCGCG CCGCCGCGGT CACGCGCTGG
CTGGCCTGGC TGCTGCTGCC GTGGACCGCG CTGCTGATCG CGGCCGACAG CGCGCGCTGG
TTTCCCAGCC CGGCCGTCAA AGCCGCCTGG ATCGCCTACG ACCTGCTGCT GTTCACCGGC
CTGCTGAGCC TGGCGCGGCG CTGGCGGCCG CGGCTGGCCA CGGCCCTGGC CGGCGGCGCG
GCCCTCGACT TCAGCCTTGG CTCGGTGCAG GCCGGGCTGT ACAACGCCGA GCGCGTGCGC
GGGCCGGTGG ACCTCCTGTT CCTGGTCCTG GCCCTGGGCG CGCCGCTGTT CGCGGCGCTG
TTTCTGTGGA GCGCCCGCCA CCGAGGTGCG GGCGGCAAGC GCTGA
 
Protein sequence
MATRAHPPDA VVALLRRYGR ETTSFQTLEP GLSYWLDPDG DACVAYADTG AAWVAVGAPL 
AEPARRDEVA ARFLEAARAA GRRARFFGVE DGDFGGAAFT CQHVGEQPCW DPRAWPEVLR
KKRSLREQLR RARAKGVRVR RLAAAELSDR GNPIRRELDA LVAEWQGARE MAPMGFVVQI
ALDLLPEERR VFVAEFSGQV VAFLGAVPVY ARGGWFFEDV LRRGGAPNGT VELLIDHAMR
ALAEDGCDYV TYGLAPLART PSPVLGWIRD HTRWLYHFDG LRAFKDKFQP AAWQPVYLAY
PRRERGLRAT VDLLAAFACG SFARFGWATL VHRAAAVTRW LAWLLLPWTA LLIAADSARW
FPSPAVKAAW IAYDLLLFTG LLSLARRWRP RLATALAGGA ALDFSLGSVQ AGLYNAERVR
GPVDLLFLVL ALGAPLFAAL FLWSARHRGA GGKR