Gene Hoch_0068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0068 
Symbol 
ID8542438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp100792 
End bp103797 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content70% 
IMG OID646384855 
Productcysteine-rich repeat protein 
Protein accessionYP_003264602 
Protein GI262193393 
COG category 
COG ID 
TIGRFAM ID[TIGR02232] Myxococcus cysteine-rich repeat 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATA ATCGAGCTCT CGCCCGAGGT CTGTCCGTCG CCGCGCTGCT GGCATTTGCC 
GCCGCCGATG ACGCGCCCAC CGCCCACGCC TTCACCGGCG CCGACACACC CGGTATCGCC
GGAGTCGAAC ACGCCGCCGA ACCCGGCCTC GAAGTCGCCG AGGGGCCGCG CGCGCGCGCG
CAACCGGCGG TCACCTGGAA CCGGCCGCCG GCGCAGCGCG CGGCCGCCTG GGAGCGCTTC
GTCGCGGACA CCGGGACGGC GTGGACGCCG ATGTGGGACG CCGACACCGC GATGCCGCTG
CGCATCGCGG GCGCCGGCAT GGCGATGCCG GGCAGCGTGG CCTCGGCCGA TAAAGCCGCC
GACTACGCGC GCGCGTTCCT GGCCGAGTAC ATCGACCTGC TGGCGCCCGG CAGCCGCCCG
GAGTCGTTCC ACGTGGTCGG CAACGACCTC AGCGACGGCG TGCGCGCGGT GGGCCTGTAT
CAATATCACG AGGGCATGCG CGTGCTCGGC GGCCAGCTCA GCTTCCGCTT CAAGAACGAC
CGCCTGATCC TGGTCGCCTC CGAGGCGCTG CCCGATATCG CGCTGCCGGC CGTGAGCTAC
ACCACCGCCG AGGCCGTGGT CCGGGATGCC GCGCTGTCGT GGGTGGCGGG CGAGGTCGGC
AAAGCCTGGG TGACCGAGAC CAGCGGCCCC TACGTGCTTC CCGTCATCTC TACCGGACGT
GTGTCCTATC ACACCGTCAT GCAGGCGACC GTGGAAGGGC GCGCGCCGAC CTCGCGCTAC
CGCGTGTACA TCGACGCCTC CAGCGGCGAG CCGGTGGCGC GCGAGCAGAT GCTGCTCTAC
GCCGACGCCC AGCTCCTGTA CAACGTGCCC GCGCGCTATC CCGAGGGCGA TCGCGCCGAC
CTGCCCGCGA GCTTCACCGA GGTGGTCTAC GAGGACGAGA GCTACTTCAC CGACGGCGCT
GGCGTGTTCT CGTGGGACGG CGAGGGCGCG GCGTCGGTGT CGGCCTCGGT CAGCGGCGAG
CTGGTGACCG TGAGCAACCA GCGCGCGCCC GACGAGAGCA CGGTGTTCGA GGTCGCGCTC
ACGCCCGCGG GCACCGGCGT GTGGGACGCG CGCGACGACG AGTTCGTCGA CGCCCAGCTC
ACCACCTTCG TGCACTCCAA CATCGTCAAA GAGTACGTGC GCGTGTTCGC GCCCGGTCTC
AAGTATCTCG ACGAGCAGCT CCTGGCGCGC GTCAACATCG ACGACACCTG CAACGCGTTT
TCGGACGGCA CGACCATCAA CTTTTTCCGC GCCAGCGGGC AGTGCGCCAA CACCGGGCGG
CTGCCCGACG TGGTCTATCA CGAGTTCGGA CACTCGATGC ACTGGCAGTC GCTGGTGCCG
GGCGTGGGCG CCTTTGACGG CGCCTTCAGC GAGGGTCTGT CCGACTATCT GGCCGCGACC
ATCACCGGCG ACCCGGCTAT GGCCCGCGGC TTCTTCTACG GCGACGAGCC GTTGCGGCAC
CTCGATCCCG AGGACTTCGA GCACTCCTGG CCGCGCGACA TCGCCGGCGT GCACTACACC
GGGCTCATCT TCGGGGGCGC GATGTGGGAT CTGCGCAAAG AGCTCGTCGC CCTGTACGGC
GAAGAGGAGG GCGTGGCCGT GGCCAACCGG CTGTACTACG CCGCCGTGCT GCGCGCCAGC
TCCATCCCGG CGACCTATTT CGAACTCCTG GCCGCCGACG ACGACGACGG CAACCTGGCC
AACGGCACGC CGCACGAGTG TCTGATCAAC GACGCCTTCG GCGCGCTGCA CGGCCTGCGC
GAGATCGGCA ACGAGCACAT CCCGCTGGGC ATTCAGCCGC CCGAGCGCGA GGGCTACTCT
CTGAGCGCGC GCCTGCAGGG GACCAACGCG CGCTGTGCCG GTGACGAGGT GCTGTCGGTG
ATCGTGCGCT GGCAGCGCCG CGGCAGCGAG GTCGGCGAGG ATCTCGAGGC CACGCTGCAG
GACGGCGGCG ACGGCGTCTA CGAGGCCACG ATTCCGGCGC AGCCGGCGGG CAGCACGGTC
CGCTATCAGG TGGTGGTCGA GTTCGCCAAC GGCGGCGTGA TCACCTTCCC CGACAACCCG
GCCTGGGAAT ATTACGAGTT CTACGTCGGC GAGCTGATCG AGCTGTACTG CACCGACTTC
GAGAGCGATC CCTACGCCGA GGGCTGGAGC CGCGGACAGA CCCGCGGCGT GGCCACCGGC
GGCGCCAACG ACTGGCAGTG GGGCCGTCCG CTGGGCAAGG CCGGCGATCC CGCGGCGCCG
TATTCGGGAG CCGCCAGCAT GGGCAACGAC CTCGGCGATG AAGGCTTCGA CGGCTTCTAC
CAGCCGCAGA AGGGCAACTA CTTCGAGAGC CCGGTCATCG ATGTCGGCGA CTACAGCGAT
GTGCGCCTGC AGTATCGCCG CTGGCTCACG GTCGAGGATT CGCGCTGGGA CGACGCCATC
ATCTACGTCA ACGGCCGGCC GGCGTGGCGC AACCGCCAGA CCCCGTCGGG CAAGGTGCAC
CATATCGACA AGCAGTGGAT GTTTCACGAT GTCTCGCTCA GCGGCCAGAT CCTGGGCGAT
ACCGCCCAGC TGCGGTTTGC GCTCGAGACC GACGGCGGCC TGCAGTTCGG CGGCTGGAAC
ATCGACGACG TGTGCATCGT GGCCGCGCCC GACGCCATCT GCGGCAACGG CACGGTCGAG
GGCACCGAGC GCTGCGACGC GGGCGACGCC AACAGCGACA CCGAGTCCGA CGCCTGCCGC
ACCAACTGCC GCACGGCCTT CTGCGGCGAC GGCGTGCGCG ATCGCTACGA GCAGTGCGAC
GACGGCAACG ACGACCCCGA CGACGGCTGC ACGCCGGCCT GCTTCTTGCC CTTGCCCGAG
CGCGGCTGTA GCGTGCGCCC GGGTGGCGCT GGCGATGGCG GCGCTGCCGG GTTGGCCCTG
CTCGCGCTGC TCGGGCTTGT CGGACGCGCG TACCACACGC GGCGCCGCGG CCGCGCGCGC
GCCTGA
 
Protein sequence
MTHNRALARG LSVAALLAFA AADDAPTAHA FTGADTPGIA GVEHAAEPGL EVAEGPRARA 
QPAVTWNRPP AQRAAAWERF VADTGTAWTP MWDADTAMPL RIAGAGMAMP GSVASADKAA
DYARAFLAEY IDLLAPGSRP ESFHVVGNDL SDGVRAVGLY QYHEGMRVLG GQLSFRFKND
RLILVASEAL PDIALPAVSY TTAEAVVRDA ALSWVAGEVG KAWVTETSGP YVLPVISTGR
VSYHTVMQAT VEGRAPTSRY RVYIDASSGE PVAREQMLLY ADAQLLYNVP ARYPEGDRAD
LPASFTEVVY EDESYFTDGA GVFSWDGEGA ASVSASVSGE LVTVSNQRAP DESTVFEVAL
TPAGTGVWDA RDDEFVDAQL TTFVHSNIVK EYVRVFAPGL KYLDEQLLAR VNIDDTCNAF
SDGTTINFFR ASGQCANTGR LPDVVYHEFG HSMHWQSLVP GVGAFDGAFS EGLSDYLAAT
ITGDPAMARG FFYGDEPLRH LDPEDFEHSW PRDIAGVHYT GLIFGGAMWD LRKELVALYG
EEEGVAVANR LYYAAVLRAS SIPATYFELL AADDDDGNLA NGTPHECLIN DAFGALHGLR
EIGNEHIPLG IQPPEREGYS LSARLQGTNA RCAGDEVLSV IVRWQRRGSE VGEDLEATLQ
DGGDGVYEAT IPAQPAGSTV RYQVVVEFAN GGVITFPDNP AWEYYEFYVG ELIELYCTDF
ESDPYAEGWS RGQTRGVATG GANDWQWGRP LGKAGDPAAP YSGAASMGND LGDEGFDGFY
QPQKGNYFES PVIDVGDYSD VRLQYRRWLT VEDSRWDDAI IYVNGRPAWR NRQTPSGKVH
HIDKQWMFHD VSLSGQILGD TAQLRFALET DGGLQFGGWN IDDVCIVAAP DAICGNGTVE
GTERCDAGDA NSDTESDACR TNCRTAFCGD GVRDRYEQCD DGNDDPDDGC TPACFLPLPE
RGCSVRPGGA GDGGAAGLAL LALLGLVGRA YHTRRRGRAR A