Gene Information |
Locus tag | Hoch_1574 |
Symbol | |
ID | 8543956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2153564 |
End bp | 2156476 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646386283 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_003266018 |
Protein GI | 262194809 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
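The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Hoch_1574.
start_bp, end_bp = 2153564, 2156476

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2913, matching "Gene Length"
print(protein_length_aa(length))  # 970, matching "Protein Length"
```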
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.441656 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGCA AGAGAACTCG CAAGATAGGG GGCCGTGCGC TCGGCGCATG GCGCCCGGCG TTGGCGGTTG CCGTGCTCGG CAGCGCTGCC GGCTGCGGCT ACGAGGCCGC GGGCAGCTTC GATGACAAGG CCGGCGCGCT GCTGGCCCGC TCCTTCGAGG TCGAGGACGG CGCGCGCAGC GGCACGCTCA ACCGCGCCTG GGGCGCGATC TTCGACAGCG CCGGCGACCA GGTGTGCTGG AGCGGTATCA ATATGGCCGG CGTCGCCAAC GCCAGCGTTC GCTACAGCAA CGGCGAGGGC TTCGACGACC GCGTGCAGCT CACCTATAAC GGTGCCGCCC TGGCCACCGT GGACCTGCCC AACTGCATCG CCAACGGCGC CTGGGAAGGC GACTGCGGCG ACGTCTTGGC CGAGTTCGCG CCGCAGAGCG GCAGCGGCAC GGTGTGCCTG GTGACCTCTG GTCCGGGCTG GAACGGCGCG CTCAACCGCC TCGAGCTCGA CGCCGCCTGC GAGGGCTCGA GCTGTGGCGG CGGCGGCGGC GACGCCATCA TCATCCGCGC CAATCGCGGT CAGGGCTACT TGCGCCTGAG CGGCGACGGC TACCTCAACT GGACCGGCGG CGACGCTGCC TCGGCCGAGG TCTTCGAAAA GGTCGAGGCC AGCGGCACGC GCTTCAAGCT GCGCGCCCAG AGCACGGGCA ACTTCGTGCG CTTCGACGCC GCCGACGATC TGGTCGCCAA CGCCTCGCAG TCGCAGGCGA CGACCTTTGA CGCGTCCGCC TGCGCCTCGC CCTTCGTGAG CCTGCAGGCG CTCGACGACG GCGACGGCGC CAACTTCGTC GCCACCGAGG ACACCGGGCG TCTGCGCGCG CGCACGCCCT ACTGCGACCC CAACGACGCG GCCGCGTGGG AGAAGTTCGA GCTCTTCGCC GCCGGCGACC CGCCGACCGA CCCGCCGCCG ACCGATCCGC CGCCGACCGA CCCGCCGCCG GGACAGGATC CGGGCGCGAT CGTGGCGCTG TTCAACCAGA GCACCCCGCG CGAGCCGGCC GTCTTCGTCG ACACCGGCTC GGCCCTGATC ACGCGCTTCG CCGATCGCGG CCGCGACCGC CACGCGCGCG AGTCGAGCTT CGCCTCGTAC GAGCACTACC TCACCTGGTA CTGGGAGGAT CGCACCGCCG AGGTGACCAT CACCGACACC GTGGGCCGGG GCGGCTCGGA GATCCGCTTC GACGTCGTCA CTCAGCACAA GCTGGGCGCG CGCGAGATGC GCATGGGCTT CCGCGGCATC AACACCGTGG CCGAGTACTG CGATAACTCG CCGCTGATTC CCGCGTATCC CGACGGCAGC GAGATGTCGC TCGGCGACTA TGCCAACAGC CCGCCCGACG GCTTGCACTA CTACTTCAAG GTGGTGCGCC AGCACTTCAC GCCCATCACC TGCACGATCT CGGCGCTCAG CCCCGGCCAG AAGATCGAGA TCGAGATGAG CCAGTTCCTC GACGCCCCGC CCAACGGGCG CGCCAACTAC TACGGCACCA CCTACCTGTA CGTCGTCGGC CAGGGCATGA TGCCCTGGGA AGGCACCGGC GAGCTGGCCA ACAACAACCC CGGCGGCATG TACGGCGTCC CCGCGGATTC GATGCCGATC CCGGTGGGCG CGCGCCTCGG CGGCCAGACC ACCACGCACC GCAACGAGTC GGCCGAGCCG GACAACCTGT TCATGCAGAT GGCCACCAAC CTGGCGCCGC AGAACGGCCA GCGCTTCGTT CTCGGACGCC GCGTCCTGCA TACCGACTTC 
GTCAACGGCG AGCATGATGA GCCTGGCAAT CCCGCCTGGA GCCAGCAGAG CGGCAAGGGC GGCCCGCTGT TGATCAACCG CACCTGCAAT AGCTGCCATA CCAAGAACGC GCGGGCGCTG CCGGTGGCCC CGGGCCAGCC GCTCGACAAG TGGGTGTTCA AGGTGGCCGA CGCCAACGGC AATCCCCACC CGCAGGTCGG CGCGGTGCTG CAGACCTCGA GCACGGGCGG CGCCTCCGAG GGCGGCGTGA CCATCGCCAG CTACAGCCAG AGCAACGGCC TGCGCAGCCC CAACTACGCC TTCAGCGGCG TCAACCCGGC GCGCTTCTCG GCGCGTATCT CGCCGCAGCT CGTCGGTATG GGCCTGCTCG AGGCGATTCC GGAGTCCGCG ATCCTGGCGC TGGCCGACCC CAATGACGCC AACGGCGACG GCGTCTCCGG CCGCGCCAAC GTGGTCAACG ACACCGCCGG CGTGACCCGC CTGGGCCGCT TCGGCTGGAA GGCCGGCATG CCCGACCTGC GCCACCAGGT AGCCAGCGCG CTGCGCACCG ACATGGGCGT GCTCAGCTCG GTGTACTCGA CGCCCGACTG CGGCAGCTCG CAGGGCAACT GCGGTCCCAA CGGGGCCGAG CTGTCCGATG CCGACCTCGC CAACCTGGTG ATCTACACCT CGCTGCTCGG CGTCCAGCCG CAGACCTTCC ACGACGACGC GCAGGTGCGC TCGGGCGCCG ATGTGTTCCA GCGCATCGGC TGCGCGAGCT GCCACACGCC CAGCTTCCAG ACCTCGCAGT TCGCGCCCTT GGCCGAGCTG CGCAGCCAGA CCATCCGCCC GTATACCGAC CTCTTGCTGC ACGACATGGG CTCGGGTCTG GCCGACAACC TGGGCGAGGG TCAGGCCAGC GGCTCCGAGT GGCGCACGCC GCCGCTGTGG GGCATCGGCA AGACCCGCGA CATGCACGGC GGCCAGGAGG CGTATCTCCA CGACGGCCGC GCGCGCACGC TCGAGGAGGC CATCCGCTGG CACGGCGGCG AGGGTCAGGG CGCCAACGAC CGCTTCCAGG CGCTCTCGGC CGGTGAGCGC GCCGCGCTGA TCGCCTTCCT GCGCTCGCTG TAG
|
Protein sequence | MLRKRTRKIG GRALGAWRPA LAVAVLGSAA GCGYEAAGSF DDKAGALLAR SFEVEDGARS GTLNRAWGAI FDSAGDQVCW SGINMAGVAN ASVRYSNGEG FDDRVQLTYN GAALATVDLP NCIANGAWEG DCGDVLAEFA PQSGSGTVCL VTSGPGWNGA LNRLELDAAC EGSSCGGGGG DAIIIRANRG QGYLRLSGDG YLNWTGGDAA SAEVFEKVEA SGTRFKLRAQ STGNFVRFDA ADDLVANASQ SQATTFDASA CASPFVSLQA LDDGDGANFV ATEDTGRLRA RTPYCDPNDA AAWEKFELFA AGDPPTDPPP TDPPPTDPPP GQDPGAIVAL FNQSTPREPA VFVDTGSALI TRFADRGRDR HARESSFASY EHYLTWYWED RTAEVTITDT VGRGGSEIRF DVVTQHKLGA REMRMGFRGI NTVAEYCDNS PLIPAYPDGS EMSLGDYANS PPDGLHYYFK VVRQHFTPIT CTISALSPGQ KIEIEMSQFL DAPPNGRANY YGTTYLYVVG QGMMPWEGTG ELANNNPGGM YGVPADSMPI PVGARLGGQT TTHRNESAEP DNLFMQMATN LAPQNGQRFV LGRRVLHTDF VNGEHDEPGN PAWSQQSGKG GPLLINRTCN SCHTKNARAL PVAPGQPLDK WVFKVADANG NPHPQVGAVL QTSSTGGASE GGVTIASYSQ SNGLRSPNYA FSGVNPARFS ARISPQLVGM GLLEAIPESA ILALADPNDA NGDGVSGRAN VVNDTAGVTR LGRFGWKAGM PDLRHQVASA LRTDMGVLSS VYSTPDCGSS QGNCGPNGAE LSDADLANLV IYTSLLGVQP QTFHDDAQVR SGADVFQRIG CASCHTPSFQ TSQFAPLAEL RSQTIRPYTD LLLHDMGSGL ADNLGEGQAS GSEWRTPPLW GIGKTRDMHG GQEAYLHDGR ARTLEEAIRW HGGEGQGAND RFQALSAGER AALIAFLRSL
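The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the pipeline used by the database: it uses the standard codon assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in the set of permitted start codons, which this sketch does not handle). The names `CODON_TABLE`, `clean` and `translate` are hypothetical, and the input is only the first 30 bases of the gene sequence shown above.

```python
from itertools import product

# Compact encoding of the standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a forward-strand CDS in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTTACGCA AGAGAACTCG CAAGATAGGG"
print(translate(first_blocks))  # MLRKRTRKIG
```

The output matches the first block of the protein sequence. The same function can be applied to the full gene sequence after joining the two lines on which it is printed.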