Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3784 |
Symbol | |
ID | 8546177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 5197778 |
End bp | 5200693 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646388454 |
Product | Protein of unknown function DUF2344 |
Protein accession | YP_003268177 |
Protein GI | 262196968 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.539179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.106507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCACA TCTACGCAGA CTTCATCGAC CGCGTCGCCA AGCCGGCACG CTACCTGGGC GGCGAGTACC TGTCCGTCGT CAAGCCGGCC GAGGAAGTCG ACGTCCGCGT CGCCCTGGCG TTCCCCGATG TCTACGACAT CGGCATGTCG CACCTCGGCA CCAAGATCCT CTATTCGCTG CTCAACAAGC AGCCGCGGAT CGCGGCCGAG CGGGTGTTCG CCCCGTGGGT CGATATGGAG GCCGAGCTGC GCGAGCGCGG GCTGCCCCTG GTGTCGCTGG AGTCGTCCAC GCCGCTCTCG GACTTCGACG TCATCGGCTT CTCGCTGCAA TACGAGCTGA CCTTCACCAA CGTCCTCACC ATCCTCGACC TCGGCGGCGT CCCGCTGCGC GCGGCCGAGC GCACGGACGA TGACCCCCTG GTGCTGTGCG GCGGTCCCGT GGCCTCGCAC CCGGAGCCGG TCGCGCCCTT CTTCGACGCC TGCTACATCG GCGAGGCCGA AGAGGAGCTG TCGGGGCTGC TGCTCGAGTG GGCCGAGATG CGCCGCGCGG GCCGCGCCCG GCTCGACGCG CTGGCCGAGC TGGCCGGCCG CTATCCGATC TATGTGCCCG CGCTCTACGA CACCGAGCGC GACCCCGAGA CCGACATGAT CGTGGTCGGC GCGCCCCGCG ATGCCCGCGC GCCCGCCCGC GTGCGCCGCG GCGTGGTCCG CGATATCGAC GCCTATCCCT TCCCCTCGGA TACGCCGGTG CCCTACGCCG AGGCGGTGTT CGACCGCGCC GCGGTCGAGA TCGCGCGCGG CTGCACCGAG GGCTGCCGCT TCTGCCAGGC CGGCATGATC TACCGCCCGG TGCGCGAGCG CTCGCCCGAG TCGATCGCCA AGAGCGTGAT CGACAGCGTC GAGAAGGCCG GCTACGACGA GACCGCGCTC ACCTGCCTGA GCACGGCCGA CTTCTCGAGC ATCACGCCGC TGGTCAAGAA CGTGATGAGC GAGCTGCGCA AGCGCAAGGT CACGCTGTCG GTGTCGTCGC TGCGCGCCTA CGGCCTGGGC GAGGACATCC TCGACGAGAT GGCCTCGATG CGCATCACGG GGCTCACCTT CGCGCCCGAG GCCGGCACCC AGCGCATGCG CGACGTGGTC AACAAGAACG TGACCGAGGC GCATATCGAG GAGTCGACGA CGCGCGTGTT CGCGCGCGGC TGGCACCGGC TCAAGCTGTA CTTCATGATC GGCCTGCCGA CGGAGGAGGA CGATGACGTG GTCGGCATCG TCAACACCGG CCAGCGCATG CTGCACATCG GCCGCCGCGA GGCCGGCAAA CGCGCCGAGG TCACGGTCAG CGTGTCCTCG CACGTGCCCA AGCCGCACAC GCCCTTTCAG TGGTGCGCCC AGGACTCGCT GCCCGAGATC AAGCGCAAGC AGCAGCTCCT ACGCGGCGCG CTGCGCGACC GCAACCTGCG CCTCAAATAC CACGACGCCG GCATCAGCTT CGTCGAGGGC GTGATGTCGC GCGGCGACCG GCGCGTGGCC GACGCCATCG AGATGGCCTG GCGCCGCGGC GCCCGCTTCG ATGGCTGGGA TGAGCTCTTC GACCTGGGCA TGTGGCAAGA GGTTTTCGGC GCGTGCGAGA TCGACGCCGA CGTGTACCTG TCCACGCGTC CGATCACGGC CCGGCTGCCC TGGGACCACA TCGATGTCGG TCTCGAGGAC GGCTTCCTGC TCGGCGAGTA CCGCAAGGCG CTCAAGAGCC GGCTGTCGCC GCCCTGCGGC AAGGTCGCCG GCCAGCTCGT GCACCACAAT AACCTCGACG ACGCGCGCGC CGACCAGCGC CGCCTGGTGT GCTACGACTG CGGCGTCGCC TGCGATCTGT CGAAGATGCG CAGCGACCGG CTGGTGGCCC TGGGCGCGCT CGGCGCCGAA CACGCGCCGC GGCGGCCCGA GCCCCGCGCC GAAGCGGCCG AGGCCAGCGA CACGGCGGCG GCGACGGGCG ACGCCCAGCC GGCGGCCGAC GGCGACAAGG GCGCGAGCGC CGAGACGGCG GCGAGCGAGC GGCCGCGCAA GGGCAAGAAG AGCCGCCGCG GGCCCAAGGT GTCGTTCCCC GACCTGCCCA AGGTGGGCTA CCGGCTGCGC TACGCCAAGC TGGGCCGCGC GGCCTATCTC GGGCACCTGG ACACCGGCCG CATGCTGGCG CGCCTGTTCC GCCGCGCGGA CCTGACTCTG GCCTACAGCC GCGGCTATCA CCCCAAGCCG ATCATCCAGT TCAGCCCGGC GCTGCCGCTG GGCGTGGCCA GCATGGGCGA ATTGCTCGAC GTGAGCGTCG AGGCGCCCTC GGCGGTGCCG GCCGAGGCGC TGCTGCGGCG GCTGCGCGAG GTCTCGCCCG AGGGCATCCT GTTCGGCGAT GCCTGGGCGC TGCCGCCGGG CAGCCCGGGC CTGGGCAAGC TGATCGAGGC CTACGATCTG CTGCTGGCGC CGGCGCCCGG TCTGCCCGCG GACGAGGCCG CGCTGATGCG CGTGGCCGAC GAGTTCCTGG GCCGCGCGTC GGTGCTGGTG CCGCGCAAGG AGCGCGAGAT CGATGTGCGC GCCTTCGTCT CGCGCATCGA CGTGCTGGCC GAGCGCGCGG CCGAGCGGCT GGCGGGCGCG CTGGGCTGGC CGCTGGCCGA GACCGCGAGC GCGCCCGCGC TGCTGCAGGT GCGGGTGCAC ATGACGCCGC AGGGCTCGGC CAAACCCACC GAGATCGCCG AGGCGCTGGG GCTGTGGGGC GACCCCGACC CGCGCGCGCC GCACGCGCTG CTGGCGCGCC TGGGCTTCCC GGGCGTCGAG CCCACGGCCG AGGACCACGC CCACGCCCGC GGCGAGGGCA TCCATCTGGC CGCCGCGCAC TCCGAGGAGG TCTCGGCCGC CTCGGCGCCG TCCTGA
|
Protein sequence | MRHIYADFID RVAKPARYLG GEYLSVVKPA EEVDVRVALA FPDVYDIGMS HLGTKILYSL LNKQPRIAAE RVFAPWVDME AELRERGLPL VSLESSTPLS DFDVIGFSLQ YELTFTNVLT ILDLGGVPLR AAERTDDDPL VLCGGPVASH PEPVAPFFDA CYIGEAEEEL SGLLLEWAEM RRAGRARLDA LAELAGRYPI YVPALYDTER DPETDMIVVG APRDARAPAR VRRGVVRDID AYPFPSDTPV PYAEAVFDRA AVEIARGCTE GCRFCQAGMI YRPVRERSPE SIAKSVIDSV EKAGYDETAL TCLSTADFSS ITPLVKNVMS ELRKRKVTLS VSSLRAYGLG EDILDEMASM RITGLTFAPE AGTQRMRDVV NKNVTEAHIE ESTTRVFARG WHRLKLYFMI GLPTEEDDDV VGIVNTGQRM LHIGRREAGK RAEVTVSVSS HVPKPHTPFQ WCAQDSLPEI KRKQQLLRGA LRDRNLRLKY HDAGISFVEG VMSRGDRRVA DAIEMAWRRG ARFDGWDELF DLGMWQEVFG ACEIDADVYL STRPITARLP WDHIDVGLED GFLLGEYRKA LKSRLSPPCG KVAGQLVHHN NLDDARADQR RLVCYDCGVA CDLSKMRSDR LVALGALGAE HAPRRPEPRA EAAEASDTAA ATGDAQPAAD GDKGASAETA ASERPRKGKK SRRGPKVSFP DLPKVGYRLR YAKLGRAAYL GHLDTGRMLA RLFRRADLTL AYSRGYHPKP IIQFSPALPL GVASMGELLD VSVEAPSAVP AEALLRRLRE VSPEGILFGD AWALPPGSPG LGKLIEAYDL LLAPAPGLPA DEAALMRVAD EFLGRASVLV PRKEREIDVR AFVSRIDVLA ERAAERLAGA LGWPLAETAS APALLQVRVH MTPQGSAKPT EIAEALGLWG DPDPRAPHAL LARLGFPGVE PTAEDHAHAR GEGIHLAAAH SEEVSAASAP S
|
| |