Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3869 |
Symbol | |
ID | 8546264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5323485 |
End bp | 5324900 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646388540 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_003268261 |
Protein GI | 262197052 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.421634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.273576 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCC CCATCGTGAT CGCCTCGGCG CTCGGGCTCG GCCTGGGCGC CTGCACCGGA TCGGACGGCA TGAACCCGCC GCCCGATCCG CCACCCGTCA CCCCCGTGGA CCCGGCCCAA GCGCTGTCCG GCGGCGACAC CGGCACGGTC TTCGACAGCG GCCGCGACGC CTACTCGCAC CCGCTGAGCG CGCTCGACAC CGACGAGGAG CGGGCGTTTT TCCGCGGCCG CGCGGTGTTC CGCGACGGCT GGGTCGAGGC CCCGTCGTCG ACCACCACGC GCGACGGCCT GGGCCCGCTG TTCAACGCGC GCTCGTGCAT CGCCTGCCAC GTGCGCGACG GCCGCGGCCG CCCGCCCATC GGCGACGAGC CGCTGCTGTC GATGCTGTTT CGGCTCAGCA TTCCGGGCGC CAACGCGCTC GGCGGACCCG AGCGCGAGCC CACCTACGGC GGCCAGCTCC AGCCCTTTGG CGTGTTCCCG CTGGTCGGCG ACGGCGACGT CGAGATCGTC TATGAAGAGG TCGCCGGCAG CTACGGCGAC GGCAGCGCGT ATACGCTGCT GCGGCCGCGC TACAGCTTCC GCGACCTGGC CTACGGGCCC ATGGCCGAGG ACGCCCTGTC GTCGCCGCGG GTGGCGCCCT CGGTGTACGG CCTGGGCCTG CTCGAGGCCG TGCCCGAGTC CGCTATTCTC GACCTGGCGG ACGAAGACGA CGCCGATAGC GACGGCGTCT CGGGCCGCCC CAACTACGTG TGGGACGTCG AGGCCGAGGC CGAGGCCCTG GGCCGCTTCG GCTGGAAGGC CAACCAGCCC TCGCTGCTGC AGCAGACCGC GGGCGCGTTC AACGGCGACA TCGGCATCAC CTCGCGCCTG TTCCCGGTCG ACGATTGCTC GACCGTGCAG CTCGATTGCG TCGAGGCGCA ATCCGGCGGC GACCCCGAGC TGATCGAGGC CTTCCTCGAG GACGTCACCT TCTACACCCG CACCCTGGCG GTGCCCGCGC GCCGCGACAT CGACGACGCC GAGGTGCGCG CCGGCCAGAA GCTGTTCTCG GACATGGGCT GCGCCGCCTG CCACGCGCCC GCGCTGCGCA CCGGCGCCAG CGAGGTCGCC GCGCTGGCGA ATCAGGACAT CCGTCCGTAT ACCGACCTCT TGCTGCACGA CATGGGCCCG GAGCTGGCCG ACGGCCGCCC CGATTTTCTG GCCACGGGCA GCGAGTGGCA GACCCCGCCG CTGTGGGGCG TGGGCCTGCT GGCCACGGTC AACGATCACT CGCGCCTGCT GCACGACGGC CGCGCCCGCG ACCTCGCCGA GGCCATCCTG TGGCACGGCG GCGAGGGCGA GAGCGCGCGC GAGCGCTTCC GCACCGCCTC CGCCGAGGAG CGCGCCCAGA TCATCGCCTT CCTGGAGTCC CTGTGA
|
Protein sequence | MRTPIVIASA LGLGLGACTG SDGMNPPPDP PPVTPVDPAQ ALSGGDTGTV FDSGRDAYSH PLSALDTDEE RAFFRGRAVF RDGWVEAPSS TTTRDGLGPL FNARSCIACH VRDGRGRPPI GDEPLLSMLF RLSIPGANAL GGPEREPTYG GQLQPFGVFP LVGDGDVEIV YEEVAGSYGD GSAYTLLRPR YSFRDLAYGP MAEDALSSPR VAPSVYGLGL LEAVPESAIL DLADEDDADS DGVSGRPNYV WDVEAEAEAL GRFGWKANQP SLLQQTAGAF NGDIGITSRL FPVDDCSTVQ LDCVEAQSGG DPELIEAFLE DVTFYTRTLA VPARRDIDDA EVRAGQKLFS DMGCAACHAP ALRTGASEVA ALANQDIRPY TDLLLHDMGP ELADGRPDFL ATGSEWQTPP LWGVGLLATV NDHSRLLHDG RARDLAEAIL WHGGEGESAR ERFRTASAEE RAQIIAFLES L
|
| |