Gene Information |
Locus tag | Hoch_1997 |
Symbol | |
ID | 8544379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2755305 |
End bp | 2757518 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646386701 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_003266436 |
Protein GI | 262195227 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
|
|
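The coordinates and lengths in the table above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(2755305, 2757518)
print(length)                  # 2214
print(protein_length(length))  # 737
```

Because the gene is unspliced and lies on the minus strand, the same span length applies; only the reading direction differs.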
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.107987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.248873 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCGCC CGAGCCTCAC GCTGCTCTCA TCGAGCGATG GCGACGTGTA TAGCGAGCCC ACGGTCGAGC TTCGTTTCCG CATCGATAAT GCGCCCGCGC TCTCGTACCG TATCGTCGTC GACGACGTAC AGGTCAGCGC GGTCTCGGGT GCATTCGCGG TTGGCGAGGA ACTCACCGCG CTCCTCACCT TGGAGGAGGG CATCCGCTCG GTCGAGATCC TGCTTTTCGA CGGCGACGGG GTGGCCGATA GCGAGCTTCT GCTGCTCGCG ATCGAGCTCC CCGCAGGGCC GTCGATCGTC CTCGACGCGG CGCTGCCAGC GACGAGTTTC GAAGAGAGCC TGGTCGTTAG CGGTCGCGTC GATGGGGGGC GTCCGCTCGT GTCGCTCACC CTCGTGAGCG GCGACGAGCG CTCGGACATC GAGGTCACCG AGGCCGACGG CGGCGCTCTG TTCTCCGCGC TCGTACCGCT CACGCTGGGA GACAACCCCT TCACACTGCA GCCCGTCGAC GATTTCGGAC GAACCGACGA GAAAACCTGG AGCGTGCTCC GTTCTGTCGA TACGGAACCG CCACACGTCG ACGGGCGCGA CCACGTGCTC GCCGTAGGCG ACGACGGTTC CCTGTGGGCC TGGGGGCTCA ACGGTAGCAG CCAGGTGGGC CCCGAGGGGA TCGGAGGCTT CGTGGACGAC GTGCTCAGCC CGGTGGTGCT CGCCGGGGTA GACGACGCGT TCGCCGTCTA CGCCAACGGC AACCAGGGGT TCTATGAGGA CGGCGCCGGT CAGCTCTGGG GATGGGGACA GAACGGCGCT ACCGGCAACC TCGGCATCCC GGCCGAAGGC GACGTCGCCG TGCCCTCGGC GCCGGTCTTC GGTATCCGCG GCGTTGTCGA TGTCGCCATC GGCGCACTGC ATGGCGTCGT CGTCGACGGC GCTGCCGAAG ACGGCACACC GCTGCGTGAC AGCGACGCCG GACGCTCGTC CGAAGGCGAC ACAGTAGATG CCTCCTCGCT GGCAGATGCC GCGAGCGACG AAAATTCGGA CGCGAGTCCG ACGTCGATTC CCGATCCGCT GCGCATTGGC GGTGCGCTCA CGGCCACAGA GTACGGGAGC CGCCCCTTTT TGACCACCGC GCCCGCTCTC GACCTCGGCG GCGCGCCAGC CGTCTCCTTT GGCCGCGAAC TCTTCGTCGC CGACTGGGAC GCCGCCCCCG GCTCGCGCGA ACTCATCGAC GGCCTTGGGC CCCTCTACCA TGCTCTCGCC TGCCTCGGCT GCCACCCCGA GAGCGGGCGA GCTGCGTCAC TCGAAGCGGG GGGCTCCGTG GCCCCCGGTT TGCTCCTGCG CCTCGTGCGG ACCGAGGGCG AAGGCCTCGT GCACGACCCG TCACTAGGCG GTCAGCTACA GACGCTCGCG ATCGCCGGGG TGCCTGCCGA AGGGACGGCG CAGTGGGAGC CCGCAGCCAT CGCCGAGTTG GCGCCACACT ACCATGAGGT CGCAGCTCGC GCGCCTGCGC CTCGCTTCAC GGTCGCGATC GACCCCGCCT ACCCCGCGCT CGCGGACAAC ACGAATTCCG GCCCGCGGCT CGCCCCTCAG CTCGTCGGCG TCGGTCTGCT CGAGCAGGTC CCAGAGGACA CGCTGCTTGC CTGGGAAGAT ATCGAGGACG CGGATGGAGA CGGCATCTCG GGGCGGGCGA GTTGGGTCGC GACGCCCTCC GGCCCACGGA TCGGACGCTT TGGCTGGAAC GGTGACGGAC TCCAGGCCGT CGCCACCTTC CTCTCGTTGC TCGCGGTACC GGCTGCGCGT 
CGCGAGAGGC GGGATCCGCA GGTAGAGGAA GGCGCGTCGC TGTTCCGTGC CGCGCGTTGC GATGCCTGCC ACCGCGAGAC GCTCACCACC GGAGCGGTGG CGACGCAAGC GCTGCTCTCG GAACAGACCT TTCACCCATA TACCGATCTG CTGCTACACG ACATGGGCGC CGCACTCGCC GATCCTGTCG GCGAAGGCGA CACCGCCGCG CGCGAGTGGC GTACACCGCC GCTCTGGGGC CTCGGCCTGA TCGAGGAGGC GGCCAACGCG CGCTTCCTCC ACGACGGGCG TGCGCTCTCG CTCGAGGACG CCATCCTTTG GCATGGCGGC GAAGCCGAGG CCGCGCGCGC GGCCTTCGCC GCGATGGGCC GCAGCGACCG CGACGCGCTG CTGGCGTTCG TGCGCTCACT GTGA
|
Protein sequence | MRRPSLTLLS SSDGDVYSEP TVELRFRIDN APALSYRIVV DDVQVSAVSG AFAVGEELTA LLTLEEGIRS VEILLFDGDG VADSELLLLA IELPAGPSIV LDAALPATSF EESLVVSGRV DGGRPLVSLT LVSGDERSDI EVTEADGGAL FSALVPLTLG DNPFTLQPVD DFGRTDEKTW SVLRSVDTEP PHVDGRDHVL AVGDDGSLWA WGLNGSSQVG PEGIGGFVDD VLSPVVLAGV DDAFAVYANG NQGFYEDGAG QLWGWGQNGA TGNLGIPAEG DVAVPSAPVF GIRGVVDVAI GALHGVVVDG AAEDGTPLRD SDAGRSSEGD TVDASSLADA ASDENSDASP TSIPDPLRIG GALTATEYGS RPFLTTAPAL DLGGAPAVSF GRELFVADWD AAPGSRELID GLGPLYHALA CLGCHPESGR AASLEAGGSV APGLLLRLVR TEGEGLVHDP SLGGQLQTLA IAGVPAEGTA QWEPAAIAEL APHYHEVAAR APAPRFTVAI DPAYPALADN TNSGPRLAPQ LVGVGLLEQV PEDTLLAWED IEDADGDGIS GRASWVATPS GPRIGRFGWN GDGLQAVATF LSLLAVPAAR RERRDPQVEE GASLFRAARC DACHRETLTT GAVATQALLS EQTFHPYTDL LLHDMGAALA DPVGEGDTAA REWRTPPLWG LGLIEEAANA RFLHDGRALS LEDAILWHGG EAEAARAAFA AMGRSDRDAL LAFVRSL
|
| |
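The gene and protein sequences above can be cross-checked with a short sketch that strips the 10-base grouping, computes GC content, and translates the coding strand with translation table 11. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with GTG and ends with TGA) and contains only A, C, G and T. The helper names `clean`, `gc_content` and `translate` are hypothetical, introduced here for illustration.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11
# (table 11 differs only in the set of permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11; as the first codon they are read as Met.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in tens) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in TABLE_11_STARTS:
            aa = "M"  # alternative start codon, e.g. GTG in this gene
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
excerpt = "GTGCGGCGCC CGAGCCTCAC GCTGCTCTCA"
print(translate(excerpt))  # MRRPSLTLLS
```

The excerpt reproduces the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content listed in the record.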