Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_6570 |
Symbol | |
ID | 8548987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 9017102 |
End bp | 9019882 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646391231 |
Product | hypothetical protein |
Protein accession | YP_003270930 |
Protein GI | 262199721 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATAT CGCCCATCCA CGCCGACGTG CGCGCGGCCG GTGAGCGCGA CGAGTCCGGG CGGCGCCTGG CGCTCGGCTT CGTCCGCGCG GTCGAAGCTC TCGGCGCAGG CTTTCTCGGC GCATCCGAGA ATCTTCGTCT GCGCCAGGCG ATCGCCGACG GCAGCATCGA TGCGCCGCTG CTCTACCGAC AGCTCATCGC GCTGGTGTAC CGCCTGGCGT TCCTGCTCAT CGCCGAGGAG CGCGGGTTAG TGCGCGATCC ACGCGCGCGC GCGGCCGCGG CGCCGGGCAC GGCCGCGCAC ACCGGACTGC GCGCGCTGCG CGAGCGGCTG GCGACACCGG CGGGCAGTGG CGAGGGCGAC GCCTGGGCCG AGCTGCGGGC GCGAATGGAC GCGCTAGCGG CGGACTGTCC GCGGCGCGGG CTGCCGGCGC TCGGCGGCGC GTTGTGGTCG TGCAATCGCG ACGTGGCCAC TCCCGAGCTG GGCTGTCCCT GGCTGATCGA TGCGGATTGC CCGCGCGAAC ATCTGGGCGT GGCGCTGGGT GCCTTGCTCG ACGCCCCGGC CGCCGGCGGC GACGCAGCGG ATGCCGAGGA GGCTGCGCCC GCGCGCCCGG ATTGGGCAGA GATCGCGGGC GATGAGCTCG GCAGCGCCTA CGAGCATCTG CTGGCCCGAC AGCCGATCTT CGTCGGACAG GCGCCCGATT TCCTCCTGCG CCCGGCGCCC GAACACGCGC GCAAACGCTC GGGCAGCTAC TACACACCGG CCGCCCTGGT CGAGGAGCTG CTCGCGGCCA CGCTCGACCC GGCGCTCGAA CGCGCCGCGC GCGCGCCCGA TCCCGCGGCC GCCATCCTCG CGCTGCGCGT CTGCGATCCC GCCTGCGGCG CCGGCAACGT GCTCGTCGCC GCGGCCCGGC GGATGGCGGC GCGGCTCGCA CACGCGCGCG GCCGCGGCGA CGACCCAGCC GCGCGGCAGC TCGCCCTGCG CGCGATCGTC GCCCGCTGCA TCCACGGCGT GGATATCGAT CCCATGGCCG CCGAGCTGTG CAAGATCAGC CTGTGGCTGG CGGCCGCCGA GCCGGGCACC GGCCCGGGTC GCTTCGACTC CCGCATCCAG TGCGGCAACG CGGTCCTCGG CGCCACCCCG GCGCAGATGC GCGAAGGCAT CCCGGCGGCC GCGTTTCGCG CCGTCGCCGA CGAGGACCGG AGCGTGACCC GCCGCCTGGC GCAGCGCAAT CGTCTCGAGC GCCAGCGAGC TGCGCGCGCG CAACTCGCGA GCGCCGCGCG CGCGGCGAAC ACCGCGAGCG CCGCAGCGCC GAGCGCGCCC TCGCCCGCCG CCGCCGACGC CTGGTGTGCG GCCTTTGTGT GGCCCAAACG CACCGGCGCC GACGAAGACG CCGCCCCCAC CCACGGCGCG TGGCTGCGCT TCGCGCTCGC TGACGCAGCG GGCGCTGACG CGGCGGGCGC TGACGATGAC GCTGCCGCCA CGGCGGGCGT CGCCGACAGC CACGCGCACC TGCGCGAGCG GGCGAGCGCG CTCGCCCGCG CCCACCGCTT TTTTCACTGG CAGCACCGCT TCCCGCACAT ATTCGTCCAC GCGGACGAGG ATGCCGAGCG CTCGCCGGCG GGCTGGGCCG GAGGCTTCGA CGTGGTCATC GGCAACCCTC CGTGGGGACA GAAGCTGGTC GACCCGATGG CCATGCCAGC GCGCCTGTTG CGGCAGCGCT TCGCCTCGCT CGACGGCATC CCCGACGCGT TCCGCCCCTT TCTCGAGCTA GCCACCGAGA TCACTGCGCC CGGTGGCAGC TTCGGCTTCG TCTTGCCCGA CACCCTGCTG CTCAAGAACT ACGAGCCCAC GCGCCGGCTG CTCCTCGACC GCTGCCGCCT GCGCAGCCTA TCGTGGTGGG GCATGGCATT TCCCGGGGTG ACCATGGACG TCATCACCCT GAGCTGCGCG CTCGGCGAGG CCACTCCCGA GCACACCATC GAGGTCGCCG TGCGCGCGCC CGCCGAGCCG CTCCGCCACC GCATCGAACA GCGCGCGTTT CGCCACACGC CGCGCCACAC ATTCAATCTG CACCTCACCG CCGAGCGCCT GGCGCTGCTC GCTCACCTGC GCCGAGGACG ACCGCTGCGC GACTGTTTCG AGGTTCACGA GGGCGTCCAC AGCGGCAATA TCCGCGGCGA GCTGTTCGTC GAGCGCGCGC TCGACGATTC CTGCTACCCG CTGTATTTCG GACGCGACGA GCTGCGCCCC TTCCGCCTGC TGTGGCGCGG GCGCTACCTG CGCCGCGCGG CGATTCCCGA GCGCAACAGC AAAGCGCGCT ACGCGGGCGC CGGCCAGCGC GCCTGGCACG AGACGCCCAA GCTGCTGCTG CGGCGCACGG GCGACAGCGT GCGCGCGGCC GCCGATCTCG AGGGTCGCTA CGCCAGCAAT AACTTCTTCC TGGTGCTCGC CCGCCGCGAC GTCGGCTGCG CCCTCAATCT CGACGGCCTG TGCGCGCTGC TCAACTCCGC GCTCATGACC GAATTCTTCC GCACCATCGA GCCGCGCCGC GGACGCGCCT TCGCCGAGCT GAAGATCAAG CACATCGGCG AGTTCCCCCT GCCGCCCGGC TGTGTGCGCG GCGACGGTTC TTACATCGAC AGCTCGTCCG AACAAGGCTG CACCGCCCTC AACCACCTGG GCGCGGCCTG CCGGCAGGCG GCCGCTGTCG CCGACGAAGA CGCGCTGCTC GCGCTGCAGG CGCACACCGA AGCTTTGGTT CGCCGCCTCT ACGGGCTGTA G
|
Protein sequence | MTISPIHADV RAAGERDESG RRLALGFVRA VEALGAGFLG ASENLRLRQA IADGSIDAPL LYRQLIALVY RLAFLLIAEE RGLVRDPRAR AAAAPGTAAH TGLRALRERL ATPAGSGEGD AWAELRARMD ALAADCPRRG LPALGGALWS CNRDVATPEL GCPWLIDADC PREHLGVALG ALLDAPAAGG DAADAEEAAP ARPDWAEIAG DELGSAYEHL LARQPIFVGQ APDFLLRPAP EHARKRSGSY YTPAALVEEL LAATLDPALE RAARAPDPAA AILALRVCDP ACGAGNVLVA AARRMAARLA HARGRGDDPA ARQLALRAIV ARCIHGVDID PMAAELCKIS LWLAAAEPGT GPGRFDSRIQ CGNAVLGATP AQMREGIPAA AFRAVADEDR SVTRRLAQRN RLERQRAARA QLASAARAAN TASAAAPSAP SPAAADAWCA AFVWPKRTGA DEDAAPTHGA WLRFALADAA GADAAGADDD AAATAGVADS HAHLRERASA LARAHRFFHW QHRFPHIFVH ADEDAERSPA GWAGGFDVVI GNPPWGQKLV DPMAMPARLL RQRFASLDGI PDAFRPFLEL ATEITAPGGS FGFVLPDTLL LKNYEPTRRL LLDRCRLRSL SWWGMAFPGV TMDVITLSCA LGEATPEHTI EVAVRAPAEP LRHRIEQRAF RHTPRHTFNL HLTAERLALL AHLRRGRPLR DCFEVHEGVH SGNIRGELFV ERALDDSCYP LYFGRDELRP FRLLWRGRYL RRAAIPERNS KARYAGAGQR AWHETPKLLL RRTGDSVRAA ADLEGRYASN NFFLVLARRD VGCALNLDGL CALLNSALMT EFFRTIEPRR GRAFAELKIK HIGEFPLPPG CVRGDGSYID SSSEQGCTAL NHLGAACRQA AAVADEDALL ALQAHTEALV RRLYGL
|
| |