Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_4852 |
Symbol | |
ID | 8547259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6640324 |
End bp | 6642108 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646389525 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003269234 |
Protein GI | 262198025 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR00372] CRISPR-associated protein Cas4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.464981 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACT CAGACGACAC CTTGTCCGCT CCGTCCGCCT CGTCGCCCGC GGGCGCACAC GATCCGCCTG CGGGCGCACG GACCCAACAC ACGCCCGCGA GCACACCGCT GCTGCCCGTG CGCATGCTCA ACGAGTACGC CTACTGCCCG CGCTTGTTTC ATCTCGAGTG GGTGCAGCGC GAGTGGGCTG ACAACGCCTA CACCCTGGAT GGCAAGCGGG TCCACAAGCG CGTGGACAAG CCCTCGCGGC ATGGCTTGCG CTCTGCCGAC CGCGCGTCCG ACGATAGCGC TTCGAGCAAA GACGCGGGCC AACCCGAAGA CACCCTGTTT CAGCAGCACG CGCGCAGCGT GGACCTCGGC GACGACGCGC TCGGCCTGAT CGCGCGCATC GACCTCGTCG AGGCCGAGGG TGACCAGGCC ACGCCCATCG ACTACAAGCG CGGCAAGCGC CCGGACGTCC CCGGCGGCGC CTACGAGCCC GAGCGCGTGC AGGTCTGTGC CCAGGGCTTG CTCCTGCGCG CCCATGGATT TCGCAGCGAC CACGGTATTT TGTACTTCGC CGGCTCGCGC GAGCGCGTGG ACGTGCCCTT CACCGACGCG CTCGTCGAGC GCACCTTGGC CCTGCGCGAT CAGGCCCTGC AGGCTGCCGA AGCCGAAAAG CCGCCGTCGC CGCTGGTAGA CAGCCCCAAA TGCCCGCGCT GCTCGCTGGT CGGCATCTGT CTACCGGACG AGCAGAATGC CCTGCTCGGA CGCAGCACCG AGGGAATTCG TCCACTCGTC TCGCTACGTG ACGACGCCCT GCCCTTGCAC GTGCAGGAGC ACGGCGCCGT GGTGAGCAAG CGCGCCGCCG AGCTCGTCAT CAAGCGCAAA GGCAGCGAGC TCGAGCGCGT GCGCATCAAA GACGTCTCGC GCATCAACCT GCACGGCAGC GCGCACATCA CCTTGCCCGC CCTGCAGACA GCATTGGGCA ATGGCATTCC CGTCGGCCTA TTCACCTACG GCGGCTGGTA CTACGGGCGT GCACAGGGAC ATGATCACAA GAACGTGCTC CTGCGTCAGG CGCAGTTTGC CAGCGCGCAG GACGAGGGGC GCTGTCTGCG CATCGCGCAG CGGCTGGTCC ACGCCAAGAT CAAAAACAGC CGCGTCATGT TGCGGCGTAA CAGCCGAGCG CTCGATCGAC GGATTCTCGA CGACCTGTCC GGTCATGCGC GACGCGCGCG TCAGGCCGAC AGCCAGGCCA CCTTGCTCGG CATCGAGGGC AGCGCCGCGC GCCTGTACTT TCAGAATTTC AGCGGCATGT TGCGCCAGGA CGTGCCGTTT TCGTTTGACA GTCGCAATCG CCGCCCGCCG CGCGACCCAG TCAACGCGCT GCTGTCGTTT TCGTACGCGT TGCTCACAGC GGAGTGGACG GCGACCTTGA GCACCGTTGG ATTTGATCCG TACCAGGGCT TTTATCATCA GCCGCGCTAC GGCCGTCCGT CACTCGCGCT GGACCTGATG GAGGAGTTTC GACCGCTCAT CGCCGACAGC GTGGTCATCG GTGCGATCAA CAACGGTGTA CTCGACGAAG ATGATTTCGT CGTGACCGCC ACCGCGGCCG CATTGAAACC CGCGGGGAGG AAGCGGTTTT TGCAGGCATT CGAGCGCCGC CTCGACGAGC AGGTGACCCA TCCGGTCTTT GGCTATCGGC TTAGCTATCG CCGGGTACTG GACGTACAGG CGCGGCTGCT GGGACGCTAC ATCATGGGAG AGATCGATGA GTATCCCGAG TTTGTCACCC GATGA
|
Protein sequence | MNNSDDTLSA PSASSPAGAH DPPAGARTQH TPASTPLLPV RMLNEYAYCP RLFHLEWVQR EWADNAYTLD GKRVHKRVDK PSRHGLRSAD RASDDSASSK DAGQPEDTLF QQHARSVDLG DDALGLIARI DLVEAEGDQA TPIDYKRGKR PDVPGGAYEP ERVQVCAQGL LLRAHGFRSD HGILYFAGSR ERVDVPFTDA LVERTLALRD QALQAAEAEK PPSPLVDSPK CPRCSLVGIC LPDEQNALLG RSTEGIRPLV SLRDDALPLH VQEHGAVVSK RAAELVIKRK GSELERVRIK DVSRINLHGS AHITLPALQT ALGNGIPVGL FTYGGWYYGR AQGHDHKNVL LRQAQFASAQ DEGRCLRIAQ RLVHAKIKNS RVMLRRNSRA LDRRILDDLS GHARRARQAD SQATLLGIEG SAARLYFQNF SGMLRQDVPF SFDSRNRRPP RDPVNALLSF SYALLTAEWT ATLSTVGFDP YQGFYHQPRY GRPSLALDLM EEFRPLIADS VVIGAINNGV LDEDDFVVTA TAAALKPAGR KRFLQAFERR LDEQVTHPVF GYRLSYRRVL DVQARLLGRY IMGEIDEYPE FVTR
|
| |