Gene Information |
Locus tag | Hoch_1317 |
Symbol | |
ID | 8543699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 1743413 |
End bp | 1745365 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646386033 |
Product | CRISPR-associated protein, Crm2 family |
Protein accession | YP_003265768 |
Protein GI | 262194559 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
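
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon; the record states neither convention explicitly. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1743413
END_BP = 1745365
GENE_LENGTH_BP = 1953
PROTEIN_LENGTH_AA = 650


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("protein length (aa):", protein_length(GENE_LENGTH_BP))
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields, so the record is internally consistent.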
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA AGCTCCACCT CAGTCTGGGC CCGGTACAGG CATTCATCGC TGAATCGCGA CGCACACGCG ACCTCTGGGT AGGCTCGTAC CTGCTGTCGT ATCTCGCCGG CCGCGCGCTG TACGCAGCCG CGCAGCACGG CGAGATCGTC CTGCCGCGCG TACACGACGA CGTGCTGGCG CTGTATGCCC CCGGGCGCAC GGATCGCGCC CGGTTGCCCG CTCACGCGAG CTTGCCCAAC CGCTTCGTGC TCGAGTGCGC AGACCAGGCC GCGGCAGTCC AGGCGGCCGA GGCAGCAACG AGCGCGCTCC GCGCCGCCTG GGCGCATATC GCCGGCACCG TCCGGAAACA GTTTATCGAC CCTGTGTCCG GTTCCGACGA CGAGACCCAG CGCATCTGGC GCCGCCAGGT CGAGTCGTTC TGGCACGTGG TCTGGGTGAT CGGCGACGAC CCGGCCTTGC TCGATCAACG CAAACACTGG CGCGCGCCCG CCCTCGCGCC CGACCCCGAG CCCGGCGAAC ACTGCACCAT GATGGGCAGA TACCAGGAGC TATCCGGCTT TCTGCGCGGC CAACGCGGAC TGGAAGAATT CTGGATAGAC GTGCGCGCAC GTCTGGGCGG AGCGATGAAC CTGGACCTGA GGCCGGACGA GCGACTGTGC GCTATCGCGC TCATCAAACG GCTCCTGCCG CGGGTCTCGG GCCAGGCCCT CGGTCGCCGC CTCGACGAGG AGCAGGTCGC CTGGCCGTCT ACTCTGTACA TGGCCGCGCG GCCGTGGATC GGAACGGTGT GCGAAGCGCA GCCTGAACCC GCCGCACGTT ATGCAAAGCA GGTGCTCCGT GCGCGCGCGA GCGCCCGCGG CGAACGCAAG GCCGGTCAGA CGCTGCTTGA TGCGCTCACG GGGACGTCAG CCTCGACAGC CTCCGCGGGT GCATTCCCGC TCCTGGATGG CAACTTCTCG TTCATCGGCG CCCTGGAGAA CGAACGAGCC ACCCCACTAG ACCGTGAGGA CGAGCGCCGC GGCCTGGTGA AGGCCCTGAA AGCGCTGCAC GCCCGCCAGG GCACCGGGCC GAGCCCATAC TACGCGTTCT TGCTCATGGA CGGCGACAGC CTCGGCACCC TCCTGAGCCG CGCGGAGCCG CGTACGATAA CCGACTGTCT GTCCGATTTC ACCGAGAGAG TACCCGATAT CGTCTGCGCA CGCGGGGGCG CCACGGTCTA CGCAGGCGGC GATGACGTGC TGGCCTTATT GCCGGTCGAA GGTGCCCTGC CGACAGCGCT GGCGCTGGCG CGCTGCTATG AGCAGCGCTT CGCAGATAGC CAGCTCGACC GCGAACTCCT GCCCGCAGCC ACCATCTCCG GCGCGATCGT GTTCGCGCAC TACCATCTAC CGCTGCGGTA CATCACCGCC CGCGCGCACG AGCTCCTCGA CCACGTGGCC AAGGACCAGA CCGGCCGCGC CAGCCTGGCC ATCTCGCTAC ACCAGAGCAG CGGCGAGACC GCGCGCTTCA GCGTACCGTG GAGCTACCTG CGCACCGACG AAGACGATCG CACCACGAGC ATCGATCCAC TGCTCGCCGA TATCCAGGCC GGTCGCCTGG GCAAGAGCCT TCTGTACCGT TTGCGCGCGC TGCTCGGCCG CATCAGTGGC GCCGGCGAAG TCGGACCGGG TGTCCCCCTG GACCTCAGCA CCCTGCACCA AGCCGGCGCC GAAAGCGGCG CAAGCGATCC TGTGCTCGAC CTGTTTGCCG CCGAGATTCG CAGTACCCGC GGGGACGCGG AGCGAACACC GGCGCAGGTC 
CGCGAGCTAG CCATACACCT CCAGGCAGCA TGCCGGGTCG TTCGCCGCGT CGCCGGTAGT AAGCACCAGA TCGAGCGCGG CCACCTATGC CTGGATGGCG CCCGCTTGGC CTATTTCATG GCCACTGGGG GCAGCAACGA GGATGAGATA TGA
|
Protein sequence | MTVKLHLSLG PVQAFIAESR RTRDLWVGSY LLSYLAGRAL YAAAQHGEIV LPRVHDDVLA LYAPGRTDRA RLPAHASLPN RFVLECADQA AAVQAAEAAT SALRAAWAHI AGTVRKQFID PVSGSDDETQ RIWRRQVESF WHVVWVIGDD PALLDQRKHW RAPALAPDPE PGEHCTMMGR YQELSGFLRG QRGLEEFWID VRARLGGAMN LDLRPDERLC AIALIKRLLP RVSGQALGRR LDEEQVAWPS TLYMAARPWI GTVCEAQPEP AARYAKQVLR ARASARGERK AGQTLLDALT GTSASTASAG AFPLLDGNFS FIGALENERA TPLDREDERR GLVKALKALH ARQGTGPSPY YAFLLMDGDS LGTLLSRAEP RTITDCLSDF TERVPDIVCA RGGATVYAGG DDVLALLPVE GALPTALALA RCYEQRFADS QLDRELLPAA TISGAIVFAH YHLPLRYITA RAHELLDHVA KDQTGRASLA ISLHQSSGET ARFSVPWSYL RTDEDDRTTS IDPLLADIQA GRLGKSLLYR LRALLGRISG AGEVGPGVPL DLSTLHQAGA ESGASDPVLD LFAAEIRSTR GDAERTPAQV RELAIHLQAA CRVVRRVAGS KHQIERGHLC LDGARLAYFM ATGGSNEDEI
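
The sequences above are printed in blocks of ten characters and may wrap across lines, so they need cleaning before any analysis. The following is a minimal sketch of that cleanup, with a GC-content calculation and a simple completeness check. All function names are hypothetical. The check for an ATG start is a simplification, since translation table 11 also allows alternative start codons.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, ends with a stop."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Toy input: the first two blocks of the gene sequence plus a stop codon.
    toy = clean_sequence("ATGACCGTCA AGCTCCACCT\nC TGA")
    print(toy, len(toy), looks_like_complete_cds(toy))
    print(round(gc_percent(toy), 1))
```

To check the full record, paste the complete Gene sequence field into `clean_sequence` and compare the resulting length and GC percentage with the Gene Length and GC content fields.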