Gene Hoch_1317 details


Gene Information

Locus tag: Hoch_1317
Symbol:
ID: 8543699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1743413
End bp: 1745365
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 68%
IMG OID: 646386033
Product: CRISPR-associated protein, Crm2 family
Protein accession: YP_003265768
Protein GI: 262194559
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
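
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1743413, 1745365)
print(length, protein_length(length))  # prints "1953 650"
```

Both values agree with the Gene Length and Protein Length fields of this record.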


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCA AGCTCCACCT CAGTCTGGGC CCGGTACAGG CATTCATCGC TGAATCGCGA 
CGCACACGCG ACCTCTGGGT AGGCTCGTAC CTGCTGTCGT ATCTCGCCGG CCGCGCGCTG
TACGCAGCCG CGCAGCACGG CGAGATCGTC CTGCCGCGCG TACACGACGA CGTGCTGGCG
CTGTATGCCC CCGGGCGCAC GGATCGCGCC CGGTTGCCCG CTCACGCGAG CTTGCCCAAC
CGCTTCGTGC TCGAGTGCGC AGACCAGGCC GCGGCAGTCC AGGCGGCCGA GGCAGCAACG
AGCGCGCTCC GCGCCGCCTG GGCGCATATC GCCGGCACCG TCCGGAAACA GTTTATCGAC
CCTGTGTCCG GTTCCGACGA CGAGACCCAG CGCATCTGGC GCCGCCAGGT CGAGTCGTTC
TGGCACGTGG TCTGGGTGAT CGGCGACGAC CCGGCCTTGC TCGATCAACG CAAACACTGG
CGCGCGCCCG CCCTCGCGCC CGACCCCGAG CCCGGCGAAC ACTGCACCAT GATGGGCAGA
TACCAGGAGC TATCCGGCTT TCTGCGCGGC CAACGCGGAC TGGAAGAATT CTGGATAGAC
GTGCGCGCAC GTCTGGGCGG AGCGATGAAC CTGGACCTGA GGCCGGACGA GCGACTGTGC
GCTATCGCGC TCATCAAACG GCTCCTGCCG CGGGTCTCGG GCCAGGCCCT CGGTCGCCGC
CTCGACGAGG AGCAGGTCGC CTGGCCGTCT ACTCTGTACA TGGCCGCGCG GCCGTGGATC
GGAACGGTGT GCGAAGCGCA GCCTGAACCC GCCGCACGTT ATGCAAAGCA GGTGCTCCGT
GCGCGCGCGA GCGCCCGCGG CGAACGCAAG GCCGGTCAGA CGCTGCTTGA TGCGCTCACG
GGGACGTCAG CCTCGACAGC CTCCGCGGGT GCATTCCCGC TCCTGGATGG CAACTTCTCG
TTCATCGGCG CCCTGGAGAA CGAACGAGCC ACCCCACTAG ACCGTGAGGA CGAGCGCCGC
GGCCTGGTGA AGGCCCTGAA AGCGCTGCAC GCCCGCCAGG GCACCGGGCC GAGCCCATAC
TACGCGTTCT TGCTCATGGA CGGCGACAGC CTCGGCACCC TCCTGAGCCG CGCGGAGCCG
CGTACGATAA CCGACTGTCT GTCCGATTTC ACCGAGAGAG TACCCGATAT CGTCTGCGCA
CGCGGGGGCG CCACGGTCTA CGCAGGCGGC GATGACGTGC TGGCCTTATT GCCGGTCGAA
GGTGCCCTGC CGACAGCGCT GGCGCTGGCG CGCTGCTATG AGCAGCGCTT CGCAGATAGC
CAGCTCGACC GCGAACTCCT GCCCGCAGCC ACCATCTCCG GCGCGATCGT GTTCGCGCAC
TACCATCTAC CGCTGCGGTA CATCACCGCC CGCGCGCACG AGCTCCTCGA CCACGTGGCC
AAGGACCAGA CCGGCCGCGC CAGCCTGGCC ATCTCGCTAC ACCAGAGCAG CGGCGAGACC
GCGCGCTTCA GCGTACCGTG GAGCTACCTG CGCACCGACG AAGACGATCG CACCACGAGC
ATCGATCCAC TGCTCGCCGA TATCCAGGCC GGTCGCCTGG GCAAGAGCCT TCTGTACCGT
TTGCGCGCGC TGCTCGGCCG CATCAGTGGC GCCGGCGAAG TCGGACCGGG TGTCCCCCTG
GACCTCAGCA CCCTGCACCA AGCCGGCGCC GAAAGCGGCG CAAGCGATCC TGTGCTCGAC
CTGTTTGCCG CCGAGATTCG CAGTACCCGC GGGGACGCGG AGCGAACACC GGCGCAGGTC
CGCGAGCTAG CCATACACCT CCAGGCAGCA TGCCGGGTCG TTCGCCGCGT CGCCGGTAGT
AAGCACCAGA TCGAGCGCGG CCACCTATGC CTGGATGGCG CCCGCTTGGC CTATTTCATG
GCCACTGGGG GCAGCAACGA GGATGAGATA TGA
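
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the displayed sequence. The helper names are hypothetical, and only the first displayed line is used as sample input; pasting in the full sequence works the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, used here as sample input only.
first_line = "ATGACCGTCA AGCTCCACCT CAGTCTGGGC CCGGTACAGG CATTCATCGC TGAATCGCGA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```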
 
Protein sequence
MTVKLHLSLG PVQAFIAESR RTRDLWVGSY LLSYLAGRAL YAAAQHGEIV LPRVHDDVLA 
LYAPGRTDRA RLPAHASLPN RFVLECADQA AAVQAAEAAT SALRAAWAHI AGTVRKQFID
PVSGSDDETQ RIWRRQVESF WHVVWVIGDD PALLDQRKHW RAPALAPDPE PGEHCTMMGR
YQELSGFLRG QRGLEEFWID VRARLGGAMN LDLRPDERLC AIALIKRLLP RVSGQALGRR
LDEEQVAWPS TLYMAARPWI GTVCEAQPEP AARYAKQVLR ARASARGERK AGQTLLDALT
GTSASTASAG AFPLLDGNFS FIGALENERA TPLDREDERR GLVKALKALH ARQGTGPSPY
YAFLLMDGDS LGTLLSRAEP RTITDCLSDF TERVPDIVCA RGGATVYAGG DDVLALLPVE
GALPTALALA RCYEQRFADS QLDRELLPAA TISGAIVFAH YHLPLRYITA RAHELLDHVA
KDQTGRASLA ISLHQSSGET ARFSVPWSYL RTDEDDRTTS IDPLLADIQA GRLGKSLLYR
LRALLGRISG AGEVGPGVPL DLSTLHQAGA ESGASDPVLD LFAAEIRSTR GDAERTPAQV
RELAIHLQAA CRVVRRVAGS KHQIERGHLC LDGARLAYFM ATGGSNEDEI
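
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below translates a blocked nucleotide string codon by codon. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate a blocked CDS string, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("ATGACCGTCA AGCTCCACCT CAGTCTGGGC"))  # prints "MTVKLHLSLG"
```

The output matches the first block of the protein sequence, and the final codons of the gene (GAG GAT GAG ATA TGA) translate to EDEI followed by a stop, matching the end of the protein.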