Gene Hoch_1316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1316 
Symbol 
ID8543698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1742241 
End bp1743416 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content69% 
IMG OID646386032 
ProductCRISPR-associated protein, Cmr3 
Protein accessionYP_003265767 
Protein GI262194558 
COG category[L] Replication, recombination and repair 
COG ID[COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR01888] CRISPR-associated protein, Cmr3 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGC GCGCCTACCT GCTTCAGCCC ACCGACGTAT GGTTCTTTCG CGACGGCCGC 
CCGTATGACC GCTACGAGGC CAGTCAGACG GCGGTCAAGA GCCTGTTCCC GCCCTCTCCG
CTAACCGTAC TTGGCGCCTT GCGTGCCGGC CTGGCGCGCG CCCTGGGTTG GCGTGACGGG
CCCTGGCCAG CCGAAATCTG TGCCGTGCTC GGCGACGGTA TCGAAGACCT GGCCGAGCTC
TCGCTGCGTG GCCCGTATCT GGCCCGGAGC GTCGACCCGG AGCGGCCCGA GCCGTGGTGG
CCGCTTCCCG TACACCTGGT CGGAATTGTC CAGAACGGAG TGCGGCATGC TCAGGCTCTT
GGGCATCGCG AAGACGAGCA AAGCCTACCG TGGCGAGCCC GCGCGCTGCT GAGACCCAGC
CAGGAGCCGA TCCGCTGCGA CCTCGGCGAG GTGCACCTGC CCGTGCCGTC CAGTGACAAG
CCGGCTCCGC CCTCCGAGCG CTTGTCCGCG CGGCCGCGGT ACTGGGTGAA CACCGCCGGC
CTCGACGCCA TCCTCGCCGG CCGGTTGCCC AAACCCGAGG ACGTGATCGC GCCGCCCTGG
CAGCACCAGA TGCGCGTGGG CATCCACCGG GACGAGACCA CGCGAACGAC CAGCGATCGC
GCCCATGCCT TGTACAGCCC GCTCATGGTG AGCTTGCGGC CCGACTTCGG ACTTTTGGCC
GAAATGCGCG GCGTACCCGA CACCGTGGAC GATCCCGCGC CGGTCTTGCC GCTGGGTGGC
GAGTCCCGCC TGGCTGCTTG CCAGCGCGTG GCCTCTCCCC GTGCCCCGTC GTGTCCGAGC
AATCTCATCC GCAAGAGCCG ACGCTGCGTT GTGGTCCACC TGTCCCCCGC CCGACTATCG
AGCCTGCCGC GACCAGGCGA AACGCTCCCC GACCTGCCGG GCGCGCGGGT GGTAACGGCC
TGCTTACGGC CGCTCGAGCA GATCGGCGGC TGGGATGGCC GGGACCGAGC GAAAGCACGC
CCGCGACCGC TCAACCCCGT GGTGGCCGCG GGTAGCGTGT GGTTCTGCGA GCTCGATGGT
GACGTCGACG CAACCCTGAA CATGCACGAT GGCCGCATCG GCGATGACAC GCGCTGCGGC
TTTGGACACC TTGCCTTGGG CACCTGGCCC GCTTGA
 
Protein sequence
MTTRAYLLQP TDVWFFRDGR PYDRYEASQT AVKSLFPPSP LTVLGALRAG LARALGWRDG 
PWPAEICAVL GDGIEDLAEL SLRGPYLARS VDPERPEPWW PLPVHLVGIV QNGVRHAQAL
GHREDEQSLP WRARALLRPS QEPIRCDLGE VHLPVPSSDK PAPPSERLSA RPRYWVNTAG
LDAILAGRLP KPEDVIAPPW QHQMRVGIHR DETTRTTSDR AHALYSPLMV SLRPDFGLLA
EMRGVPDTVD DPAPVLPLGG ESRLAACQRV ASPRAPSCPS NLIRKSRRCV VVHLSPARLS
SLPRPGETLP DLPGARVVTA CLRPLEQIGG WDGRDRAKAR PRPLNPVVAA GSVWFCELDG
DVDATLNMHD GRIGDDTRCG FGHLALGTWP A