Gene Hoch_4852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4852 
Symbol 
ID8547259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6640324 
End bp6642108 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content65% 
IMG OID646389525 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_003269234 
Protein GI262198025 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.464981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACT CAGACGACAC CTTGTCCGCT CCGTCCGCCT CGTCGCCCGC GGGCGCACAC 
GATCCGCCTG CGGGCGCACG GACCCAACAC ACGCCCGCGA GCACACCGCT GCTGCCCGTG
CGCATGCTCA ACGAGTACGC CTACTGCCCG CGCTTGTTTC ATCTCGAGTG GGTGCAGCGC
GAGTGGGCTG ACAACGCCTA CACCCTGGAT GGCAAGCGGG TCCACAAGCG CGTGGACAAG
CCCTCGCGGC ATGGCTTGCG CTCTGCCGAC CGCGCGTCCG ACGATAGCGC TTCGAGCAAA
GACGCGGGCC AACCCGAAGA CACCCTGTTT CAGCAGCACG CGCGCAGCGT GGACCTCGGC
GACGACGCGC TCGGCCTGAT CGCGCGCATC GACCTCGTCG AGGCCGAGGG TGACCAGGCC
ACGCCCATCG ACTACAAGCG CGGCAAGCGC CCGGACGTCC CCGGCGGCGC CTACGAGCCC
GAGCGCGTGC AGGTCTGTGC CCAGGGCTTG CTCCTGCGCG CCCATGGATT TCGCAGCGAC
CACGGTATTT TGTACTTCGC CGGCTCGCGC GAGCGCGTGG ACGTGCCCTT CACCGACGCG
CTCGTCGAGC GCACCTTGGC CCTGCGCGAT CAGGCCCTGC AGGCTGCCGA AGCCGAAAAG
CCGCCGTCGC CGCTGGTAGA CAGCCCCAAA TGCCCGCGCT GCTCGCTGGT CGGCATCTGT
CTACCGGACG AGCAGAATGC CCTGCTCGGA CGCAGCACCG AGGGAATTCG TCCACTCGTC
TCGCTACGTG ACGACGCCCT GCCCTTGCAC GTGCAGGAGC ACGGCGCCGT GGTGAGCAAG
CGCGCCGCCG AGCTCGTCAT CAAGCGCAAA GGCAGCGAGC TCGAGCGCGT GCGCATCAAA
GACGTCTCGC GCATCAACCT GCACGGCAGC GCGCACATCA CCTTGCCCGC CCTGCAGACA
GCATTGGGCA ATGGCATTCC CGTCGGCCTA TTCACCTACG GCGGCTGGTA CTACGGGCGT
GCACAGGGAC ATGATCACAA GAACGTGCTC CTGCGTCAGG CGCAGTTTGC CAGCGCGCAG
GACGAGGGGC GCTGTCTGCG CATCGCGCAG CGGCTGGTCC ACGCCAAGAT CAAAAACAGC
CGCGTCATGT TGCGGCGTAA CAGCCGAGCG CTCGATCGAC GGATTCTCGA CGACCTGTCC
GGTCATGCGC GACGCGCGCG TCAGGCCGAC AGCCAGGCCA CCTTGCTCGG CATCGAGGGC
AGCGCCGCGC GCCTGTACTT TCAGAATTTC AGCGGCATGT TGCGCCAGGA CGTGCCGTTT
TCGTTTGACA GTCGCAATCG CCGCCCGCCG CGCGACCCAG TCAACGCGCT GCTGTCGTTT
TCGTACGCGT TGCTCACAGC GGAGTGGACG GCGACCTTGA GCACCGTTGG ATTTGATCCG
TACCAGGGCT TTTATCATCA GCCGCGCTAC GGCCGTCCGT CACTCGCGCT GGACCTGATG
GAGGAGTTTC GACCGCTCAT CGCCGACAGC GTGGTCATCG GTGCGATCAA CAACGGTGTA
CTCGACGAAG ATGATTTCGT CGTGACCGCC ACCGCGGCCG CATTGAAACC CGCGGGGAGG
AAGCGGTTTT TGCAGGCATT CGAGCGCCGC CTCGACGAGC AGGTGACCCA TCCGGTCTTT
GGCTATCGGC TTAGCTATCG CCGGGTACTG GACGTACAGG CGCGGCTGCT GGGACGCTAC
ATCATGGGAG AGATCGATGA GTATCCCGAG TTTGTCACCC GATGA
 
Protein sequence
MNNSDDTLSA PSASSPAGAH DPPAGARTQH TPASTPLLPV RMLNEYAYCP RLFHLEWVQR 
EWADNAYTLD GKRVHKRVDK PSRHGLRSAD RASDDSASSK DAGQPEDTLF QQHARSVDLG
DDALGLIARI DLVEAEGDQA TPIDYKRGKR PDVPGGAYEP ERVQVCAQGL LLRAHGFRSD
HGILYFAGSR ERVDVPFTDA LVERTLALRD QALQAAEAEK PPSPLVDSPK CPRCSLVGIC
LPDEQNALLG RSTEGIRPLV SLRDDALPLH VQEHGAVVSK RAAELVIKRK GSELERVRIK
DVSRINLHGS AHITLPALQT ALGNGIPVGL FTYGGWYYGR AQGHDHKNVL LRQAQFASAQ
DEGRCLRIAQ RLVHAKIKNS RVMLRRNSRA LDRRILDDLS GHARRARQAD SQATLLGIEG
SAARLYFQNF SGMLRQDVPF SFDSRNRRPP RDPVNALLSF SYALLTAEWT ATLSTVGFDP
YQGFYHQPRY GRPSLALDLM EEFRPLIADS VVIGAINNGV LDEDDFVVTA TAAALKPAGR
KRFLQAFERR LDEQVTHPVF GYRLSYRRVL DVQARLLGRY IMGEIDEYPE FVTR