Gene Dhaf_2188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_2188 
Symbol 
ID7259157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp2360341 
End bp2361372 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content48% 
IMG OID643562078 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002458658 
Protein GI219668223 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAT TATTGAATAC CCTCTACGTG ACTTCGCCCA ATACTTATCT ATCGCTTGAC 
GGAGAAAATA TTGTGATTTT GAAGGATGAT GTGGAGGCCT TGAGAGTCCC TTTGCATAAT
CTGGAGAGCA TCATTGCCTT TGGCTATACC GGGGCCAGCC CGGCTTTGAT GGGCGCTTGT
GCCAAGCGCA ATATCTCCTT GAGCTTTATG AAATCCAACG GAAAATTTCT GGGCAGAGTA
GTGGGGGAAG TCAGAGGGAA TGTCACTTTA AGAAAAGCGC AATACAGACT TTCGGATGAT
GAAGCGACAA GCCATAGAAT TGCCAAGAGT TTTATTTTGG GGAAAGTATA TAATTCCCGC
TGGGTGGTGG AACGGGCCAC CCGGGATCAC GGGGCAAGAC TGGATGTAGA GAAATTAAAG
GGAGTCTGCC AAACCTTGGC TAATGCTCTG AGACTGGTCG AGAACAGTAA GGATTTGGAC
CAGCTGCGGG GTGTTGAGGG GGAGGCTGCA GCCCAATATT TCCGGGTGCT GGATGATCTG
ATTCTCCAGC AGAAGGAGGA TTTTTACTTC AAATGTCGTA ATAAACGGCC TCCTCTGGAC
AATGTTAATG CTATGCTCTC TTTTATCTAC ACCTTGTTGG CCCATGATGC CGCTGCTGCT
CTGGAAACAG TGGGACTGGA CCCCTATGTA GGATTTTTGC ACCGGGATAG ACCGGGAAGG
ATCTCGCTGG CCCTGGATCT GATGGAAGAG CTGCGGGCAG TTTTTGCAGA TCGTTTTGTC
CTTTCTTTAA TCAATAGAAG GGAAGTCAAT CCCAGCGGGT TCACCCGAAT GGAAAATGGA
GCAGTTGTAA TGGATGATGA TACCAGAAGA GATATCCTTA AGGCATGGCA AAGCAGGAAA
CAGGAAGAGA TCAAGCATCC CTTTCTGCAG GAGAAAATGG AGTGGGGGCT TGTGCCCTAT
GCTCAGGCTA TGTTGCTGGC CCGGTTTATC CGTGGGGATT TGGACGGATA TCCGGCTTTT
ATGTGGAAGT AG
 
Protein sequence
MRKLLNTLYV TSPNTYLSLD GENIVILKDD VEALRVPLHN LESIIAFGYT GASPALMGAC 
AKRNISLSFM KSNGKFLGRV VGEVRGNVTL RKAQYRLSDD EATSHRIAKS FILGKVYNSR
WVVERATRDH GARLDVEKLK GVCQTLANAL RLVENSKDLD QLRGVEGEAA AQYFRVLDDL
ILQQKEDFYF KCRNKRPPLD NVNAMLSFIY TLLAHDAAAA LETVGLDPYV GFLHRDRPGR
ISLALDLMEE LRAVFADRFV LSLINRREVN PSGFTRMENG AVVMDDDTRR DILKAWQSRK
QEEIKHPFLQ EKMEWGLVPY AQAMLLARFI RGDLDGYPAF MWK