Gene Dhaf_3931 details

Gene Information

Locus tag: Dhaf_3931
Symbol:
ID: 7260952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 4166529
End bp: 4167524
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 45%
IMG OID: 643563854
Product: CRISPR-associated protein Cas1
Protein accession: YP_002460382
Protein GI: 219669947
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
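
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4166529, 4167524)
print(length)                           # 996
print(expected_protein_length(length))  # 331
```

Under these assumptions the values agree with the Gene Length (996 bp) and Protein Length (331 aa) fields of this record.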


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAAAAA CCTTATATAT CTTCTCCAAT GGTCAGCTCC GGCGGAAGGA TGATACGATT 
TTCTTCGTCA ATGAGGAGGG GGATCAAAAA TATATCCCTG TGGAAGATAC CTCCGAGTTG
ATGGTCTTTG GGGAGGTGGA TATTAATAAA CGATTCCTGG AGTTCTGCAC GCAAAAGGAG
ATTATCATTC ATTACTTTAA TAACTATGGT TATTACAGCG GTACCTTTTA TCCCCGGGAA
CACTATAACT CAGGGTATAT GATATTGAAG CAGGCTGAAG CTTACCTGAA TGAGGAAAGA
AGGCTGTTGC TGGCGCGGCA ATTTGTGAAC GGTGCTTTTT TAAATATCAG GCAGGTTCTT
AAATACTATG CCAATCGGGG CAAGGAAGTG GGGCCCCGGT TAACGGAGAT AGAAAAGCTG
AGTGAAGGTA TAGGAGCTGC CGGCACGATT CCTGAGCTGA TGGCTTTTGA AGGGAATATC
AGGGAACATT ATTATAAGGC CTTTGACGCG ATTCATGGCC ACCCGGAATT TGTGTTCGAG
GGGCGCTCAA AGCGGCCTCC TAAAAATGCA ATGAATACCT TGATCAGTTT TGGCAATTCC
ATTGTATACT CCACAGTGTT GAGTGAAATT TATAAAACTC ATCTGGATCC GCGCATCGGC
TATCTTCATA CCACTAATTT TCGCCGGTTT AGTTTGAATT TGGATGTGGC GGAAATCTTT
AAACCGATTT TGGTGGATCG GGTGATTTTT ACTCTGATCG GGAAAAAGAT GATCAAAAAG
AGCGATTTTA AAAAGGAGTC CGGGGGCTTA ATGCTGAAGG AGAACGGACG GAGGGTTTTC
GTTGAGGAGC TGGAAAACCG TTTGAAAACC ACCATCAACC ACCGGGATAT AGGCACTCCG
GTGTCTTATC GGCGCTTGCT TCGTCTTGAG CTGTACAAGC TGGAAAAGCA TCTCATGGGT
GAGAAAGACT ACGAGCCTTT TGTCAGCCAG TGGTGA
 
Protein sequence
MKKTLYIFSN GQLRRKDDTI FFVNEEGDQK YIPVEDTSEL MVFGEVDINK RFLEFCTQKE 
IIIHYFNNYG YYSGTFYPRE HYNSGYMILK QAEAYLNEER RLLLARQFVN GAFLNIRQVL
KYYANRGKEV GPRLTEIEKL SEGIGAAGTI PELMAFEGNI REHYYKAFDA IHGHPEFVFE
GRSKRPPKNA MNTLISFGNS IVYSTVLSEI YKTHLDPRIG YLHTTNFRRF SLNLDVAEIF
KPILVDRVIF TLIGKKMIKK SDFKKESGGL MLKENGRRVF VEELENRLKT TINHRDIGTP
VSYRRLLRLE LYKLEKHLMG EKDYEPFVSQ W
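
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine when it is the first codon. Only a common subset of bacterial start codons is listed here, and all names in the code are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a common subset of bacterial start codons, each read as Met
# when it is the first codon of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
first_block = "GTGAAAAAAA CCTTATATAT CTTCTCCAAT"
print(translate(first_block))  # MKKTLYIFSN
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of this record.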