Gene Dshi_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0401 
Symbol 
ID5711317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp395098 
End bp396009 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content70% 
IMG OID641266306 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_001531751 
Protein GI159042957 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.455858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA TCGTCGATAT CGCCACGGAT GGGCGGCATC TCTCGCGTGA CCGGGGCTTC 
CTGAAGGTCA GCGAGGGCGC CCGGGAAATC GGCCGTATCC CCCTGGACCA GATCGCGGGC
GTCATCGTGC ACGCCCATGG CACCACCTGG ACCACCTCCC TTCTGACCGA GCTGGCGGAT
CGCGGGGCGC CGGTCGTGCT CTGCGGGGCC AACCACGCGC CGCGCTCGGT CCTCATGCCG
CTCGACGGGC ACCATGCCCA GGGCGCGCGC CTGCGGGCAC AGTGGCAGGC CAGGGCCCCG
CTCGTGAAAC AGGCCTGGAA GCAGACGGTA ATCGCGAAGA TCGCCATGCA GGCCGCCGCC
TTGGAGGCCA TGGGCGAACC CCATGCCCCC GTCGGCATGC TCGCGCGCAA GGTGACCAGC
GGCGATGCCA CCAATGTCGA GGCGCAGGCC GCGCGTCTCT ACTGGCCCCG AATGATGGGC
ACCGAGTTCC GCCGCGACCG CACCGCCCCC GACCTCAACG CGCTCCTGAA CTATGGCTAC
ACGGTGCTGC GCGCCGCCAC CGCCCGTGCG GTCGTGGCCG CAGGGCTGCA CCCGACCATC
GGCCTGCACC ATTCGAACCG CGGCAACGCC TTTGCCCTGG CCGACGACCT GATGGAGCCG
TTCCGCCCGC TCGTCGATTG TTGCGTGCGC GGCCTAGCCG CCCGTAACGG ACCCCAGGTC
GATCCCGCGG CCAAGCAGTC CCTAGCGCGG CTCATCGCGC TGGACCTGCC CCTCGGCGAC
AGCCTCACCC CGGTTTCCGT CGCGCTCGGC AAGCTTGCCA TCTCTCTCGG TCAGAGCTTC
GAGTCCGGAA CGCTGGATCT CGCCCTGCCT GCACCGCCCG ATGCGCTGAC CCTTGCAGGC
CTCGGCGCAT GA
 
Protein sequence
MDQIVDIATD GRHLSRDRGF LKVSEGAREI GRIPLDQIAG VIVHAHGTTW TTSLLTELAD 
RGAPVVLCGA NHAPRSVLMP LDGHHAQGAR LRAQWQARAP LVKQAWKQTV IAKIAMQAAA
LEAMGEPHAP VGMLARKVTS GDATNVEAQA ARLYWPRMMG TEFRRDRTAP DLNALLNYGY
TVLRAATARA VVAAGLHPTI GLHHSNRGNA FALADDLMEP FRPLVDCCVR GLAARNGPQV
DPAAKQSLAR LIALDLPLGD SLTPVSVALG KLAISLGQSF ESGTLDLALP APPDALTLAG
LGA