Gene Dd1591_0696 details

Gene Information

Locus tag: Dd1591_0696
Symbol:
ID: 8119739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 801338
End bp: 802339
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 61%
IMG OID: 644851084
Product: CRISPR-associated protein Cas1
Protein accession: YP_003003056
Protein GI: 251788335
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype
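
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 801338
END_BP = 802339
REPORTED_GENE_LENGTH = 1002      # bp
REPORTED_PROTEIN_LENGTH = 333    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1002 333
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.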


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.101778
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACG TATTCAGCCC GTCGGATTTA AAAACCATTC TGCATTCCAA ACGCGCCAAT 
ATTTATTACC TCCAGCATTG CCGTATTTTG GTCAATGGCG GACGGGTGGA ATATGTCACC
GAAGAGGGGA ATCAGTCGTT GTACTGGAAT ATCCCGATTG CCAATACCAG CGTGGTGATG
CTCGGCACCG GCACTTCGGT CACGCAGGCG GCGATGCGGG AATTTGCCCG CGCCGGGGTG
ATGGTCGGGT TTTGTGGTGG TGGCGGTACG CCGCTGTTCG CCGCCAATGA GGCCGAAGTG
GCGGTGTCGT GGCTGTCGCC GCAGAGCGAA TACCGCCCCA CCGGCTATTT GCAGGATTGG
GTCAGCTTCT GGTTTAACGA AGAGCAGCGG CTGGCGGCGG CGGTTGCCTT CCAGCAGGTG
CGCATCGGCC AGATTCGCCA GCACTGGCTG GGCGGGCGGC TGGCGCGCGA GTCGCGTTTT
GCTATCAAAC CCGAGCATGT GGAAGCACTG CTTAACCGCT ATCAGCAGGG GCTGACGGCG
TGTCGCACCA GTAACGACGT ATTGGTGCAG GAAGCGATGA TGACCAAAGC GCTGTACCGG
TTGGCGGCCA ACGCGGTGAG TTACGGTGAT TTTACCCGCG CCAAACGCGG CGGCGGCACC
GACATGGCGA ACCGTTTTCT CGACCACGGC AACTATCTGG CTTACGGTCT GGCGGCGGTG
GCGCTGTGGG TGTTGGGATT GCCGCACGGG CTGGCGGTGC TGCACGGCAA AACCCGCCGT
GGCGGGCTGG TGTTCGATGT GGCGGACCTG ATTAAAGACG CGCTGATTCT GCCGCAGGCG
TTTATCGCCG CGATGGAAGG GGAAGACGAG CAGGAATTTC GCCAGCGCTG CCTGACGTCG
TTTCGTCAGG CCGAGGCGCT GGACGTGATG ATCGACAGCC TGCAACAGGT GGCGCAGCAA
TTAAGCCAGG TGGCGAAAAC CGGGAGTCGG GGGGCGCAAT GA
 
Protein sequence
MDNVFSPSDL KTILHSKRAN IYYLQHCRIL VNGGRVEYVT EEGNQSLYWN IPIANTSVVM 
LGTGTSVTQA AMREFARAGV MVGFCGGGGT PLFAANEAEV AVSWLSPQSE YRPTGYLQDW
VSFWFNEEQR LAAAVAFQQV RIGQIRQHWL GGRLARESRF AIKPEHVEAL LNRYQQGLTA
CRTSNDVLVQ EAMMTKALYR LAANAVSYGD FTRAKRGGGT DMANRFLDHG NYLAYGLAAV
ALWVLGLPHG LAVLHGKTRR GGLVFDVADL IKDALILPQA FIAAMEGEDE QEFRQRCLTS
FRQAEALDVM IDSLQQVAQQ LSQVAKTGSR GAQ
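
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It is an illustration only: the helper names are hypothetical, and the codon table is the standard one, which assigns the same amino acids to codons as translation table 11 (the two tables differ in their permitted start codons, and alternative start codons are not handled here).

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATAACG TATTCAGCCC GTCGGATTTA"
print(translate(first_blocks))  # MDNVFSPSDL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the reported GC content and protein sequence to be compared in the same way.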