Gene Dd703_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd703_0734 
Symbol 
ID8088007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya dadantii Ech703 
KingdomBacteria 
Replicon accessionNC_012880 
Strand
Start bp835354 
End bp836355 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content58% 
IMG OID644834806 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002986367 
Protein GI242238186 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAACG TATTCAGCCC GTCGGATTTA AAAACCATTC TGCACTCCAA ACGCGCCAAT 
ATTTATTACC TCCAGCATTG CCGTATTTTG GTGAATGGCG GACGGGTGGA ATATGTCACT
GAAGAAGGAA ATCAGTCGCT GTACTGGAAT ATCCCTATCG CTAATACCAG CGTGGTAATG
CTCGGCACCG GCACCTCGGT GACGCAGGCG GCGATGCGGG AATTTGCTCG CGCCGGGGTG
ATGGTCGGGT TTTGCGGCAG TGGAGGTACG CCATTGTTCG CCGCCAACGA GGCCGAAGTG
GCGGTATCGT GGCTGTCGCC GCAAAGTGAA TACCGCCCTA CTGAGTATTT GCAGGATTGG
GTCAGCTTCT GGTTTAACGA ACAGCAGCGG CTGGCGGCGG CGATTGCCTT TCAACAGGTG
CGCATTGGGC AGATTCGTCA GCACTGGCTG GGTGGGCGAC TGGCGCGTGA ATCACGTTTC
ACCATCAAAC CCGAACATGT GGAAGCGTTG CTTAACCGCT ATCAGCAGGG ACTGGTCGAC
TGCCGCACCA GTAACGATGT GCTGGTACAG GAAGCGATGA TGACCAAAGC GTTATATCGG
CTGGCGGCCA ACGCCGTGAG CTACGGTGAT TTTACCCGCG CCAAACGCGG CGGCGGCACC
GATTTGGCGA ACCGTTTTCT CGACCACGGC AACTATCTGG CTTACGGGCT GGCGGCAGTA
GCATTGTGGG TGCTGGGCCT GCCGCATGGC CTGGCGGTGC TGCACGGCAA GACCCGACGC
GGCGGGCTGG TGTTCGATGT GGCGGATTTG ATCAAAGACG CGCTGATTTT GCCGCAGGCG
TTTATCGCCG CGATGGAAGG CGAAGACGAG CAGGATTTCC GCCAGCGTTG CCTGACGGCG
TTTCGACAGG CCGAGGCGTT GGATGTAATG ATCGACAGCC TGCAACAGGT GGCTCAGCAA
TTAAGCCAGG TGGCGAAAAC CGGCAGCCAG GTGGCGCGAT GA
 
Protein sequence
MDNVFSPSDL KTILHSKRAN IYYLQHCRIL VNGGRVEYVT EEGNQSLYWN IPIANTSVVM 
LGTGTSVTQA AMREFARAGV MVGFCGSGGT PLFAANEAEV AVSWLSPQSE YRPTEYLQDW
VSFWFNEQQR LAAAIAFQQV RIGQIRQHWL GGRLARESRF TIKPEHVEAL LNRYQQGLVD
CRTSNDVLVQ EAMMTKALYR LAANAVSYGD FTRAKRGGGT DLANRFLDHG NYLAYGLAAV
ALWVLGLPHG LAVLHGKTRR GGLVFDVADL IKDALILPQA FIAAMEGEDE QDFRQRCLTA
FRQAEALDVM IDSLQQVAQQ LSQVAKTGSQ VAR