Gene A2cp1_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_0784 
Symbol 
ID7299170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp882459 
End bp883343 
Gene Length885 bp 
Protein Length294 aa 
Translation table11 
GC content71% 
IMG OID643593579 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002491204 
Protein GI220915900 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGAAGG GCAGGCTCGG CCTCGAGACG GCGCGGATCC CGCAAGGCGA CCGCCACGGC 
CTGCTCTGGC TGTCGCGCGG CAGCCTGTAC GTCGAGGATG GGACGCTCCG CTTCCGCACC
GCCGGCTGGG CCGAGCTCCC AGCCGGCGAC TATGCCATAC CGTTTCAGAT GGTCACCGCC
GTGCTCCTCG AGCCGGGGAC CACCGTCAGC CACGACGCGC TCAGGCTGCT CGCGCGCCAC
GGGACGGGCC TCGTCGCCAT CGGCGAGGAG GGCACGCGCT TCTACGCGAG CATGCCGTTC
GGCCCGGACG CCTCGGCGCT CGCCCGCCGG CAGGTGATGG CGTGGGCGAG CGCCGCGGAC
GGTCGGTTGC GCGTCGCGCG TCGCATGTAC GCCTGGCGCT TCGGCGAGGT TCTGCCCGAC
GAGGACATCA CCGTCCTACG CGGTATCGAG GGTGCCCGGA TGCGCGAGAT CTACCGGCGC
CTCGCAGAGC AGTACGGCGT TCCATGGTCC GGTCGGCGCT ACGACCGGCA GCGCCCGGAC
CAGAACGATC CCGTGAACCA GGCGATCAAC CACGCCGCGA GCGCGGTCGA GGCCGCGGCG
CTCGTGGCCG TCGCCGTGAC GGGGACGATC CCCCAACTCG GCTTCATCCA CGAGGACTCG
GGGAACGCGT TCGCCCTCGA CGTCGCCGAC CTGTTTCGCT CGGCGATAGC CCTCCCGGCC
GCCTTCTCGG CCGTGCGGGA GTGTGCCAAG GATCCCCGCA AGCCACTCGA GCGCACGGCA
AGGCGCGCCG CGGGTCGTCT CCTGCAGCAG AAGGACGTCA TCCCCGAGAT GATCGACCGC
ATCAAGGAGA TGTTCGATGC CGATGACGGT CATCGTGACC CGTGA
 
Protein sequence
MLKGRLGLET ARIPQGDRHG LLWLSRGSLY VEDGTLRFRT AGWAELPAGD YAIPFQMVTA 
VLLEPGTTVS HDALRLLARH GTGLVAIGEE GTRFYASMPF GPDASALARR QVMAWASAAD
GRLRVARRMY AWRFGEVLPD EDITVLRGIE GARMREIYRR LAEQYGVPWS GRRYDRQRPD
QNDPVNQAIN HAASAVEAAA LVAVAVTGTI PQLGFIHEDS GNAFALDVAD LFRSAIALPA
AFSAVRECAK DPRKPLERTA RRAAGRLLQQ KDVIPEMIDR IKEMFDADDG HRDP