Gene Cphamn1_2164 details


Gene Information

Locus tag: Cphamn1_2164
Symbol:
ID: 6375858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2341431
End bp: 2342306
Gene Length: 876 bp
Protein Length: 291 aa
Translation table: 11
GC content: 43%
IMG OID: 642684651
Product: CRISPR-associated protein Cas1
Protein accession: YP_001960550
Protein GI: 189501080
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
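The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2341431, 2342306)
    print(length, protein_length(length))
```

With the values in this record, the sketch yields 876 bp and 291 aa, which agrees with the Gene Length and Protein Length fields.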


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0192086
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACTG AAAAGATAGT CCCGGAGAGC AATCAATCTC GGTTGCTGAT AAAAATCACC 
CGAGACACCT TACCGCAGGT GAAGGATAAA TACCCGTTTC TCTATCTTGA ACGGGGAAGG
CTGGAAATAG ACGATAGTAG TATAAAATGG ATAGATTGCG ACTGTAACGT TGTCCGGTTA
CCTGTGGCGC AGCTCAATTG CTTGTTGCTT GGACCGGGAA CTGCTGTTAC ACATGAAGCT
GTGAAAGTTA TGGCAGCAGC AAATTGTGGT ATATGCTGGG TCGGGGAAGA TAGTCTAATT
TTTTATGCTG CAGGACAGAC GCCTACAAGT GATTCCCGGA ACTTTCGACG ACAAATGGTA
TTGTCTGCCG ATTCAGATAA ATCGCTCAAG GTTGCTCGGC GCATGTTTGC CCGCAGATTT
CCTGATGCGA AACTTGAGAC TAAAAGTCTT AAGCAAATGA TGGGAATGGA AGGTTTGCGT
GTTCGTCAAC TTTATGTACA AAAAGCTCAA GAATACAAGG TGGGCTGGAA GGGGCGACAA
TTTACTCCTG GCAAGTTTGA AATAGGGGAT TTAACTAACA GAATCTTGAC CTCAGCCAAT
GCAGCTCTAT ATGGTATAAT TTGTTCTGCT GTTCACAGTA TGGGTTATTC TCCACACATG
GGTTTTATAC ATACAGGTAG TCCTCTGCCA TTCATTTATG ATTTGGCTGA TTTATACAAA
GAGAGTCTCT CGATTGATCT TGCCTTTCGA TTGACGGCAT TGATGGCAGG AACTTATGAT
AGGCACAAAA TTGCTACTGA ATTTCGCAGG AAAGTTATTG AGATGGATCT TCTTGCACGT
ATTGGGCCTG ATATTGAAGA AATGCTTGGG AGGTAA
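
The GC content reported for this gene can be recomputed from the gene sequence as the fraction of G and C bases. The following is a minimal sketch; `gc_content` is a hypothetical helper, and it assumes the sequence is pasted in the 10-base blocks used above, so whitespace is discarded before counting.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is ignored so that block-formatted
    sequence text can be passed in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record.
    print(f"{gc_content('ATGACAACTG AAAAGATAGT'):.0%}")
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the GC content field.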
 
Protein sequence
MTTEKIVPES NQSRLLIKIT RDTLPQVKDK YPFLYLERGR LEIDDSSIKW IDCDCNVVRL 
PVAQLNCLLL GPGTAVTHEA VKVMAAANCG ICWVGEDSLI FYAAGQTPTS DSRNFRRQMV
LSADSDKSLK VARRMFARRF PDAKLETKSL KQMMGMEGLR VRQLYVQKAQ EYKVGWKGRQ
FTPGKFEIGD LTNRILTSAN AALYGIICSA VHSMGYSPHM GFIHTGSPLP FIYDLADLYK
ESLSIDLAFR LTALMAGTYD RHKIATEFRR KVIEMDLLAR IGPDIEEMLG R
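
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation using only the standard library; `translate` is a hypothetical helper. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any trailing partial codon is dropped.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in this record.
    print(translate("ATGACAACTG AAAAGATAGT C"))
```

For the excerpt shown, the result is MTTEKIV, matching the first seven residues of the protein sequence above.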