Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2164 |
Symbol | |
ID | 6375858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2341431 |
End bp | 2342306 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642684651 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001960550 |
Protein GI | 189501080 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0192086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACTG AAAAGATAGT CCCGGAGAGC AATCAATCTC GGTTGCTGAT AAAAATCACC CGAGACACCT TACCGCAGGT GAAGGATAAA TACCCGTTTC TCTATCTTGA ACGGGGAAGG CTGGAAATAG ACGATAGTAG TATAAAATGG ATAGATTGCG ACTGTAACGT TGTCCGGTTA CCTGTGGCGC AGCTCAATTG CTTGTTGCTT GGACCGGGAA CTGCTGTTAC ACATGAAGCT GTGAAAGTTA TGGCAGCAGC AAATTGTGGT ATATGCTGGG TCGGGGAAGA TAGTCTAATT TTTTATGCTG CAGGACAGAC GCCTACAAGT GATTCCCGGA ACTTTCGACG ACAAATGGTA TTGTCTGCCG ATTCAGATAA ATCGCTCAAG GTTGCTCGGC GCATGTTTGC CCGCAGATTT CCTGATGCGA AACTTGAGAC TAAAAGTCTT AAGCAAATGA TGGGAATGGA AGGTTTGCGT GTTCGTCAAC TTTATGTACA AAAAGCTCAA GAATACAAGG TGGGCTGGAA GGGGCGACAA TTTACTCCTG GCAAGTTTGA AATAGGGGAT TTAACTAACA GAATCTTGAC CTCAGCCAAT GCAGCTCTAT ATGGTATAAT TTGTTCTGCT GTTCACAGTA TGGGTTATTC TCCACACATG GGTTTTATAC ATACAGGTAG TCCTCTGCCA TTCATTTATG ATTTGGCTGA TTTATACAAA GAGAGTCTCT CGATTGATCT TGCCTTTCGA TTGACGGCAT TGATGGCAGG AACTTATGAT AGGCACAAAA TTGCTACTGA ATTTCGCAGG AAAGTTATTG AGATGGATCT TCTTGCACGT ATTGGGCCTG ATATTGAAGA AATGCTTGGG AGGTAA
|
Protein sequence | MTTEKIVPES NQSRLLIKIT RDTLPQVKDK YPFLYLERGR LEIDDSSIKW IDCDCNVVRL PVAQLNCLLL GPGTAVTHEA VKVMAAANCG ICWVGEDSLI FYAAGQTPTS DSRNFRRQMV LSADSDKSLK VARRMFARRF PDAKLETKSL KQMMGMEGLR VRQLYVQKAQ EYKVGWKGRQ FTPGKFEIGD LTNRILTSAN AALYGIICSA VHSMGYSPHM GFIHTGSPLP FIYDLADLYK ESLSIDLAFR LTALMAGTYD RHKIATEFRR KVIEMDLLAR IGPDIEEMLG R
|
| |