Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2054 |
Symbol | |
ID | 4568738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2379403 |
End bp | 2381598 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639766635 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_912490 |
Protein GI | 119357846 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATGGC TCTACAATCA GATGGCAATG CCCGAAACCA TTTTTCAGGC CTGGTACAAA GTGGCATCGA ACGATGGGCG CCCGGGATGG GACAATACAT CCATTCAGGA CTACTCCCTT CGGCTTGAAG AAAACCTGAA ATCCCTTTCG CACGCCCTGC TGACAGGAAC CTACAGGCAG AGCCCGCTGC TGAAGCTCGT TATGCTGAAG CCTGACGGAA AAGAGCGGGT GCTTCTGATT CCCGGTGTGA TTGACAGAGT TGCCCAGACA GCGGCATCAA TCGTGCTCTC ACCCATTATC GAAGCGGAGC TTGGCAACTG CACCTTTGCC TACCGTCCGG GCATATCGCG TGAAGGAGCA GCACGGGAGA TCGACCGGCT GCATCGCGAA GGGTACCAGT GGGTGCTCGA CGCCGACATC CGCAACTTTT TCGACAATGT TCGCCACGAC CTGCTTTTTC AACGGCTTGT CGAACTTGTT GACGACAAGG AGATGATCTC CCTGTTGCAC CGCTGGCTTA CCGCCGAAAT TGTTGACGGG CTGAACCCCC GTACCCGCAA CACGATGGGT CTCCCTCAGG GATGCCCGAT ATCACCGGCA CTGGCGAACC TCTATCTTGA CCGGTTTGAT GAAACAATGG AACAGCAGGG ATTCAAACTG GTTCGTTTTG CCGACGACTA TCTTGTACTC TGTAAAACCC GTCCCAAAGC CGAAGCTGCC CTCAAGCTCT CTGAAAGCGC GCTTGCTGAG CTGAAGCTTG AACTGCACAG CGATAAAACC CGCATTACCA CCTTTGCCGA AGGATTCAAA TATCTCGGCT ACCTCTTTAT CCGATCGCTG GTGATTCCCA CCAAAATGCA CCCTGAAGAG TGGTATGACA AGCTCGGCAA ATTCAGACTC CGCAAAAAGA GCGTGCACGC CCTTCCCGCT GACCCCGAAA CCATGACCGG CGAAACATCG CAGTTTGAAC TCGAAACCGA TCAGGGCGAA AAAATCGAAC TCTCCAAGGA GGAGCTTCTG CAGACAGAGT TCGGCAAAAA ACTGCTCGAA AGCCTCGATA AAAAGCAGTT GAATGTTGAC GAATTTCTTG AAAAGATCTC CAGGCAGGAC GAAGAGCGAC AGAAAGAGAA ACGGGAGGCA CTGAAAAAAC TCTACTCCCC CTTTCTCAAT ACGCTCTATC TGCAGGAGCA GGGGAGCCTT ATGCGCAAGG ATGGGGAGCG GTTCAGTATT GAAAAGGATG GGTCGGTCAT TAACGAAGTG ATTGTTCGCC GCATCGAACA GGTTGTGATT TTCGGCAACG TCGCCCTCAC CACACCGGTC ATGCAATACT GCCTGCAGAA CGAAATACCG GTCACCTTTC TTTCGCAGCA TGGCAAATAC TTCGGCAGGC TTGAAGCAAC CACTGCCGAC AATGCTGAAA TGCAGCGCTT TCACTTTCTG CGTTCCATAG ACGAACCCTT TGCGCTTGAA ACCGCCCGTT CCATCGTAGC GGCAAAAATC AGCAACAGCA AAACCATGAT TCGCCGCCGA AAAACCGTGG TGCAGGATCG CGACAGCACT CTGCAAAATA AAATGGCATA CAATCTCGAC ATCATGGCCG ATCTTGCCCT GAAAGCGGAA GCATCCACTG ATATCGATGC GCTACGGGGG ATCGAAGGCA AGGCATCGGC ACTCTACTTC GAATGTTACG GCATGCTTTT CAGCAAAAAC CTGCCCTTTC ACACCCGGTC GTTTCTGCGG GTACGACGAC CGCCTACCGA TCCGGTCAAC AGTCTGCTCA GTTTCGGTTA CACCATGCTG CACACCAACA TATTCTCGAT GGTGCAGGCA AGCGGCCTGA ACCCCTATAT CGGTTTTCTT CACGCCGAAC GAAAAGGCAA TCCCGCTCTG GTCAACGATC TGGTCGAAGA GTTCCGCACG ATAGTCGATT CACTCGTGCT CTACACCCTC AACCGGGGTC TTTTGCAGGA AAAAGACTTC TACTACCGCA AAGATGAGCC CGGTTGCTTT CTGTCGAACG ACGCCCGCAA ACGGTTTTTA AACATATTCG AAACAAGGAT GTGGCAGGAA TCCCGTGACG GCTGCACCGG CAAAACGCTC AATTTCAGGC GGCATATCGA AAAACAGGTG AGGATCATGA GAGAGGTTAT AGCCGGAACC CGAACGCAGT ACGACCCGTA CAAGCTACCG GTATAA
|
Protein sequence | MGWLYNQMAM PETIFQAWYK VASNDGRPGW DNTSIQDYSL RLEENLKSLS HALLTGTYRQ SPLLKLVMLK PDGKERVLLI PGVIDRVAQT AASIVLSPII EAELGNCTFA YRPGISREGA AREIDRLHRE GYQWVLDADI RNFFDNVRHD LLFQRLVELV DDKEMISLLH RWLTAEIVDG LNPRTRNTMG LPQGCPISPA LANLYLDRFD ETMEQQGFKL VRFADDYLVL CKTRPKAEAA LKLSESALAE LKLELHSDKT RITTFAEGFK YLGYLFIRSL VIPTKMHPEE WYDKLGKFRL RKKSVHALPA DPETMTGETS QFELETDQGE KIELSKEELL QTEFGKKLLE SLDKKQLNVD EFLEKISRQD EERQKEKREA LKKLYSPFLN TLYLQEQGSL MRKDGERFSI EKDGSVINEV IVRRIEQVVI FGNVALTTPV MQYCLQNEIP VTFLSQHGKY FGRLEATTAD NAEMQRFHFL RSIDEPFALE TARSIVAAKI SNSKTMIRRR KTVVQDRDST LQNKMAYNLD IMADLALKAE ASTDIDALRG IEGKASALYF ECYGMLFSKN LPFHTRSFLR VRRPPTDPVN SLLSFGYTML HTNIFSMVQA SGLNPYIGFL HAERKGNPAL VNDLVEEFRT IVDSLVLYTL NRGLLQEKDF YYRKDEPGCF LSNDARKRFL NIFETRMWQE SRDGCTGKTL NFRRHIEKQV RIMREVIAGT RTQYDPYKLP V
|
| |