Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1433 |
Symbol | |
ID | 4568994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1635196 |
End bp | 1636296 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639766019 |
Product | CRISPR-associated Csd2 family protein |
Protein accession | YP_911885 |
Protein GI | 119357241 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family [TIGR02589] CRISPR-associated protein, Csd2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.562068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACTG TAGATAAACG CTATGATTTT GTTCTACTTT TCGATGTTCA GGACGGAAAC CCGAATGGCG ATCCCGATGC TGGAAATTTT CCACGAATCG ATGCTGAAAC CGGGATAGGG CTTGTGACTG ATGTTTGCTT GAAGCGCAAG GTTAGAAATT TTGTGCAGAT AAATGATGTT CAACAGCCGG GTTATGATAT TTATGTCAAG GAGAAAGCTG TGCTTGGTTT GGCTCATTTC AAGGCATTTA AAGAGTTAGG AATCAGTACT GGCGAAATGT CAAAAAAAGT CATTAACGAT CAAGAGATGA TTGAAATATT TTCTGATCTG ACTTTACCAG AAGGGGTATC TTTTGCCGAG AGTGATGAAC ATGGAGTATT ATCAATTGCT GCTGATGCGG ATAAAAACGA GATCAAAGAT TGGATGAAAG CTGAAAAAGA GAGCCTTTCT AAAAATGTTA TAAAAGTTAT TAGTGAGGCT CTGAAAGAAG CAAAGCCACG GAAGCCAACT GCTGAAGAAA CAAGTAGAGG CAAGGAAAAA ATGTGTCAGG ATTACTATGA TATTCGAACT TTTGGTGCGG TAATGTCTTT GAAGTCAGCT CCCAATTGTG GGCAGGTGCG CGGGCCGATT CAGATGACGT TTGCCCGCTC AGTCGAACCG ATCGTGGCGT TGGAGCACAG CATTACACGA ATGGCTGTAG CAACGGAAGC TGAAGCGGAA AAGCAGAGCG GCGACAACCG AACGATGGGC CGCAAATACA CCGTACCATA TGGCTTGTAT CGCGCTCATG GATTCGTGTC AGCAAACCTC GCCCATCAGA CAGGTTTTTC TGAAAACGAT CTCGATCTGT TCTGGAACGC TCTTTTGAAT ATGTTCGATC ATGACCGTTC GGCAGCTCGC GGGCTGATGT CCACGCGCGG CCTGTATGTT TTCGAGCACA GCTCAGTTTT GGGTAATGCT CCGGCAAGCC AGCTTTTCGA GCGAATCACG GTCAAGCGCA AAGAGGATTC CGAAGGTCCT GCTCGTTCGT TCAAAGAATA CGATGTGCTG ATAGATGAAT CCAGTCTTGG TGAGGTGAAG CTTCTCCGAA AACTTGGATA A
|
Protein sequence | MTTVDKRYDF VLLFDVQDGN PNGDPDAGNF PRIDAETGIG LVTDVCLKRK VRNFVQINDV QQPGYDIYVK EKAVLGLAHF KAFKELGIST GEMSKKVIND QEMIEIFSDL TLPEGVSFAE SDEHGVLSIA ADADKNEIKD WMKAEKESLS KNVIKVISEA LKEAKPRKPT AEETSRGKEK MCQDYYDIRT FGAVMSLKSA PNCGQVRGPI QMTFARSVEP IVALEHSITR MAVATEAEAE KQSGDNRTMG RKYTVPYGLY RAHGFVSANL AHQTGFSEND LDLFWNALLN MFDHDRSAAR GLMSTRGLYV FEHSSVLGNA PASQLFERIT VKRKEDSEGP ARSFKEYDVL IDESSLGEVK LLRKLG
|
| |