Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1298 |
Symbol | |
ID | 3744772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1470144 |
End bp | 1471043 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637769334 |
Product | CRISPR-associated Csh2 family protein |
Protein accession | YP_375201 |
Protein GI | 78187158 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family [TIGR02589] CRISPR-associated protein, Csd2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.841348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC TTACAAAGAG GTACGATTTT GCATTACTGT TTGATGTACA GGACGGCAAT CCAAACGGTG ATCCTGATGC AGGAAATCTG CCCAGAATAG ATGCAGAAAC CGGCATGGGT CTGGTAACCG ATGTTTGCCT AAAACGCAAG GTTAGAAACT ATGTGCAGCT TTCAGGTAAG GATATTTTTA TCAAGGAAAA AGCTGTTTTG AATACCCTAA TCAGCAATGC ATATGAAGAG CAGAAAATAG ACCTTACAAA AGATCCTGTC GATTTGAAAG ATGGCAAGAA ACGCAACAAA GACGGCACTG CGCAAGGTGG TGAGGTCGAG AAAGGCCGTT CCTATATGTG TTCAAGATAC TACGACATCC GCACATTTGG AGCGGTGATG TCTACAGGAG CTAACGCTGG TCAGGTGCGT GGACCCATTC AGATTACTTT TGCCCGGTCG GTTGAGCCGG TCGTTGCATT GGAGCACAGC ATTACCCGTA TGGCTGTCAC AACTGAAGCG GATGCGGAAA AACAAAGCGG CGATAACCGG ACGATGGGCA GAAAGTACAC TGTACCCTAC GGGCTGTATT GTTCACATGG TTTCGTTTCG GCTCACCTCG CAAATCAAAC CGGCTTCTCA GCAGAAGATC TCAAACTGTT CTGGGAAGCC TTACAGAATA TGTTCGAACA TGACCGCTCT GCAGCCCGAG GTATGATGTC TACTCGAGGA CTCTATGTTT TCGAACACAG CACAGCTTTG GGTAACGCTC CGGCCCACAA GTTGTTTGAA CGGATTAAAG TGGAACGGAA GCCGGAATCA GAAGGTCCGG CTCGTTCATT TGAAGACTAT ACCGTTACGA TAGATGAAAG CGGACTTGAC GGCGTGACTT TGCATAAAAT GCTGTGCTGA
|
Protein sequence | MSDLTKRYDF ALLFDVQDGN PNGDPDAGNL PRIDAETGMG LVTDVCLKRK VRNYVQLSGK DIFIKEKAVL NTLISNAYEE QKIDLTKDPV DLKDGKKRNK DGTAQGGEVE KGRSYMCSRY YDIRTFGAVM STGANAGQVR GPIQITFARS VEPVVALEHS ITRMAVTTEA DAEKQSGDNR TMGRKYTVPY GLYCSHGFVS AHLANQTGFS AEDLKLFWEA LQNMFEHDRS AARGMMSTRG LYVFEHSTAL GNAPAHKLFE RIKVERKPES EGPARSFEDY TVTIDESGLD GVTLHKMLC
|
| |