Gene Information |
Locus tag | Clim_0929 |
Symbol | |
ID | 6354166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1012746 |
End bp | 1014941 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642668556 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001942987 |
Protein GI | 189346458 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair; [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
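The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, and a short check can confirm that a record is internally consistent. The sketch below is illustrative only and assumes two things the record does not state explicitly: that the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 1012746
END_BP = 1014941


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive interval [start_bp, end_bp]."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the CDS includes one terminal stop codon, which does
    not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2196
print(protein_length(length_bp))  # 731
```

Both values agree with the Gene Length (2196 bp) and Protein Length (731 aa) fields, which is consistent with the assumptions stated above.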
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.400864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGATGGC TCTACAACCA GATGGCAATG CCCGAAACTA TTTTTCAGGC ATGGTACAAG GTGGCTTCGA ACGACGGCCG TCCCGGATGG GATAACAAAT CCATCGAGGA CTACTCCCTG CAGCTTGAAG AGAACCTTAA AGCCCTATCG CAAGCTCTTC TGACAGGCAC CTACAAACAG GGTCCGTTGA TGAAACTCGT GTTGCTGAAA CCTGATGGAA AGGATCGGGT TCTTTTGATA CCAGGCGTAA TGGACAGGGT TGCCCAGACT GCGGCGGCAA TCGTGCTGAG CCCGATCATC GAAGCCGAAC TGGGTAACTG TACCTTTGCC TACCGTCCCG GCATATCGCG CGAAGGAGCT GCACGGGAGA TTGACCGGTT GCACCGTGAA GGGTATCAGT GGGTGCTCGA TGCAGATATC CGCAGCTTTT TCGACAACGT CCGCCATGAC CTGCTCTTCC AGAGGCTGGT TGAGCTTATC GACGACAAGG AAATGATTTC GCTGCTGCAC CGGTGGCTTA CCGCTGAAAT CGTTGACGGC ATCAATCCCC GCATACAAAA TACCATGGGC CTGCCACAGG GATGCCCGAT CTCACCGGCT CTGGCCAATC TCTATCTCGA CCGCTTCGAT GAAACAATGG AAAAAGAAGG GTTCAAACTG GTTCGCTTCG CCGACGATTA TCTCGTGCTC TGCAAAACCC GTCCCAAAGC CGAAGCCGCG CTGAAGCTTT CGGAAACCGC GCTTGCCGAA CTGAAACTTG AACTGCACAG CGATAAAACC CGTATTACCA CCTTTGCCGA AGGGTTCAAG TATCTCGGTT ACCTCTTTAT TCGCGCACTG GTTATTCCCA CCAAAATGCA CCCCGAAGAG TGGTACGACA AGCTCGGCAA GTTCAAGCTT CGCAAAAAGA GCGAACATGC CCTGCCCTCC GACCCCGACG CAATGACCGG CGAAACAGCA AAGTTCGAGC TCGAAACCGA TCAGGGCGAA AAAATAGAAC TCACCAAAAA CGAGCTTCTG CAAACCGAGT TCGGGTGCAA GCTGCTTGAA AGTCTCGATA AAAAACAGTT GAGCGTTGAC GAGTTTCTCG AAAAAGTTGC CCGGCAGGAC GAAGAACGGC AGAAAGAGAA GCGCGATGCG CTGAAAAAAC TCTATTCACC TTTTCTGAAC ACCCTCTACC TGCAGGAGCA GGGGAGTCTC ATGCGCAAGG ACGGAGAGCG GTTCAGCATT GAAAAGGATG GAGCGGTCAT CAACGAAGTG ATCGTCCGCC GCATCGAACA GGTTGTGGTG TTCGGTAATA TCGCCCTCAC CACTCCGGTC ATGCAGTACT GCATGCAAAA CGAAATTCCG GTTACCTTTC TTTCGCAGCA TGGCAAATAC TTCGGCAGGC TTGAATCGAC CATGGCCGAC AATGCCGAAA TGCAGCGCTA TCATTTTCTT CGTTCCATCG ACGAACCCTT TGCGCTTGAA ACCGCCCGCT CCATCGTTTC GGCAAAAATC GGCAACAGCA GGACCATGAT TCGCCGCCGA AAATCCGTCA TGCAGGATTG CGACGGCACG CTGCAGAGTA AAATGACCTG CAACCTCGAC ATCATGGCCG ATCTTCTCCT CAAGGCCGAA ACCTCCACAG ACATCGACGT ACTGCGAGGT CTCGAAGGCA AGGCTTCGGC TCTCTACTTC GAGTGTTACG GCATGTTCTT CAGCAAAAAC CTGCCGTTCC ATACCGCTTC GTTTCTGCGG GTTCGACGTC CGCCAACCGA TCCTGTCAAC AGCCTGCTCA GTTTCGGTTA CTCGCTTTTG 
CATACCAACG TATTCTCAAT GGTGCAAATG AGCGGGCTCA ACCCCTATAT CGGCTTTCTC CATGCAGAAC GAAAAGGCAA TCCCGCCCTG GTCAACGATC TCGTCGAAGA GTTCCGCACC GTAATCGATT CTCTGGTGCT CTATACCATC AACAGGGGTC TTCTGCATGA GAACGACTTC TACTATCGCA AAGATCAGCC GGGCTGCTTT CTCTCGAACG ACGCCCGAAA ACGTTTTTTA CAGATTTTCG AAACAAGAAT GTGGCAGGAA TCCCGGGACG GCTACACCGG CAAAACGCTC AACTTTCGGC GGCACATAGA AAAGCAGGTG AGAATCATGC GGGATGTCAT ATCCGGAACC CGAACGCAGT ACGATCCGTA CAAGCTGGTA TGGTGA
Protein sequence | MGWLYNQMAM PETIFQAWYK VASNDGRPGW DNKSIEDYSL QLEENLKALS QALLTGTYKQ GPLMKLVLLK PDGKDRVLLI PGVMDRVAQT AAAIVLSPII EAELGNCTFA YRPGISREGA AREIDRLHRE GYQWVLDADI RSFFDNVRHD LLFQRLVELI DDKEMISLLH RWLTAEIVDG INPRIQNTMG LPQGCPISPA LANLYLDRFD ETMEKEGFKL VRFADDYLVL CKTRPKAEAA LKLSETALAE LKLELHSDKT RITTFAEGFK YLGYLFIRAL VIPTKMHPEE WYDKLGKFKL RKKSEHALPS DPDAMTGETA KFELETDQGE KIELTKNELL QTEFGCKLLE SLDKKQLSVD EFLEKVARQD EERQKEKRDA LKKLYSPFLN TLYLQEQGSL MRKDGERFSI EKDGAVINEV IVRRIEQVVV FGNIALTTPV MQYCMQNEIP VTFLSQHGKY FGRLESTMAD NAEMQRYHFL RSIDEPFALE TARSIVSAKI GNSRTMIRRR KSVMQDCDGT LQSKMTCNLD IMADLLLKAE TSTDIDVLRG LEGKASALYF ECYGMFFSKN LPFHTASFLR VRRPPTDPVN SLLSFGYSLL HTNVFSMVQM SGLNPYIGFL HAERKGNPAL VNDLVEEFRT VIDSLVLYTI NRGLLHENDF YYRKDQPGCF LSNDARKRFL QIFETRMWQE SRDGYTGKTL NFRRHIEKQV RIMRDVISGT RTQYDPYKLV W
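The protein sequence can be derived from the gene sequence using the translation table listed in the record (table 11). The following is a minimal sketch of that conversion, not the database's own pipeline. It assumes the gene sequence is given on the coding strand, strips the spaces that group the bases in blocks of ten, and uses the codon-to-amino-acid assignments of NCBI table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten)."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
GENE_START = "ATGGGATGGC TCTACAACCA GATGGCAATG"
print(translate(GENE_START))  # MGWLYNQMAM
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the Protein sequence and GC content fields to be checked against the record.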