Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_0445 |
Symbol | |
ID | 6354440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 497489 |
End bp | 498643 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642668076 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001942517 |
Protein GI | 189345988 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATCT CAATCCCGAC AAAACAGCCG TTTTACATCT TCTCGAACGG TGTACTCCTT CGAAAGGAGA ATACGATCAG TTTTGTGCCG TATGTGACGC AGGATGAGAT TACGGTTGAA ACTAATCCGT CGCTGTACCT TGAGCCGGAT GAAGAGGAGG CCTATTCCCT TAATCCGGTC AAAGATGAAC ATCTCAATAC GGCTGCAAGA CGGGTGATAC CGATAAACAA TATCGATTCT TTTTTTGTTT TCGGGGAGGT GAGTTTTAAC ACAAAGTTCT TGAACTTCCT GACCAGAAAC CGCATTCCCC TGCATTTGTT CAACTATTAC GGGTTTTATT CTGGGTCATA CTATCCGAGG GAACATCTCC TTTCGGGGTA TCTGGTTGTC AATCAGGTGA AGCACTACAG CTCGACAAAA AAACGTCTTG AAATAGCCAG AGAATTTATC GGGGCAGCGG CGGCCAACAT TATCAGGAAT CTGAAGTATT ACACTGCTGA CTCGAGACAG GGTGTACAGG ATGATGAGAG CCTGGCGATG CTGTTCCATA CCATAGCACA GATAGAATCA CTTGCAAACG GCATTGCTGC TGCGCAGGAC ATTCCCTCGC TGATGGGTGT CGAAGGTAAT ATCCGAAAAG TTTATTATCA GGTATGGCAA CAACTGCTAC GCTCAGCTGA CCCTGCTTTT TCTTTTTCGG AACGTGTCAA GCGCCCGCCG GACAACGCAG TTAATGCTTT GGTTTCGTTC GGCAACAGCC TCATGTATTC CGCATGCTTG ACTGAAATCT ATCGGACACA GCTCAATCCC ACCGTCTCGT TTCTGCATGA GCCGTCAGAA CGTCGATTTT CGCTCGCTCT TGATATGGCA GAAGTCTTCA AACCAATGTT TATCGACCGG TTGATATTTA AGTTGGTTAA CACGAGGGCA ATTCAGGCCA GGCATTTTAC GACAGCCCTG AATTTTTGTC ATCTAAACGA CGCAGGTCGA AAAATCGTGG TCAAGGAGTT CGAAGAGCGA ATGCGGACAA CCATAAAGCA TCGAGGTCTT GACAGGAATG TGTCGTACCG AAGGCTCATC CGCCTTGAAT GCTATAAACT CATCAAACAT CTCATCGGAG AAGAGCCATA CCATGCATTC CGTACATGGT GGTAA
|
Protein sequence | MPISIPTKQP FYIFSNGVLL RKENTISFVP YVTQDEITVE TNPSLYLEPD EEEAYSLNPV KDEHLNTAAR RVIPINNIDS FFVFGEVSFN TKFLNFLTRN RIPLHLFNYY GFYSGSYYPR EHLLSGYLVV NQVKHYSSTK KRLEIAREFI GAAAANIIRN LKYYTADSRQ GVQDDESLAM LFHTIAQIES LANGIAAAQD IPSLMGVEGN IRKVYYQVWQ QLLRSADPAF SFSERVKRPP DNAVNALVSF GNSLMYSACL TEIYRTQLNP TVSFLHEPSE RRFSLALDMA EVFKPMFIDR LIFKLVNTRA IQARHFTTAL NFCHLNDAGR KIVVKEFEER MRTTIKHRGL DRNVSYRRLI RLECYKLIKH LIGEEPYHAF RTWW
|
| |