Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_1154 |
Symbol | |
ID | 6163077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 1237367 |
End bp | 1238398 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641663908 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001790188 |
Protein GI | 171057839 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTCC TCAACACCCT CTACATCACC CTGCCCGACA GCTACCTGCG GCTCGACAAC GACACCCTGC GCGTGGTCGA CGAAGACAAG GAAACCCGCC TGCGCGTGCC GCTGCACCAT CTGCAGGCGG TGGTGTGTTT CGGCCACGTC GGCCTGAGCG CCAAGCTGAT GCACCGGCTG GCCGAAGACG GCATCGCCCT CGTGCTGCTC GATGCCAACG GCCGCTTCAA GGCGCGGCTG GAAGGCGAGA CCAGCGGCAA CGTGCTGTTG CGCCGCGCCC ATCACCAGGC GGTCGACAGC GCCGCGTTCA CGCTCGAAGC GGCTCGTTGC ATCGTGGCCG GCAAGCTGCG CAACCAGCGC CAGGTGCTGC TGCGCGGCGC CCGTGAATCG AAGGATCCGG GTGAAGAAGC CCAGCTCACC CGCGCCGCAC AAGACCTGGC GGCCAGCCTG CGCGCACTGC CCGCGGCGGC CGATCTCGAC GTCCTCCGCG GCATCGAGGG CGAGGCCGCG CGCACCTACT TTGCCGCGCT CAACCTGCTG GTACGTGCCG ACCGGCGCGA TCATTTCCAG ATGAACGGCC GCAGCCGCCG CCCGCCGCGC GACCGCATGA ACGCGCTGCT CAGCTTCTTC TATGCAATGT GGATGAACGA CTGCCGCAGT GCCATCGAGG CCGCCGGGCT CGATCCGCAG ATGGGCTTTC TGCATGCACT GAGGCCGGGG CGCGCGGCGC TGGCGCTCGA TCTGATGGAG GAGTTTCGCC CGTTCGCCGA CCGGCTGGCG CTCACGCTGG TAAACCGCGC GCAGGTCAAC GAAGACGACT TCGTGGAGCG TGAAGGCGGC GCCGTACTGC TGGAGGGCGA TGCGCGCAAG GCGGTGGTGG TGGCGTATCA GGAGCGCAAG CAGGAGGAGT TGACACACCC GCTGCTGGCC GAAAGCGTTC CGCTCGGACT GGTGCCGCTG GTGCAGGCGC GGTTGCTGGC GCGCCATGTG CGCGGCGAGG CGCCGAGTTA CGTGCCATTT GCGATGCGCT GA
|
Protein sequence | MQLLNTLYIT LPDSYLRLDN DTLRVVDEDK ETRLRVPLHH LQAVVCFGHV GLSAKLMHRL AEDGIALVLL DANGRFKARL EGETSGNVLL RRAHHQAVDS AAFTLEAARC IVAGKLRNQR QVLLRGARES KDPGEEAQLT RAAQDLAASL RALPAAADLD VLRGIEGEAA RTYFAALNLL VRADRRDHFQ MNGRSRRPPR DRMNALLSFF YAMWMNDCRS AIEAAGLDPQ MGFLHALRPG RAALALDLME EFRPFADRLA LTLVNRAQVN EDDFVEREGG AVLLEGDARK AVVVAYQERK QEELTHPLLA ESVPLGLVPL VQARLLARHV RGEAPSYVPF AMR
|
| |