Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1431 |
Symbol | |
ID | 4568992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1633521 |
End bp | 1634552 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766017 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_911883 |
Protein GI | 119357239 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0863156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAGC TGCTCAACAC GCTGTTCGTT ACCACGCAGG GTGCATATCT GTCGAAAGAA GGTGAGTGCG CGGTCATAAA AATCGATAAA GTTGAAAAAG TACGATTGCC TCTTCATATG CTTGACGGTA TTATCTGTTT CGGACAGATC ACCTGCAGCC CATTTCTCAT GGGTCACTGT GCAGAAAAGG GTGTGACGGT GACGTTCCTT ACCGAATACG GCAAATTCCT CTGTCAGGTG CAGGGGCCAA CGAAAGGCAA CATTCTGCTC AGGCGCGCAC AATACCGGCA AGCAGACAAC TATCAGCAGT CGGCCATGCT TGCCCGGGCG TTCGTCATAG GAAAAATCGG CAACAGCAGG GTTACCCTTG CAAGGGCCAT GCGGGACCAC CCCGAAAAGG TCGATGGCGA AAAAATGCAT TATGCACAGC AACTGCTTGC CGGTTGCATT AAAAAGCTTG GCGATGAAAC CGATCAAGAG CGAATCAGGG GGATCGAGGG TGAAGCAGGA AGAATTTATT TCGAGGTATT CGACCAGTGC ATCACAACTT CCGACCCGTT GTTCCGGTTT AATGGCCGAA ACCGTCGACC GCCGGTTGAC CGGGTAAATT GTCTGCTTTC GTTTCTCTAT ACCCTTGTGA CGCATGATAT CCGCTCCGCA CTTGAGTCAT GCGGGCTCGA TCCGGCAGCG GGTTTTCTGC ACAAGGATCG CCCGGGTCGT CCGAGCCTCG CTCTCGATAT GCTCGAAGAG TTTCGTTCCT ATATCGCCGA CAGAATGGCA TTGTCGTTAA TCAATCGGGG TCAGATTCAG GCAAATGATT TCACGGTATC CGATACTGGC GCTGTGCTGA TGAAAGACGA TGCAAGAAAA ACCTTGCTAA CGGCTTACCA GAAAAGAAAA CAGGAAGAAA TAGAACATCC GTATGTCAGG GAAAAAATGG CTGTGGGTCT CATCTGGCAT ATGCAGGCTA TGTTGCTGGC TCGGTATATC CGGGGGGATA TCGATATGTA CCCTCCTTTT GTCTGGAGGT AA
|
Protein sequence | MKKLLNTLFV TTQGAYLSKE GECAVIKIDK VEKVRLPLHM LDGIICFGQI TCSPFLMGHC AEKGVTVTFL TEYGKFLCQV QGPTKGNILL RRAQYRQADN YQQSAMLARA FVIGKIGNSR VTLARAMRDH PEKVDGEKMH YAQQLLAGCI KKLGDETDQE RIRGIEGEAG RIYFEVFDQC ITTSDPLFRF NGRNRRPPVD RVNCLLSFLY TLVTHDIRSA LESCGLDPAA GFLHKDRPGR PSLALDMLEE FRSYIADRMA LSLINRGQIQ ANDFTVSDTG AVLMKDDARK TLLTAYQKRK QEEIEHPYVR EKMAVGLIWH MQAMLLARYI RGDIDMYPPF VWR
|
| |