Gene Information |
Locus tag | Dole_2989 |
Symbol | |
ID | 5695848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3588767 |
End bp | 3589696 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641265605 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001530869 |
Protein GI | 158522999 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
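The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other under two common conventions. Both are assumptions here, not something the record states: coordinates are 1-based and inclusive, and the CDS ends in a single stop codon that is not counted in the protein. A minimal sketch of that check follows; the function names are hypothetical.

```python
# Consistency check for the coordinate and length fields of the record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS carries exactly one terminal stop codon.

START_BP = 3588767  # "Start bp" field
END_BP = 3589696    # "End bp" field


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 930, matching "Gene Length"
    print(protein_length(length))  # 309, matching "Protein Length"
```

Because "Is gene spliced" is No, the whole interval is coding, so no intron lengths need to be subtracted before dividing by three.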
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.786147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACT CTCATTTCGT TCCACTCAAA CCCATACCCA TAAAGGACCG TGTGTCCATG GTATTTGTGT ACTACGGCCA GATTGATGTG AAAGACGGCG CTTTTGTGGT GGTGGATGAT GTCAACGGCG AGCGGAAGCA CATTCCGGTA GGGTCGGTAG CATGCATCAT GCTGGAGCCC GGCACCCGTA TCAGCCACGC CGCCGTGAAG CTGGCGGCCA CCACAGGAAC CTTGTTGATA TGGGTTGGAG AAGCCGGAGT GAGGCTGTAC TCGGCGGGTC AGCCTGGTGG TGCCCGTTCG GACAAGCTGC TGTATCAGGC CAAACTTGCC CTGGATGAAC AGTTGCGGCT GAAGGTAGTG CGCCGAATGT TTGAAATCCG GTTTGGTGAA AAGCCGCCGG AACGCCGCAG TGTGGATCAG CTACGCGGGA TCGAAGGCGC GCGGGTCAGA AAAATATATG AACTTATGGC AAAGCAGTAT GGTGTGCCAT GGAAGGGCAG GCGCTATGAT CCTAAAGACT GGGAGTCCGG TGATGTTATA AATAAATGTT TGAGCGCCGC CACATCATGC TTGTATGGCG TTACCGAAGC CGCCATTCTG GCGGCCGGAT ACGCACCCGC TGTCGGCTTC ATCCATTCTG GAAAGCCACT GTCGTTTGTG TATGATATTG CCGATATCTA TAAGTTTGAT ACAGTGGTGC CGGTGGCGTT TCGTGTGGCA GCAAAAAAAC CGGCTCATCC GGACCGAGAG GTGAGGATCG CCTGCCGGGA TGCGTTTCGG GAAACCCGGC TGCTTCAGAA AATTATTCCG GGAATAGAAG ATATCCTTTC AGCCGGTGGG ATTAAGCCGC CTCCACCCCC TGAAGATGCC CAGCCTCCGG CAATACCGGA GCCTGAATCA ATCGGAGACC AGGGTCACAG GAGCGCATAG
|
Protein sequence | MTDSHFVPLK PIPIKDRVSM VFVYYGQIDV KDGAFVVVDD VNGERKHIPV GSVACIMLEP GTRISHAAVK LAATTGTLLI WVGEAGVRLY SAGQPGGARS DKLLYQAKLA LDEQLRLKVV RRMFEIRFGE KPPERRSVDQ LRGIEGARVR KIYELMAKQY GVPWKGRRYD PKDWESGDVI NKCLSAATSC LYGVTEAAIL AAGYAPAVGF IHSGKPLSFV YDIADIYKFD TVVPVAFRVA AKKPAHPDRE VRIACRDAFR ETRLLQKIIP GIEDILSAGG IKPPPPPEDA QPPAIPEPES IGDQGHRSA
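The GC content and protein sequence fields can in principle be re-derived from the gene sequence. The sketch below is one hedged way to do so without external libraries. It assumes the sequence is pasted as shown, in whitespace-separated blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in which codons may act as starts). The function names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Re-deriving GC content and the protein from a nucleotide sequence.
# Codon assignments follow the standard code, which translation table 11
# shares for elongation; alternative start codons are not handled here.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First five codons of the gene sequence plus its final stop codon.
    fragment = "ATGACTGACT CTCATTAG"
    print(translate(fragment))  # MTDSH, the start of the protein sequence
```

Passing the full Gene sequence value to `gc_percent` and `translate` should give figures comparable to the GC content and Protein sequence fields, although the database may round or compute GC content differently.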
|
| |