Gene Information |
Locus tag | Gura_0835 |
Symbol | |
ID | 5166021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 992075 |
End bp | 992998 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640548333 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001229616 |
Protein GI | 148262910 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
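The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(992075, 992998)
print(length, protein_length(length))  # 924 307
```

Under these assumptions the result agrees with the Gene Length (924 bp) and Protein Length (307 aa) fields.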
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.855521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAC CAATCCTCCC CCCATTGAAG CCGCTCCCCA TCAAGGACCG CATCTCGGTC GTTTACGTGG AACGGGGCAA CCTGGATGTC CTTGACGGCG CCTTTGTGGT CGTGGACAAG ACCGGCGTCC GCACCCATAT CCCCATCGGC GGGGTGGCCT GCCTGATGCT GGAGCCGGGG GCGCGGGTTT CCCACTCTGC CGTGGTGCTG GCGGCGCGGG TCGGGTGTCT GCTGGTCTGG ATCGGCGAGG CCGGGGTGCG CATGTATGCC GCCGGTCAGC CGGGGGGTGC CCGGGCCGAC CGGCTTTTGT ACCAGGCAAA GCTGGCCCTG GACGATACAT CGCGGCTGAA GGTGGTGCGC AAGATGTACG CGATCCGCTT CCAGGAGGAG CCGCCGGAGC GGCGCAGTGT GGACCAGTTG CGCGGTATCG AGGGGGTGCG GGTACGGAAA ATGTACGAGC TGCTGGCCCG GCAGCATGGG GTGGAGTGGC AGCGCCGCAA TTATGATCAC AGCGAATGGG GGAGCGGCGA TGTGCCCAAT CGCTGCCTTT CTTCGGCCAC CGCCTGCCTG TACGGCATCT GTGAGGCGGC CATCCTGGCG GCAGGGTACG CCCCTGCGAT CGGTTTCATC CACACCGGCA AGCCCCAGTC ATTTGTCTAC GACGTGGCCG ATATTTTCAA ATTCGAGACG GTGGTCCCGG TGGCGTTTCG TATCGCCGCC AGGCAGCCCC GCAACCCGGA ACGCGAGGTG CGGCTGGCCT GCCGGGATGC CTTCCGTCAA TCCAAGCTGC TGCAGCGGAT CATTCCTACA ATCGAGCAGG TGCTGGCGGC TGGCGGGCTG GAGGTGCCGA AGGCCCATGA AGAGGCGGTA GTGCCCGCCA TTCCAAACAA GGAGGGCCTT GGTGATCCGG GGCAAAGAGT TTGA
|
Protein sequence | MTEPILPPLK PLPIKDRISV VYVERGNLDV LDGAFVVVDK TGVRTHIPIG GVACLMLEPG ARVSHSAVVL AARVGCLLVW IGEAGVRMYA AGQPGGARAD RLLYQAKLAL DDTSRLKVVR KMYAIRFQEE PPERRSVDQL RGIEGVRVRK MYELLARQHG VEWQRRNYDH SEWGSGDVPN RCLSSATACL YGICEAAILA AGYAPAIGFI HTGKPQSFVY DVADIFKFET VVPVAFRIAA RQPRNPEREV RLACRDAFRQ SKLLQRIIPT IEQVLAAGGL EVPKAHEEAV VPAIPNKEGL GDPGQRV
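The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the method used to build this record. It assumes the sequence is given in space-separated blocks as shown here, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter for a gene that begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order; shared by the standard code
# and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
fragment = "ATGACCGAAC CAATCCTCCC CCCATTGAAG"
print(translate(fragment))  # MTEPILPPLK
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the same comparison against the GC content and Protein sequence fields.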
|
| |