Gene Information |
Locus tag | Rfer_3907 |
Symbol | |
ID | 3961547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodoferax ferrireducens T118 |
Kingdom | Bacteria |
Replicon accession | NC_007908 |
Strand | + |
Start bp | 4363903 |
End bp | 4364931 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637918732 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_525137 |
Protein GI | 89902666 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
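The coordinates, gene length, and protein length listed in this table are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends with a single stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4363903, 4364931)
print(length_bp, protein_length(length_bp))  # prints: 1029 342
```

Under these assumptions the result matches the Gene Length (1029 bp) and Protein Length (342 aa) rows.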
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.63849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAACTCC TCAACACCCT CTACGTCACC ACGGCGGACA CTTATCTGCG TCTGGACAAC GACACCCTGC GCGTGGAGGT GGAGCAAGAA ACCAGGCTAC GTGTGCCACT GCACCACTTG AGCGCAGTGG TGTGTTTCGG TCACACGGGT TTGTCGGCAC CGCTCATGCA CCGCCTGGCT GAAGGCGGCA TTGCGCTGGT GTTGCTGGAT GACAATGGGC GTTTCAAAGC AAGGTTAGAG GGCGCAGTCA CAGGCAATGT TCTGCTGCGC CAAGCTCAGT TTGGGCGTGT GGCAGACCCC GCGTTTACGC TGGACATGGC ACGCGCCTGT GTGGCGGGCA AGATCAAAAA CACCCGGCAG GTGTTGCAAC GCGGTGCCCG CGAAGCCAAG TCAGAAGACG AAGCCCAGGT ATTAACCCGC CTGGCCGACG ACCTGGCTGC CAGTTTGCGC GCGCTCACCG AGGCCACCAG TCTGGACGTC TTGCGGGGCA TAGAGGGTGA GGCCGCGCGG CAGTATTTCA TTGGCCTCAA TTTGCTGGTG CGCCCTGAGT CGCGCGCGGT CTTCCAGATG GATGGGCGCA CGCGTCGCCC GCCGCGTGAC CGCTTCAATG CCATGTTGTC ATTCTTGTAT TCCATGTGGA TGAACGACTG CCGTAGCGCG CTGGAGGCAG CCGGGCTGGA CCCGCAAGTG GGTTTTTTGC ATGCCTTGCG ACCGGGCCGC GCTGCACTGG CGCTGGATTT GATGGAAGAG TTTCGCCCCT GGGCTGATCG CTTGGCCCTG ACGCTCATCA ACCGAGGTCA GTTGACTGCT GATGATTTTG TTTTGCGTGA AGGCGGTGGC GTGTTACTGG AGCCGGATGC GCGCAAGGCG GTGGTGGTGG CGTATCAAGA ACGCAAGCGC GACGAGATCA ATCACCCGCT GCTGGCGCAG TCTGTCCCCT TGGGCCTGGT GCCGCTGGTG CAGGCACGCT TGATGGCGCG CGCTTTGCGC GATGACGGTG CACCGTATGT GCCGTTTGTG GCCAAGTAG
Protein sequence | MQLLNTLYVT TADTYLRLDN DTLRVEVEQE TRLRVPLHHL SAVVCFGHTG LSAPLMHRLA EGGIALVLLD DNGRFKARLE GAVTGNVLLR QAQFGRVADP AFTLDMARAC VAGKIKNTRQ VLQRGAREAK SEDEAQVLTR LADDLAASLR ALTEATSLDV LRGIEGEAAR QYFIGLNLLV RPESRAVFQM DGRTRRPPRD RFNAMLSFLY SMWMNDCRSA LEAAGLDPQV GFLHALRPGR AALALDLMEE FRPWADRLAL TLINRGQLTA DDFVLREGGG VLLEPDARKA VVVAYQERKR DEINHPLLAQ SVPLGLVPLV QARLMARALR DDGAPYVPFV AK
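Both sequences are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The sketch below is one possible way to do that in Python, and to compute a GC percentage and translate a coding sequence. It uses the amino-acid assignments of NCBI translation table 11, the table named in this record. It does not handle alternative start codons, and the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, not part of any database tool. Only the first three blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11,
# with codons enumerated in base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGCAACTCC TCAACACCCT CTACGTCACC")
print(translate(first_blocks))  # prints: MQLLNTLYVT
```

The translated fragment agrees with the first block of the protein sequence. Applying `gc_percent` to the full cleaned gene sequence should give a value comparable to the GC content row, subject to rounding.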