Gene Information |
Locus tag | Francci3_3346 |
Symbol | |
ID | 3904132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3969767 |
End bp | 3970801 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880671 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_482432 |
Protein GI | 86742032 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
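The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(3969767, 3970801)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result agrees with the Gene Length (1035 bp) and Protein Length (344 aa) fields.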
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0114588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.257048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC TCCTCAACAC CCTCTACGCC ACAACACCCG GAACCAGCCT GCACCTCGAC GGCGACGCCG TACGCATCTG GCATCCCGAC AACGACAAAG GCCGCCGCCT TCTCCCCCTC GTCCGCGTCG ATCACATCGT CGTCTTCGGC GGCGTCACCA TCACCGACGA TCTCCTACAA CGCTGCGCCA CCGACCGCCG CTCCGTCACC TGGCTCACCG GCAACGGCCG CTTCCGCGCC CGCGTCGAAG GACCCACCGG CGGCAACCCC CACCTACGCA TCGCCCAACA CGATCACTTC CGCGACGACG AACGACGCCT CACTCTCGCC ATGTCATACA TAGCCGGGAA ACTCCAGAAC AGCCGCCAAC TCCTCCTCCG CGCCGCCCGC GACGCCACCG GCACCCGCCA AACCGCACTC CGCGACACCG CCGCCCACCT CGCCGACGCC CTCCCCACCC TGCGTGACAC CACCAACGTC GCCGAGGCCA TGGGCGTCGA AGGACAGGCA GCCCGCCGCT ACATCGCCAC ATGGCCGCAC CTGCTCACCC CGCACGCGAC CGTCACCGCC CCCGCCGGAC GCACCAGCCG ACCCGCCACC GACCCGGTCA ACGCCGCCCT GTCCTTCGGC TACGGCATCC TGCGCATCGC CGTCCACGGC GCCCTCGACC ACGTCGGCCT CGACCCCCAC ATCGGCTACC TCCACGGCAT CCGCCCCGGC AAACCCGCCC TCGCCCTCGA CCTCATGGAA GAATTCCGCG CCCTGCTCGT CGACCGCCTC GTCTTCACCG CCTTCAACCA GCGCCAGCTC ACCGATGCCG ACTTCGAACA CCACCCCGGC GGCTCCTGCC AGCTCACCGA GTCCGGCCGG AAAAACTACC TCACCCTGTG GAGCCAGGCA CGCGCCCGAA CCTGGCCCCA CACCCTCCTC ACCCACGACA CCCCCGCCGC CACCCTTCCC CTGCTCCAGG CCAGGATCCT CGCCCGACAC CTCCGCGGCG ACATCCCCCG GTACATCCCC TGGAGCCCTA CCTGA
|
Protein sequence | MAELLNTLYA TTPGTSLHLD GDAVRIWHPD NDKGRRLLPL VRVDHIVVFG GVTITDDLLQ RCATDRRSVT WLTGNGRFRA RVEGPTGGNP HLRIAQHDHF RDDERRLTLA MSYIAGKLQN SRQLLLRAAR DATGTRQTAL RDTAAHLADA LPTLRDTTNV AEAMGVEGQA ARRYIATWPH LLTPHATVTA PAGRTSRPAT DPVNAALSFG YGILRIAVHG ALDHVGLDPH IGYLHGIRPG KPALALDLME EFRALLVDRL VFTAFNQRQL TDADFEHHPG GSCQLTESGR KNYLTLWSQA RARTWPHTLL THDTPAATLP LLQARILARH LRGDIPRYIP WSPT
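The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch rather than the database's own method: the helper names are hypothetical, the codon table is the standard NCBI compact form, and it relies on translation table 11 assigning the same amino acids as table 1 (the two differ only in permitted start codons). Whitespace is stripped because the record prints sequences in blocks of ten.

```python
# Standard NCBI compact codon table, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing ambiguity codes (e.g. N) raise KeyError in this sketch.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence above, with the record's spacing.
print(translate("ATGGCTGAGC TCCTCAACAC C"))
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input gives a value to compare against the GC content field.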