Gene Information |
Locus tag | Francci3_0023 |
Symbol | |
ID | 3903598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 30875 |
End bp | 31867 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637877353 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_479146 |
Protein GI | 86738746 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
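
The length fields in this record are consistent with one another if Start bp and End bp are read as 1-based, inclusive coordinates (an assumption; the record does not state its convention). The gene length is then End - Start + 1, and the protein length is the codon count minus the terminal stop codon. A minimal Python sketch of that check, with hypothetical function names that do not belong to any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(30875, 31867)
print(length)                     # 993
print(protein_length_aa(length))  # 330
```

Both results match the Gene Length (993 bp) and Protein Length (330 aa) fields.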
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.408897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGTGGC TGTCCAATCC GCAGGACCTG CACCGCGTCG AGGACCGCGT CTCCACGCTC TACGTCGAGA AATGCCACGT TGACCGCGAC GACAACGCCG TCGTCCTGGT CAACAAGGAA CGCACCGTCC GTGTCCCCGC CGCGTTCGTC GCCACCGTCC TGCTCGGTCC CGGAACCCGC ATCACCAGCG CCGCCGTGCG CCTGCTCGCC GACTCCGGCA CCGCCCTGTG CTGGGTCGGC GACCGCGGCG TACGCATGTA TGCCGCAGGA CTCGGACCCA GCCGCGGCGC CGGCCTGCTC ATGCGCCAGG CTTACCTCGT CACCCGCACC AGCGAACGCC TTGACGTCGC CCGCCGCATG TACGCCAAAC GCTTCCCCGA CGACGACGTC ACCACCGCCA CCATGCAACA ACTCCGCGGC CGGGAAGGCG CCCGCATCAA AAAGATCTAC CGCGATCACG CCACCCGCAC CGGCGTCACC TGGAACAAAC GCGTCTACAC CCACGGCGAC CCCTTTGCCG ACAGCGACGA CATCAACCGG CTCCTGTCCG CCGGACACAG CTGCCTCTAC GGCATCTGCC ACGCCGCCAT CGTCGGCATC GGCGCCAGCC CCGCCCTCGG ATTCGTCCAC ACCGGAGCCG CCACCTCCTT CGTCCTCGAC ATCGCCGACC TGTACAAAGC CGACTACACC ATTCCCCTCG CCTTCGACCT CGCCGCCGCC GGCCTCACCG ACGAGCGCGA CATCCGCACC GCCTTCCGCG ACAAAGTCGC TGACGGGCAC CTCATGGCCC GCATCATCCA CGACATCAAA GACCTCCTCA TCGAGGAAGG AACCCGAGAT AACGACGAGG ACGCACTCCA CTTGTGGGAC GAGCTCGACG GCCACGTCCC CGGCGGCGTC AACTGGGCCG CCGACCTCGC CGACCAGACC GACGACACCA CCATCCTCGG TGTCACCGGA CCCGACACCG ACCAGCCACC CCCACCCTGG TGA
|
Protein sequence | MWWLSNPQDL HRVEDRVSTL YVEKCHVDRD DNAVVLVNKE RTVRVPAAFV ATVLLGPGTR ITSAAVRLLA DSGTALCWVG DRGVRMYAAG LGPSRGAGLL MRQAYLVTRT SERLDVARRM YAKRFPDDDV TTATMQQLRG REGARIKKIY RDHATRTGVT WNKRVYTHGD PFADSDDINR LLSAGHSCLY GICHAAIVGI GASPALGFVH TGAATSFVLD IADLYKADYT IPLAFDLAAA GLTDERDIRT AFRDKVADGH LMARIIHDIK DLLIEEGTRD NDEDALHLWD ELDGHVPGGV NWAADLADQT DDTTILGVTG PDTDQPPPPW
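
The sequence fields are printed in space-separated blocks of 10 characters, so they need cleaning before any calculation such as the GC content field. The sketch below shows one way this could be done in Python. The helper names are hypothetical, the input is a toy sequence rather than the gene above, and the stop-codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is TAA, TAG, or TGA."""
    return len(dna) % 3 == 0 and dna[-3:] in {"TAA", "TAG", "TGA"}


# Toy input (not the gene above), formatted like the record's sequence field.
toy = clean_sequence("ATGGGCGCCT GA")
print(len(toy))                    # 12
print(round(gc_fraction(toy), 2))  # 0.67 (8 of 12 bases are G or C)
print(ends_with_stop(toy))         # True
```

Applied to the Gene sequence field, the same functions should yield a value comparable to the reported GC content, though how the database rounds that figure is not stated in the record.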