Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3344 |
Symbol | |
ID | 3904130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3968225 |
End bp | 3969112 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637880669 |
Product | CRISPR-associated Csh2 family protein |
Protein accession | YP_482430 |
Protein GI | 86742030 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family [TIGR02589] CRISPR-associated protein, Csd2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00748339 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.380655 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTCA ACCTTGATCC CGAGAAGAAG CACGACATGG TGCTGCTGTT CGACGTCACC GACGGCAACC CCAACGGTGA CCCGGACAAC GGAAACCGGC CCCGCACCGA CGACGAAACC GGCCATGGCC TGGTCACCGA CGTCGCGATC AAGAGGAAGG TCCGCGACAC CATCGGCCTG GCCGCCGAAG CCGAAGGCCT CGACCTGACC CGCTACCAGA TCTTCGTCGA AGCCGGCCAC GCGCTGAACA CCCGACTGGA AGAGTCCTAC CTCGTCAAGG GACTCGAACT CGGCAAGAAG ATCGACGATG CGAAAGCCGC GAAGGCCCGG GAATGGCTCG CCAACCGGTA CGTCGACATC CGCCTGTTCG GCGCGGTCCT GTCCACCGGC AAGACCCAGT CGCTGGGGCA GATCCGCGGA CCGATCCAGG TCGGCATGGC CCGGTCCCTC GACCCGGTCC TGCCCGTCGA CCATGCGATC ACCCGGGTCA CCCAGACCAC CCAGGCCGAC ATCGACAAGG GCGAACGCAC CGAGATGGGC GGCAAGTGGA CCGTCCCCTA CGGCCTGTAC CGGGCAGAGA TCCACTACTC GGCGCCCCGA GGCCGCCAGA CCGGTGTCAG CGCCGCCGAC CTCGACCTGT TCCTGTGCAC CCTGGTCAAC ATGTTCGACC ACGACCGGTC CGCGACCCGC GGCGAGATGG CCACCCGTGG CCTGTACGTG TTCAGCCACC ACAACGCCTT CGGCGTCGCA CCGGCCCACA CCCTCTCCGC CCGCATCACC GCCCGGAAGA TCTCCGCGGG TGAACCGCGC AGCTTCGGCG ATTACAAGAT CGACGTCGAT GACGCCGACC TGCCCGACGA CGTGGCCCTC ACCCGCGTGC TGGGATGA
|
Protein sequence | MAFNLDPEKK HDMVLLFDVT DGNPNGDPDN GNRPRTDDET GHGLVTDVAI KRKVRDTIGL AAEAEGLDLT RYQIFVEAGH ALNTRLEESY LVKGLELGKK IDDAKAAKAR EWLANRYVDI RLFGAVLSTG KTQSLGQIRG PIQVGMARSL DPVLPVDHAI TRVTQTTQAD IDKGERTEMG GKWTVPYGLY RAEIHYSAPR GRQTGVSAAD LDLFLCTLVN MFDHDRSATR GEMATRGLYV FSHHNAFGVA PAHTLSARIT ARKISAGEPR SFGDYKIDVD DADLPDDVAL TRVLG
|
| |