Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0020 |
Symbol | |
ID | 3903595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 28057 |
End bp | 29211 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637877350 |
Product | CRISPR-associated Cse4 family protein |
Protein accession | YP_479143 |
Protein GI | 86738743 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.451411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGCT ACATCGACGT CCACATCCTG CAGACCGTCC CTCCGTCCAA CCTCAACCGA GATGACGCCG GTACCCCGAA ACAAGCCGTC TATGGCGGGG TGAAGCGGGC CCGGGTGTCG TCCCAGGCCT GGAAGCGGGC GACGCGAACC GCCTTCGCCG ACCACATCGA TCAGGCCCAG CTCGGAACAC GCACCAAGCG GATCTCCGCG CTGCTCGCGG AACGGCTCGC AACCCGCTGC GCGCTCGACG CGGAAACCAG CACCCGGATC GCCACCAGCC TTCTGACCGC TCTGAAGATC AGTGCGGGGA AGAAGGCGGC GGAGACCGCC TATCTGCTGT TCTTTGGCCG TCCCCAGCTC GAACGGCTCA TCGACCTCAT TGTCGAGGAT GTGCCGCGCC TCGCCGATCT CAGCGACGGC GATCTGCTCG CCGCGGTCAA GGATGTGCCT GTCCTGGCTA CTCTCGGCAG CGACCATCCG ATCGACGTCG CGCTGTTCGG GCGGATGGTC GCCGACCTGG CGTCGTTGAA CGTCGACGCG GCCACCCAGG TCGCGCATGC CCTGTCCACC CATGCCGTCG ACGTCGAGTT CGACTACTAC ACCGCCGTTG ACGACCAGAA CGCCAAGGAC GAGACCGGCG CCGGGATGAT CGGCACGGTC GAGTTCCAGT CCGCGACGCT GTACCGGTTC GCCACCGTCG GCCTGCACCA GCTCGCCGAG AACCTCGGCG GTGACATCGA GGCGACCGTC GAGGCGCTAC GGGTGTTCCT CACCGCGTTC ACCACCTCCA TGCCGACCGG CCATCAGAAC TCCTTCGCCC ACCGCACCGT GCCGAACCTG CTCACCATCG CGATCCGCCC CGACCAGCCG GTCAACCTTG TCTCCGCGTT CGAGAAGCCG GTACTGCCCC GTGGCCGGGG CGTCCTCACC GGATCCCTCG AGCAGTTCGC CATCGAACTC AACAGCGCGT CGACGCTGTG GGGCCTCCAG CCCGACATCC TCGCCTCCAC CTACCGCGCC CCCGACGACA CCAACACCAA CACCGACACC ACGGCGATGA TCGTCAAGGC GCTCGGCGAG CCGAAGCCGT TCGACGAGGT TCTCGACACA GTGGTGGCTG CCGCCCGCGA CCGGCTCATG AGCAGCGTCC GATGA
|
Protein sequence | MRCYIDVHIL QTVPPSNLNR DDAGTPKQAV YGGVKRARVS SQAWKRATRT AFADHIDQAQ LGTRTKRISA LLAERLATRC ALDAETSTRI ATSLLTALKI SAGKKAAETA YLLFFGRPQL ERLIDLIVED VPRLADLSDG DLLAAVKDVP VLATLGSDHP IDVALFGRMV ADLASLNVDA ATQVAHALST HAVDVEFDYY TAVDDQNAKD ETGAGMIGTV EFQSATLYRF ATVGLHQLAE NLGGDIEATV EALRVFLTAF TTSMPTGHQN SFAHRTVPNL LTIAIRPDQP VNLVSAFEKP VLPRGRGVLT GSLEQFAIEL NSASTLWGLQ PDILASTYRA PDDTNTNTDT TAMIVKALGE PKPFDEVLDT VVAAARDRLM SSVR
|
| |