Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0193 |
Symbol | |
ID | 3903220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 227306 |
End bp | 228556 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637877524 |
Product | type I restriction-modification system specificity determinant |
Protein accession | YP_479313 |
Protein GI | 86738913 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.311495 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGCTG AGAGGCTCGA CCTCGTCGAC TCCGGTGTGT CCTGGCTCGG GAAGGTCCCG CCGCACTGGA CCACAAAGCC ACTGTGGTCC ATGTTCGAAC GCATCAAGGA TGTGGACCAT CCGGAAGAGC AGATGCTATC CGTCTTTCGC GAGTACGGCG TCGTGGCGAA GGACTCGCGC GACAACATCA ACAAGACTGC CGAGAACCGC AGCATCTACC AGCTCGTCCA CCCAGGCTGG CTGGTCGCCA ACCGGATGAA GGCATGGCAA GGTTCGGTTG GCATCTCCTC GCTCCGCGGG ATCGTTTCCG GTCACTACAT CTGCTTCGCG CCGCGCCATA GTGAGGACGC CCGCTACCTC AACTGGCTCC TTCGCTCGAC CACCTACACC AACGGCTACG CACTGCTCTC GCGCGGAGTG CGCATCGGTC AGGCCGAGAT CGACAACGAC GAGTTCCGGC TGATGCCGAT CCTGCTCCCG CCGCTAGGGG AGCAGCGCGC CATCGCCGAC TACCTCGACC GCGAGACCGC CCGCATCGAC ACGCTCATCG AGGAACAGCA GCGTCTGATC GAAATGCTCC GTGAGCGGCG CCGGGCAGTC GCTTTGCACG CGATCGATCA GCAGATCCAT GCGGGCGCGA CGACAGACAA ACTAGGTCGC TCTACTCGAA TCGGCAACGG GTCAACTCCG AGGCGTGAGA CTGCCAGCTA TTGGCGTGAC GGAGAGTTCC CGTGGCTGAA CAGTTCCGCT GTCAACGAGT CTCGTGTCAC GCACGCCGAC CAATTCGTGA CCGACATCGC ACTCTATGAA TGTCACTTGC CAGTCGTTGC TCCAGGCTCG GTCTTGGTGG GTCTGACTGG CCAAGGAAAG ACACGCGGAA TGGCAACGCT TCTTGAGATC GAGGCGACGG TGAACCAGCA CGTCGCGTAC ATAGCGCCTG ATCGAGGCAC ATGGTTGCCG GAGTACCTCC TGTGGTCGCT CAGGGCGTCG TATGACGACC TCCGACGCTT GAGTGAAGAG AACGGCAGTA CGAAGGGTGG GCTCACTTGC CAGGCGCTAA AGCAATATCG GCTTGCTGTA CCACCCCTCG ATGAGCAGCG TCGCGTTGCC GCTTACCTTG ACGAGCAGAC TGCGAAGATT GACTCGCTGA TCGGCGAGAC CGAGCGGTTC ATCGAGCTGG CCCGTGAGCG GCGGGTGGCG CTGATCACGG CGGCGGTGAC GGGGCAGGTC GATGTGCGGG GGATGGTCTG A
|
Protein sequence | MIAERLDLVD SGVSWLGKVP PHWTTKPLWS MFERIKDVDH PEEQMLSVFR EYGVVAKDSR DNINKTAENR SIYQLVHPGW LVANRMKAWQ GSVGISSLRG IVSGHYICFA PRHSEDARYL NWLLRSTTYT NGYALLSRGV RIGQAEIDND EFRLMPILLP PLGEQRAIAD YLDRETARID TLIEEQQRLI EMLRERRRAV ALHAIDQQIH AGATTDKLGR STRIGNGSTP RRETASYWRD GEFPWLNSSA VNESRVTHAD QFVTDIALYE CHLPVVAPGS VLVGLTGQGK TRGMATLLEI EATVNQHVAY IAPDRGTWLP EYLLWSLRAS YDDLRRLSEE NGSTKGGLTC QALKQYRLAV PPLDEQRRVA AYLDEQTAKI DSLIGETERF IELARERRVA LITAAVTGQV DVRGMV
|
| |