Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4012 |
Symbol | |
ID | 3906973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4796627 |
End bp | 4797937 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637881341 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_483091 |
Protein GI | 86742691 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.848484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.567977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATG GACTCCGACA TATTTCCATA GAATCGTTCG GCGAGATCTT CCCGGGAAGA ATCTCAACCG TCGGCACAGA ATTCGAGATA CAGTCCGGCA TAACCCTTTC ACCACGGAGA ACGTCCGGCA GGAAAGATGC ACCCTACCTC CGCGTTGCGA ATGTACAACG CGGTCGTCTC ACACTAAGCG ATGTCGCATG GCTAGAGGCA TCAGCTCGCG AACGGATTAG GTACGCACTG GATGACGGAG ATCTACTCGT AGTTGAGGGG CACGCCAACC CAGCCGAGAT CGGAAGATGC GCCCAGGTGG GGCCAGAGTC GAAGAATTGC CTCTACCAAA ACCATCTGTT TAGATTGCGC CCAAGGAATC TTGAAGCGAG ATTTGCGCTA CACTGGCTGA ATTCCAGCTT TTCCCAGTCC TACTGGGGGA GAAACTGCGC CACAAGCTCC GGTTTGTATA CGATTAATTC TCGACAGCTG GGGGCACTTC CAATTCCGGT CCCACCGCCA GATAAACAAC GTAAGATTTC CGAGATCCTG GACGCGGCAG ACGAGGCGAT CCGTTCAACG GAGCGACTCG TCGGCAAGCT CGAACAGGTG TTCGACTCAT TGCGGGGCGA TCTACTTCAG GAGCATGTAA TTCGGTCGGG TCGACTTCCC GACTGCTGGC GGATGGACCG GCTAGACCGT CTGAGCGAGA TCACGGGAGG CGTAACGCTC GGCGGTGTTA CATCCGCTGG CCGTTCAGTC GAGCTTCCCT ACCTTCGGGT CGCAAACGTG CAAGATGGAT ATATCGACAC TACCGACATC AAGACGGTAA CCGTGCGAAC ATCGGAGTTT GATCGCTACC TGCTTCAAGC TGGAGACGTT CTCATGACGG AGGGAGGGGA CTTCGACAAG CTCGGGCGTG GTGCCGTCTG GGACGGGTCG ATTGACCCCT GCCTACACCA AAATCATATC TTCCGTGTTC GCTGCGACAA GATTCGCCTG CTCCCCGAGT ATTTGTCTAC CTACAGCGCA TCCACTGCAG GGCGCAGCTA CTTCATGGGC ATCTCGAAGC AAACTACCAA CCTGGCATCG ATCAACAAGA GTCAGCTATC CGCACTCCCC GTTCCACTAC CTCCACTGGC GACACAGAAA ATGATAATTG GATCACTGGG CGCTGCCGAA CGACAGATAT CCTCGACAAA GGCCGAGCTG GCGAAGTTGC GACTCGTCAA GCAGGGGCTG ATGGATGATC TGTTGATGGG GCGGGTTCAG GTGTCGGGGT TGCGGGATGT GTCGGATGCA GTGGATACGC TGGCGGTATG A
|
Protein sequence | MSDGLRHISI ESFGEIFPGR ISTVGTEFEI QSGITLSPRR TSGRKDAPYL RVANVQRGRL TLSDVAWLEA SARERIRYAL DDGDLLVVEG HANPAEIGRC AQVGPESKNC LYQNHLFRLR PRNLEARFAL HWLNSSFSQS YWGRNCATSS GLYTINSRQL GALPIPVPPP DKQRKISEIL DAADEAIRST ERLVGKLEQV FDSLRGDLLQ EHVIRSGRLP DCWRMDRLDR LSEITGGVTL GGVTSAGRSV ELPYLRVANV QDGYIDTTDI KTVTVRTSEF DRYLLQAGDV LMTEGGDFDK LGRGAVWDGS IDPCLHQNHI FRVRCDKIRL LPEYLSTYSA STAGRSYFMG ISKQTTNLAS INKSQLSALP VPLPPLATQK MIIGSLGAAE RQISSTKAEL AKLRLVKQGL MDDLLMGRVQ VSGLRDVSDA VDTLAV
|
| |