Gene Information |
Locus tag | Francci3_4011 |
Symbol | |
ID | 3906972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4793298 |
End bp | 4796630 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881340 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_483090 |
Protein GI | 86742690 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
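The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Francci3_4011.
length_bp = gene_length(4793298, 4796630)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "3333 1110"
```

Under these assumptions the result agrees with the listed gene length (3333 bp) and protein length (1110 aa).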
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.724918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGGG CTGCCGAGTT CGAGCTGGTT GAGCGTCCGC TTGTCGATCA GCTGCGCGGG TTGGGCTGGG CGTATCTTGC CGGGACGAAG CACGCCCCGG CCGCGAGCGG TCGGGCGTCG TTCTCTGAGG TGCTGTTGGA GGGTCGGCTG CACGCGGCGC TGCGGCGGAT CAATCCAGGG CCGGACGGCG AGTTGTGGCT TGATGATGGC CGGCTCGCGC AGGCCGTTGC CGCGCTGACT CGGCCGAAGG CGACCAGGCT GGTCGAGATC AACGAGGAGT TGACGGAAGC GCTGTTGCTC GGTGTGCCCG TCGAGGGGAT TCCGGAGGTC GATGGTGGTC GGAACCGTCG GGTTCGGTTC ATCGACTGGG ACACGCCGTC GAACAACGAC TTTCTGGTGG TGAACCAGTT TCGGGTGGAC ATCCCGGGCA GCCAGGGGCG CCGCTACATC GAGCCGGACA TCGTGCTGTT CGTCAACGGC ATCCCGCTGG TGGTCGTGGA GGCGAAGAGC CCCACCACGT CCTCGCCGGT GGCGAAGGCG ATCCGTCAGC TGGCCCGGTA CGCCGACCAG CGCGGGGCTG TCACCCGCGA GGGCAACGAG CGGCTATTCT ACACCAACCA GTTCGTCGTC GCCACGAGCT GGGACGAGGC GAAGGTCGGC ACGTTCACCT CGGACCCGGA TCACTTCGCC GAGTGGAAGA CAACCGAGCC GACTCCGGAG ACCGAGGTCG CCGAAGCACT CGGCGTCGAC GGGCTGACCA GCCAGCAGCG GCTCGTCGCC GGCATGCTCG CCCCGGAGCG GCTGTTGGAC ATCGTCCGGC ACTTCACGCT ATTCATGCCC GCGCAGGCCG GGCGGACGAT GAAGATCGTC GCCCGCTACC AGCAGTACCG GGCGGTCCGG CGCACCATCC ACCGCCTGAG CACCGGCAAG ACCCGGGTGG CCGACGGCGA GTTCGACCGG CGCGGCGGGA TCATCTGGCA CACCCAGGGC TCGGGCAAGT CGCTGACCAT GGTGTTCCTG ATCCGGGTCA TGCGGACGCA TCCCGATCTT GTCGGTTTCA AGGTCGTCGT GGTCACCGAC CGGAGGGACC TCCAGGACCA GCTCGCGAAG ACCGCCGAAC TCACCGGAGA GACACCCCGC ACGGCGAGGA ACGTCCAGCA GGTTCGCTCG CTGCTGTCGG TTCCGGGCAA GGCGCTGCTT TTCGCGATGA TCCAGAAGTA CCGGAACCCG GACGCGGCGA AGGACGCCCC GCTGGAGGTG AAGTCACTCG GAGTGCTGGA CCCGTCCGAG GGTGTCGTCG TGCTCGTGGA CGAGGCGCAC CGCTCGCACA GCAGCACTCT GCACTCCGTG CTGCTGGACG CGTTGCCGAA CGCGGCACGG ATCGGGTTCA CCGGGACGCC GATCATCATG GGTAAGCGGA AACGCACCCA TGCGATCTTC GGCCCGTATC TGGACCGGTA CACCATCAGC GAGTCTGAGG CGGATGGCGC GACCGTGCCG ATCCGCTACG AGGGCCGTAC GACGAAGAGC GACGTCCAGG ACTCGGCGGA CCTCGACGAG CTGTTCGAGG ACATGTTCCC CGACCTCACC GATGCGCAGC TGGCGAAGAT CCGACGTCGA TGGGGAACCA TCGGCAACGT GCTGGAAGCG GAGAAGATGA TCACGGCGAA GGCCCGCGAC ATGCTCCGCC ACTATGTCGA CACAGCCTTG CCCGGCGGGT TCAAGGCGCA GATCGTCGCG ACCAGCCGGC TCGCCGCGGT GCGCTACCGC GACGCCCTGC TGGCCGCTCG TGATGATCTC 
GTCGCCCGGC TCGATGCACT GCCCGCCGAG TTGCGCACCC CCGAGGCCGC GGACCGGGCC ACCTCCCCCG AGGGCATCCC CGGCCTCGGG CGCGCCCGGC TGCGCGACGT CCGGGCCTGG CCGTACCGCG ACCTGATCGC TCGGCTCGAT TTCGTCCCGG TCATCTCTGG TGAGCAGAAC GAGAAGGACT GGCGCGACTG GATCGACGAG ACCCGGCAGA AGGTGGTCAT CGAGGAGTTC AAAAAGCCGC TGCCCGCACC CGATGACCCG GCACCGGACC CGGCCACGAC CAGCACGGTT GCCTTCCTGC TGGTTAAGTC GATGCTGCTG ACCGGCTTCG ACGCCGCCGT CGACCAGGTG ATCTACCTCG ACCGACGGAT CAAGGAAGCT GAACTGCTCC AGGCCATCGC TCGCGTGAAC CGCACCGCTC GGGGGAAGGC CAACGGCTAC GTCGTCGACT ACTTCGGCGT CGCCAAGCAT CTGCACGCCG CGCTGGAAGC GTATGCGGCC GAGGACATCG ACGGCGCGCT GGCGAGCATC ACCGACGAGC TTCCACTGCT GGCCGACCGG CACGCCAGGG TCCGCGCGCT GTTCACAGAC CGCGGCCAGG AGCGTTTCGA CACCCCGGCA GATCAGGAAG CCTGCGTCCA GATCCTGGCC GACGACGCGC TGCGGGCCGC GTTCCAGGTC GCCTACCGGG CGCTGACCCG CAGCCTGGAG ACGGTCCTGC CACGACCGGA AGCGCTGCCC TATGCGGCCG ACACCAAGGC GTTCGGAGTG ATCGGCCTGC TCGCACGGCG ACGCTACGAC CGCGATGATC CGGACTTCGA CGTCTCCGTT TACGGCGAAA AGGTACGCCG CCTGATCGAC GACCACATCG TCGCGCTCGG CATCAGCCAG AAGATCCCAC CTGTGTCGCT CACCGACGCC CGCTTCGACG AGAAGGTCGG CGGGCTGACG TCGAAGCGGG CGAAAGCGTC GGAGATGGAG CACGCCCTGC GCCACCACAT CAGCGGCATG CTCGACGCCG ACCCGGTTCG CGCGCGCACC CTGTCCCAGC AGCTGTCGGA GATCCTCGAC CGGCTCCGTG ACCAGTGGGA CCAGCTCGTC GCCGAACTCG GCGACCTGAT CGACAAGGCC CGCGCGGGAC GGACCACCGC CGAGGATCCC GACGACGCGC CCGACGGTGT CCAGCTGACT CCGATCGAAC GCCTCTTCTT CGACATCCTA CGCGCCGAAC GGGTCGCCGA GGGCAAGGAG ATGACACCAG TCGCCATCGA GGCCGTCGCC GAGCTCGTCA CCTTCATCGT CGATCATCTG TGTCGGGAGA TCGGGAGGGT GCACTTCTGG GGGAACGCCC ACGCCCAGCA ACGCCTGCAC AGCTGGATCA CGGTAGCGGT CAGCGACGTG TCCATCGACG GCGACGACCT GTTCGACCAG GACCGGGCGG AGGCCATCGC CGACCAGATC GTCGAACTGG CACGGCACAA CCACGCTGCG GTTGTCACGA CAGTCCCGCC CACGGTCAGC TGA
|
Protein sequence | MTGAAEFELV ERPLVDQLRG LGWAYLAGTK HAPAASGRAS FSEVLLEGRL HAALRRINPG PDGELWLDDG RLAQAVAALT RPKATRLVEI NEELTEALLL GVPVEGIPEV DGGRNRRVRF IDWDTPSNND FLVVNQFRVD IPGSQGRRYI EPDIVLFVNG IPLVVVEAKS PTTSSPVAKA IRQLARYADQ RGAVTREGNE RLFYTNQFVV ATSWDEAKVG TFTSDPDHFA EWKTTEPTPE TEVAEALGVD GLTSQQRLVA GMLAPERLLD IVRHFTLFMP AQAGRTMKIV ARYQQYRAVR RTIHRLSTGK TRVADGEFDR RGGIIWHTQG SGKSLTMVFL IRVMRTHPDL VGFKVVVVTD RRDLQDQLAK TAELTGETPR TARNVQQVRS LLSVPGKALL FAMIQKYRNP DAAKDAPLEV KSLGVLDPSE GVVVLVDEAH RSHSSTLHSV LLDALPNAAR IGFTGTPIIM GKRKRTHAIF GPYLDRYTIS ESEADGATVP IRYEGRTTKS DVQDSADLDE LFEDMFPDLT DAQLAKIRRR WGTIGNVLEA EKMITAKARD MLRHYVDTAL PGGFKAQIVA TSRLAAVRYR DALLAARDDL VARLDALPAE LRTPEAADRA TSPEGIPGLG RARLRDVRAW PYRDLIARLD FVPVISGEQN EKDWRDWIDE TRQKVVIEEF KKPLPAPDDP APDPATTSTV AFLLVKSMLL TGFDAAVDQV IYLDRRIKEA ELLQAIARVN RTARGKANGY VVDYFGVAKH LHAALEAYAA EDIDGALASI TDELPLLADR HARVRALFTD RGQERFDTPA DQEACVQILA DDALRAAFQV AYRALTRSLE TVLPRPEALP YAADTKAFGV IGLLARRRYD RDDPDFDVSV YGEKVRRLID DHIVALGISQ KIPPVSLTDA RFDEKVGGLT SKRAKASEME HALRHHISGM LDADPVRART LSQQLSEILD RLRDQWDQLV AELGDLIDKA RAGRTTAEDP DDAPDGVQLT PIERLFFDIL RAERVAEGKE MTPVAIEAVA ELVTFIVDHL CREIGRVHFW GNAHAQQRLH SWITVAVSDV SIDGDDLFDQ DRAEAIADQI VELARHNHAA VVTTVPPTVS
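The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def join_blocks(grouped: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
example = "ATGACCGGGG CTGCCGAGTT"
seq = join_blocks(example)
print(len(seq), gc_fraction(seq))  # prints "20 0.65"
```

Applying the same two helpers to the complete gene sequence would give a value to compare against the GC content reported in the Gene Information table.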