Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2033 |
Symbol | |
ID | 5670434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2442856 |
End bp | 2445183 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240954 |
Product | N-6 DNA methylase |
Protein accession | YP_001506376 |
Protein GI | 158313868 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.599209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.628554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGTC CCGAGTGGGT GTCGGCCGCC GAGATCGCCC GGGGTGCCGG GGTGAAGCCG GCCGCGGTCA GCAACTGGAG ACGGCGGCAC GCGGACTTTC CGGCTGCCGA GCGGCACGGT GGCCGGGAGG TCTTCCGGGT TGGGGATGTC GCCGCGTGGC TCGACGGCCG TCCGGTCGCC GCCCCCGATC AGCGCCCCGG CGAGGAGCCG GGAGCCACCT ATGGCCACCG CTTCCGCCAG GCCGTCGGGG CCACCGAACC GGTCGCCGGG CCGTCGGCTG CTGCCGGCGT CCCCACGGGC ACCGCCGGCC GACTGTGGTG GGCGCTGGCC GACGCTTACA GAGGCAAGGT CGGGATGTCC GACGACCTGA TGCTGTTGAC GGGCCTCGCG TTCGCGTTTC TCCACCTCCG ATTCCACCGG GCACAGCGAT GGACGGAACT CACCGCAGCG GCTGCGGACG TAGAGCCGGG CACGGTGGTC CGGCTCGTCG AGTCCGCGCT GCGCGCGAGC GGGGAGGCGC CGGCCTCGAT CGCCGACATG TCGGCTCTGC TCGGAGACAC GTCCGACGTC CGCCGGCTGG TACCGCTGAT ACAGATCCTG GATCGACTCG AGCCGGTGCC CGATGCCGGC TCCGGCGACG TCGCGCCGCC CGCCGGCCGC ATCGCCGACG AGCTGCTGGC CCACGCCGCG ACCGTCGGCG GCTGGCGGTC CGCCAACGTT GTGACGCCCC CTTCCGTGGT CCGCACGGCG GTCCGTCTGA CGGACCCCGT CGCTGGCGAC CGGGTTCACG ACCCGTTCTG CCGCGCCGGT GAGTTCCTCG TCGGCGCCGC CGACCACATC AGATCCCGTG GCACCGGCAG CCCGAAGCTG ACCGTCAGCG GACAGGAGAT CAACCCCTCC CTCCGGTGGC TTGCCCGGAT GAACCTGCTT CTGCACAACC TCGGCGCCGA GGACCTGCGG GCCGGGTGGG CCCTTTCTTC TCCCGACCCA CAGCCGGGCG GCCCGTTCGA GGTGGTCCTG GTCAACCCGC CTTTCAACGT GTCGGGCTGG CGGGACGGCG ATCAGAATCC CGATTCATCC TGGCGCTACG GCGTGCCGCC CGGCCACAAC GCGAACTACG CCTGGCTGCA GCACGCGCTG GCCTGCCTGG CCGAGGGCGG TCGGGCCGTG GTCGTGATGC CCGCGGGCGC CGGTTCCTCC GCGAATCTGC AGGAGAGCGC CATCCGGGCC GCCATGGTCG AGGAGGGCGT CGTCGACGCG GTGGTTGCTC TGCCACCCCG GCTCTTCGTC AGCACCTCGA TCCCGGTAAC GCTCTGGGTA CTGCGCTGGC CGAGCCCCGG CCACGACGAC GTGCTGTTCG TCGATGCCCA CGGGGCCGGA AGGATCGTCG AGCGGAACCG TTCGGAACTG CGCGACGAGG ACGTCGACCA CATCGCCGAG GCGTACCGGA ACCGAGCGGC GCGGTCCACG GGGACGGTGC TGTCCCGGCC CGTCGACAGG CGGCGCATCC GGGAGAACGG GTATGCGCTC AGTCCGGCCC GCTATCTCAC GGCCACGACC GAACCCGTCG ATCCCCTACG GGCCCGGGTC GGGATCGAGC AGCTCCGTCG CGACCTGCGC GAGCTCCACC AGCGGGCAGC CGGGGCGCAT GAGCACGCGG AGCGTCAGCT GGACGAGGTG GGCGACCTGT TCGGCGCGGC CACCACTGGC TGGCGGCGAC TGCCCCTGGG CGACGTGTGC GACGTACTCG CGGGTTTCTC CGGCGCGGTC AGGACGGAAC GCGGGCTGCC TTCCGGTATT CCGGTCGTCA AACCGAGGAA TCTCGTGGAC AACCGCATCT CGCCGGAAGG CGTCGACTAT GTCGCGCCCG ACGTGGCGGC GAGAATGGAA CGGTACCGGC TGCGGGCGGG TGACATCGTC TGCGTGCGAA CCGGCCAGCT CGGCCGGCAG GCCCTGGTGA CCGAGGAGCA GAGCGGTTGG CTGATCGGCA CGTCCTGCCT ACGCCTACGC CCGGACGAAT CCGTTGATCC TCGTTATCTG GTCCACTTTC TGGCCCTTCC CCAGATCAGT GAATGGCTGC TCGGCCACTC CACCGGCTCG GCGATCCGGG TGTTGACCGC CGCGACTATG CGTGGGCTTC CCCTCGTCCT TCCGGACCGT CACCAGCAGG GCCGCATCGG CTCGGCGGCA GGTTCGCTGG ACGATCTGGT AGCGGTGCAC GACCAGATCC GTCAGGTCAG TTCCGCGCTC CGCGACGCGC TTCTCCCGTT GTTCCTCCAG GATCCGACAC CGCCGGGCCC TGTCCCCGAG GAGGGATCGA AGTCGTAA
|
Protein sequence | MTSPEWVSAA EIARGAGVKP AAVSNWRRRH ADFPAAERHG GREVFRVGDV AAWLDGRPVA APDQRPGEEP GATYGHRFRQ AVGATEPVAG PSAAAGVPTG TAGRLWWALA DAYRGKVGMS DDLMLLTGLA FAFLHLRFHR AQRWTELTAA AADVEPGTVV RLVESALRAS GEAPASIADM SALLGDTSDV RRLVPLIQIL DRLEPVPDAG SGDVAPPAGR IADELLAHAA TVGGWRSANV VTPPSVVRTA VRLTDPVAGD RVHDPFCRAG EFLVGAADHI RSRGTGSPKL TVSGQEINPS LRWLARMNLL LHNLGAEDLR AGWALSSPDP QPGGPFEVVL VNPPFNVSGW RDGDQNPDSS WRYGVPPGHN ANYAWLQHAL ACLAEGGRAV VVMPAGAGSS ANLQESAIRA AMVEEGVVDA VVALPPRLFV STSIPVTLWV LRWPSPGHDD VLFVDAHGAG RIVERNRSEL RDEDVDHIAE AYRNRAARST GTVLSRPVDR RRIRENGYAL SPARYLTATT EPVDPLRARV GIEQLRRDLR ELHQRAAGAH EHAERQLDEV GDLFGAATTG WRRLPLGDVC DVLAGFSGAV RTERGLPSGI PVVKPRNLVD NRISPEGVDY VAPDVAARME RYRLRAGDIV CVRTGQLGRQ ALVTEEQSGW LIGTSCLRLR PDESVDPRYL VHFLALPQIS EWLLGHSTGS AIRVLTAATM RGLPLVLPDR HQQGRIGSAA GSLDDLVAVH DQIRQVSSAL RDALLPLFLQ DPTPPGPVPE EGSKS
|
| |