Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3767 |
Symbol | |
ID | 5672132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4466428 |
End bp | 4468122 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242648 |
Product | N-6 DNA methylase |
Protein accession | YP_001508068 |
Protein GI | 158315560 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCGC GGAGGAAGTC CGCTCAGGCC GGACCGCCCA CCATGGCCCA GTTGCGCGAG ACGCTGTGGA AGACGGCCGA CAAGCTGCGC GGCTCGATGG ACGCCGCCCA GTACAAGGAC TTCGTCCTCG TCCTGATCTT CCTCAAGTAC GTCTCGGACG CGTTCGCCGA GCGGCGCGAG CAGATCCGGC AGGACGTGCT CGCCGACGGG ATCGACGAGT CCCGGGCCGA GGAGTTCCTC GACGACGTCG ACGAGTACGC CGGGCAGGGT GTGTTCTGGG TGCCGGGGCG GGCCCGGTGG GAGCATATAG CGGCGAATGC GAAAAGCGCC GGCATCGGGG AACTGCTGAA CGCGGCGATG GACGCCGTGA TGAAGACGAA CCCGGCGCTC ACCGGCGTCC TCCCGCGGAT CTTCAACGGC GAGGGCGTCG ACCAGCACCG GCTCGGCGAG CTCGTCGACC TGCTCGGTGA CGCCCGCTTC ACCGGCCACC GGGCGACCGA GCGGCCCCCG TCCACGCCTA CTGGCGAGGA CGGTGCGCTT TTCGGCGAAT CCGCCGCCGG AGTTCCGACG GAAGCGGCGA CCCGGCCGGC GCGGGACGTG CTGGGGGAGG TGTACGAGTA TTTTCTGGAG AGGTTCGCCC GCGCCGAGGG CAAGCGCGGC GGTGAGTTCT ACACCCCGGC CAGCGTGGTG CGGTTGCTTG TCGAGGTGCT GGAACCCTAC GAGGGCCGGG TGTACGACCC GTGCTGCGGT TCGGGCGGCA TGTTCGTCCA GGCGGAGAAG TTCGTCGTCG CGCACCGAGG GCTCACCCAC TCCGGCGACA TCGCCGTCTA CGGCCAGGAG TCGAACGAAC GGACCTGGCG GCTGGCGAAG ATGAACCTCG CCATCCACGG GATCACCGGC GACCTGAGCG CCCGGTGGGA CGACACCTTC CGCAACGACC GGCATCCGGA CCTGCGGGCC GACTTCATCC TGGCGAACCC GCCCTTCAAC ATGTCCGACT GGGCGCGCAC CGTCGACGAC CAGCGCTGGC GGTACGGGAC CCCACCGACC GGCAACGCGA ACTTCGCCTG GCTGCAGCAC ATCATCGCCA AGCTGGGCTC CCGCGGGACC GCCGGCGTGG TGATGGCGAA CGGCTCGATG TCGTCGAAGC AGTCCGGCGA GGGTGAGATC CGCGCCGCGC TGGTCGAGGC CGACCTGGTG GCCTGCATGA TCGCGTTGCC GCCACAACTG TTCCGCACCA CCCAGATCCC GGCCTGCCTG TGGTTCTTCG CGAAGGACAA GGGCCAGCTG GGCGCCCGCT GGCTCGCCGA ACGGCGCGGC GAGACGCTGT TCATCGACGC CCGCGACATG GGCACGATGA TCGACCGCAC CGAGCGCATC CTCACCGACG GTGACCTCGA GAAGATTACC GACACCTACC GTGCCTGGCG TGGCGCGAAG TCGGCCCGCG ACAAGGGCCT GGCCTACGAG AACATTCCCG GCTTCTGCTA TTCGGCCTCC ACCGAGGAGA TCCGCACCCA CGACCACGTC CTGACCCCGG GCCGCTACGT AGGCGCCGCC GAAGCCGACA TTTCCAACGA CGAGCCGATG GCCGAGAAGA TCGAGCGTCT CGCCAAGGAA CTCTTCATCC ACTTCGAAGA GTCCGCTCGC CTGGAAAAGG AAGTTCGCAG CCAACTGGAG CGCCTCGATG CCTGA
|
Protein sequence | MPPRRKSAQA GPPTMAQLRE TLWKTADKLR GSMDAAQYKD FVLVLIFLKY VSDAFAERRE QIRQDVLADG IDESRAEEFL DDVDEYAGQG VFWVPGRARW EHIAANAKSA GIGELLNAAM DAVMKTNPAL TGVLPRIFNG EGVDQHRLGE LVDLLGDARF TGHRATERPP STPTGEDGAL FGESAAGVPT EAATRPARDV LGEVYEYFLE RFARAEGKRG GEFYTPASVV RLLVEVLEPY EGRVYDPCCG SGGMFVQAEK FVVAHRGLTH SGDIAVYGQE SNERTWRLAK MNLAIHGITG DLSARWDDTF RNDRHPDLRA DFILANPPFN MSDWARTVDD QRWRYGTPPT GNANFAWLQH IIAKLGSRGT AGVVMANGSM SSKQSGEGEI RAALVEADLV ACMIALPPQL FRTTQIPACL WFFAKDKGQL GARWLAERRG ETLFIDARDM GTMIDRTERI LTDGDLEKIT DTYRAWRGAK SARDKGLAYE NIPGFCYSAS TEEIRTHDHV LTPGRYVGAA EADISNDEPM AEKIERLAKE LFIHFEESAR LEKEVRSQLE RLDA
|
| |