Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1456 |
Symbol | |
ID | 5669860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1751686 |
End bp | 1753077 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641240376 |
Product | putative RNA methylase |
Protein accession | YP_001505802 |
Protein GI | 158313294 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1041] Predicted DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.270738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGGG GCGCTACCGG GGGTGCGCCT GCACCATGGC GCTGTGAGGG ATGCTGGATC CGGCGTGGGA GGACATGTAA CTTGGGGCTA ATCGAAGAGC CGGATGATCT AGATATCTCT AGCGACAGTG AGCGTCTGCT GGTCCACAGC CTCTTCCGTT TTCCGGCCAA ATTTCATCCG CCGGTGGCTC AAGCTCTTAT CCGTAATTTC TCGGAGCCGG GAGACCTGGT TCTAGACCCC TTCTGCGGAA GTGGAACGCT TCTAGCTGAG GCGGCCCGGA TGAGGCGGAG ATCTATTGGT ACTGACGTCG ATCCTGTCGC AGTGTCGGTG AGTTCTGCCA AAACTGGGCT CCTAGTGGAA AGTGAACTAC TCGAGGCTGC TTCAAGTTTG TTGGCCGCAC TTGACGAGAT TGCACCGCCG CAGGAGATAT ATGAATCACG GAAATTCGAA GATATCTCAC AGGAATCCTT GAAAGAAAAT CTTGATCGGG AATCCTTGTG GGTTCCCGAT ATTCCTAATC TGGATCATTG GTTTCGTCGG TATGTCACTC TTGACCTGGC CCGCATCTAC AGGTGTGTAA TAGATCTTCA GTGTAGTTCT GCGATCAAGA GCTATTTGCT GGTTGTATTT GCTTCCGTCA TTCGCAACGC GTCAAACGCG GACCCGGTCC CGGTGTCTGG TCTTGAAGTC ACAGCACATA TGAAAAGGCT CGATGCGGCA GGAAGAGTTG TCAACCCATA CTTTCTCTTT CGGAAAGCAA TGGGAAAGGC TGTTGCGGCT TCGAAGGAGT ATGGCGAGCA AGTCTCTCCG TACTTTGAGC CAAGGATCTG GGAAGCGGAC GCTACTTCCT TAGACCTCGA CTCCGAAATG TGTGATTTGG TAATTACTAG TCCGCCGTAT CATAATGCTG TCGATTACTA TCGGCGTCAT CAGCTTGAGA TGTTCTGGCT GAGGCATACC CGATCTCAGA AAGAAAGACT AGACCTACTT CCGAAATATA TTGGCCGCCA TCGAATACCT CGGAAGACCC CGATTCTCGC AACTGACGAG AAGTTGCCAG CACTTGCGGC ACACTGGGAA AGCGAGATGG CGCAAGCGAG TATTCAGCGC GCAGTGGATT TTCGTCACTA CGCACTCAGT ATGCGAAACG TATTTAGGCG ATTGTCGCAG GCGGTAAAGA TCGGGGGAAA GGTTGTTCTG GTTGTAGGCC GAAGCAGTTG GAACGGCGAC CGCATCCCGA CCGATGATCT TTTTGTCGAA CTTGCTTCCG AGAACTTCTC CTCGGTAAGA CTAATGTCAT ACCCAGTGAA AAATAGGTAC ATGAGCTACG CTCGGAGGAA TAGCGCTAGC ATCGACCGTG AATATGTGCT GGTGTTGCAA AGAAAGATCT GA
|
Protein sequence | MRRGATGGAP APWRCEGCWI RRGRTCNLGL IEEPDDLDIS SDSERLLVHS LFRFPAKFHP PVAQALIRNF SEPGDLVLDP FCGSGTLLAE AARMRRRSIG TDVDPVAVSV SSAKTGLLVE SELLEAASSL LAALDEIAPP QEIYESRKFE DISQESLKEN LDRESLWVPD IPNLDHWFRR YVTLDLARIY RCVIDLQCSS AIKSYLLVVF ASVIRNASNA DPVPVSGLEV TAHMKRLDAA GRVVNPYFLF RKAMGKAVAA SKEYGEQVSP YFEPRIWEAD ATSLDLDSEM CDLVITSPPY HNAVDYYRRH QLEMFWLRHT RSQKERLDLL PKYIGRHRIP RKTPILATDE KLPALAAHWE SEMAQASIQR AVDFRHYALS MRNVFRRLSQ AVKIGGKVVL VVGRSSWNGD RIPTDDLFVE LASENFSSVR LMSYPVKNRY MSYARRNSAS IDREYVLVLQ RKI
|
| |