Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1205 |
Symbol | |
ID | 5669618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1440503 |
End bp | 1442290 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240137 |
Product | RNA modification protein |
Protein accession | YP_001505565 |
Protein GI | 158313057 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0621] 2-methylthioadenine synthetase |
TIGRFAM ID | [TIGR00089] RNA modification enzyme, MiaB family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00267373 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.652733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTCCTC GTCCTGCACG CCGGGTAGCT CTCGTCACGC TCGGGTGTTC CCGTAACGAG GTGGATTCCG AGGAGCTTGC CGCCCGCCTC GCCGCCGACG GCTGGGACCT GGTCGACGAC GCCGCCGACG CGGACGCCGT CCTGGTCAAC ACCTGCGGTT TCGTCGAGGC GGCGAAGAAG GACTCGATCG ACGCCCTGCT CGCCGCCGAC GCACTGCGCG GGCCCGGCGG CAACGGGCCC GGTGACGGTG AGTCCGGCGA TGACGGGCCG GGCGGCGGCG CGACCACCGG GCGGGCGACC GCGTCGGGCG GGCCGCGCGC GGTCGTGGCC GTGGGCTGCA TGGCCGAGCG GTACGGCCGG GAGCTCGCCG ACACGCTGCC CGAGGCGGAC GCCGTCCTCG GCTTCGACGC ATATCCGCGG ATCTCGGCGC ACCTCGACGC GGCGCTCGCC GGCTCGGCGC CGGCCTCGCA CACCCCGCGT GACCGGCGCA CGCTGCTGCC GATCTCTCCG GTGGAGCGGG GCGACGCCTC CCGCGCGGCG CACTCGCCGC ACATCCCTGG TCACATCCAG CTCCCCGGTG GTGGACGTGT CCTTCCGTCG CCGGACGGCC GACCGGCGGC GAACAGCCGC CCGGCAACGG ATCTGGCGGA CCTGGCCGGC CTGGCGGAGC CGGCGGGGGA GGGCGCGGCC GGTACCGGCC GGCGGCGGCT CACCTCCAGC CCGGTCGTGC CGCTGAAGCT GTCGAGCGGT TGCGACCGCC GTTGCGCCTT TTGCGCCATC CCGTCCTTCC GTGGCTCGCA CGTGTCCCGC CGGCCGGAGG AGGTACTGGC CGAGGCCGAG TGGCTGGCTG GGCAGGGCGC CCGCGAGCTT GTCCTGGTCA GCGAGAACTC GACGTCCTAC GGCAAGGATC TCGGCGATCT GCGGGCGCTC GAGAAGCTGC TGCCCCTGCT GGCCGCCGTT CCGGGGATTG TCCGTGTCCG CACTGTGTAT CTCCAGCCGG CCGAGCTGCG GCCGTCCCTG CTCGAGGTGC TGCTGACGAC GCCGGGCCTG GCGCCCTACC TTGACCTGTC GTTCCAGCAC GCCAGCCCGG CGGTGCTGCG GCGGATGCGC CGGTTCGGCG GCTCGACCGA CTTCCTGGAC CTGCTGCGGC GGGCCCGCGC GCTGCTCCCC GACCTGGGCG CCCGCTCGAA CGTGATCGTC GGCTTCCCCG GTGAGACCGA CGAGGACGTC GACATCCTGG TGAATTTTCT CGAGCGCGCC GACCTCGACG CTGTCGGGGT GTTCGGCTAC TCCGACGAGG AGGGGACGGA GGCCGCCGGG ATGGCGGGCC ACGTCGACCC CGAGGAGATC GAGAGCCGGC GGGCCGAGGT CACCGACCTC GTCGAGCAGC TCACCGCAGC CCGCGCCGAG CGCCGCATCG GCACGACCGT CGAGGTGCTC GTCGAGGAGG TGGCCGGCGG TCTCGGGTAC GGCTGCGCCG GGCACCAGCA GGCCGACGCC GACGGCTCCT GCACGGTCCG CCTGCCCGCG GGCGGGCCAC CGGGCGGGGT GTCCGTCGGG GACCTCGTCG AGGCCCGGGT CGTGGCGGCC GAGGGCGTCG ACCTGATCGC GGAGTTCACC GGCGTGCTCG ACCGTGCCGG CGCCGGGCTG GCCAGCGCTG GGTCAGCCGG TGCCGGGTCG GTCGGTGCCG GGTCGGTCGG TTCGGCCGGG GCGGGTTCCG TGCTGCCGCC GATCCCGGAC GGGGTCGGCC CGATGGACGC GGTGGGCCAC CCTTCGGGCG TGGCGTGA
|
Protein sequence | MSPRPARRVA LVTLGCSRNE VDSEELAARL AADGWDLVDD AADADAVLVN TCGFVEAAKK DSIDALLAAD ALRGPGGNGP GDGESGDDGP GGGATTGRAT ASGGPRAVVA VGCMAERYGR ELADTLPEAD AVLGFDAYPR ISAHLDAALA GSAPASHTPR DRRTLLPISP VERGDASRAA HSPHIPGHIQ LPGGGRVLPS PDGRPAANSR PATDLADLAG LAEPAGEGAA GTGRRRLTSS PVVPLKLSSG CDRRCAFCAI PSFRGSHVSR RPEEVLAEAE WLAGQGAREL VLVSENSTSY GKDLGDLRAL EKLLPLLAAV PGIVRVRTVY LQPAELRPSL LEVLLTTPGL APYLDLSFQH ASPAVLRRMR RFGGSTDFLD LLRRARALLP DLGARSNVIV GFPGETDEDV DILVNFLERA DLDAVGVFGY SDEEGTEAAG MAGHVDPEEI ESRRAEVTDL VEQLTAARAE RRIGTTVEVL VEEVAGGLGY GCAGHQQADA DGSCTVRLPA GGPPGGVSVG DLVEARVVAA EGVDLIAEFT GVLDRAGAGL ASAGSAGAGS VGAGSVGSAG AGSVLPPIPD GVGPMDAVGH PSGVA
|
| |