Gene Information |
Locus tag | Franean1_5691 |
Symbol | |
ID | 5674017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6909713 |
End bp | 6910903 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244544 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001509947 |
Protein GI | 158317439 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
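The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, ends_with_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if ends_with_stop else codons


if __name__ == "__main__":
    length = gene_length_bp(6909713, 6910903)
    print(length, protein_length_aa(length))  # prints: 1191 396
```

Under these assumptions the result matches the listed gene length of 1191 bp and protein length of 396 aa.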
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGGT TCCGGCTTTA CCCGGCGTCC GAGCAGGCTG CTGTAATGGA GGCCCACTGC GGCCACGCAC GGTTTGTGTG GAATCTTGCC GTAGAACAGC AGTCGTGGTG GACGCCGCGA CGCGGGCCGG CGCCGGACTA CCACGAGCAG TCCCGGCAGC TCACCGAAGC GCGACGCGAG TTCCCGTGGC TCACCGAAGG GTCGCAGACC GTTCAGCAGC AGGCGTTGCG GGATTTTGCA CAGGCCATGG ACAACTACTT CCGGGGCAGT CACCGGAAGC CGACGTTTCG GAAACGTGGC CGGTCGGAGG GGTTCCGGAT CGTCGCTGTC AAATCCTCGG ATATTCGCAC GGTGAACCGG CGCTGGTCCG AGGTGAGGGT TCCCAAGGTC GGCTGGGTGC GTTTCCGTCG CTCGCGGACT GTCCCGAAGG CGAAGTCCTA CCGGGTCACC AGGGACCGGG CTGGCCGGTG GCATGTGGCG TTCGCCGCGA TCCCCGAACC GATCGACGCA CCGGGTACCG GTGCGACCGT CGGTGTCGAC CGTGGGGTGG CTGTGTCGGC GGCGCTGTCG ACGGGCGAAC TGCTGTCCTG CCCGAGACTG CCGCCCACAG AGGCGCAGCG GCTGGTCAGG CTGCAACGGC GACTTGCCCG GGCGAAGCGC GACAGCAACC GGCGCAGCCG TCTCAAGGCC CAGATCGCCC GGGTGAAGGC CCGTGAGGTG GACCGGCGGA AGGACTGGGT GGAGAAGACC AGTACCGACC TTTCCCGCCG ATTCGACCTG ATCCGCGTCG AAGACCTGAA GGTCAGGAAC ATGACCCGCT CGGCGCGGGG CACCGGGGAG GCACCGGGCA GGAACGTCCG CCAGAAAGCC GGCCTGAACC GGGCCATCCT GGCAAGCGGT TGGGGCCTGC TGGTGCAGCG CCTCGAGGAC AAGGCCCCCG GCCGGGTCGA GAAGATACCC GCCGCGTACA CCTCTCAGTG CTGTTCTGCC TGCAGGCATG TCGCTACCGA GTCGCGTGAG AGCCAAGCAC GATTCGCCTG CGTCGCCTGC GGATATGAGG ACAACGCCGA TGTGAACGCG GCTAGGAACA TCGCGGAGGG ACACGCCGTG ACTGCGCGGG GAGGCATCGG ACTGCCGAAG CCCATGAACC GCGAACCTCA ACTAACCGCA CCTCCTCCGG TCACTGCGTG A
|
Protein sequence | MSRFRLYPAS EQAAVMEAHC GHARFVWNLA VEQQSWWTPR RGPAPDYHEQ SRQLTEARRE FPWLTEGSQT VQQQALRDFA QAMDNYFRGS HRKPTFRKRG RSEGFRIVAV KSSDIRTVNR RWSEVRVPKV GWVRFRRSRT VPKAKSYRVT RDRAGRWHVA FAAIPEPIDA PGTGATVGVD RGVAVSAALS TGELLSCPRL PPTEAQRLVR LQRRLARAKR DSNRRSRLKA QIARVKAREV DRRKDWVEKT STDLSRRFDL IRVEDLKVRN MTRSARGTGE APGRNVRQKA GLNRAILASG WGLLVQRLED KAPGRVEKIP AAYTSQCCSA CRHVATESRE SQARFACVAC GYEDNADVNA ARNIAEGHAV TARGGIGLPK PMNREPQLTA PPPVTA
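The GC content and the relationship between the gene and protein sequences can be reproduced from the sequence text. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, the whitespace-grouped layout is taken from this record, and the set of initiation codons is the one commonly listed for translation table 11 (which would explain why the leading GTG appears as M in the protein).

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 shares these assignments with the standard code.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}

# Assumption: initiation codons for table 11; read as methionine at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    head = "GTGTCCCGGT TCCGGCTTTA CCCGGCGTCC"  # first 30 bases of the gene sequence
    print(translate_cds(head))        # prints: MSRFRLYPAS
    print(f"{gc_fraction(head):.2f}")  # prints: 0.70
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` gives values that can be compared against the listed 67% GC content and the 396-residue protein sequence.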