Gene Information |
Locus tag | Franean1_3904 |
Symbol | |
ID | 5672265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4670047 |
End bp | 4671600 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242783 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001508200 |
Protein GI | 158315692 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
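The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon while Protein Length does not. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Franean1_3904.
length = gene_length_bp(4670047, 4671600)
print(length)                     # 1554, matching "Gene Length"
print(protein_length_aa(length))  # 517, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows of the record.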
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.112308 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGTGT ACCGGCCATG GCGCGGCCCG GCACCGGGCT ATGCGGAACA GAACCGGCAG CTGACCGAGG CCCGAGCGGA CAATCCGTGG CTGGCAGCGG GCAACATCAT CGTGCAGCAG CAGGCCCTGC GTGACTTCGC GACTGCGATG GCGAACTTCT TCCGCGGTTC GCATCGCAGG CCCACTTTCC GTAGGCGTGG GCGTGGTGAG GGGTTCCGGA TTGTGGCGGT GAAACCGGGC GACGTTCGGC GGGTGAATCG CCGTTGGGCG CGGGTGCGTG TCCCGAAGGT GGGCTGGGTG CGGTTCCGCT GGTCCCGTGC TGTGCCGGGC GCGAGGTCGT ATCGGGTGAC GCGGGATCGT GCGGGTCGCT GGCATGTGGC GTTCGCCGTG GCCCCGGACC CAATCCCCGC GCCGGGCATG AGCGGGGTTG TCGGGGTGGA CCGTGGGGTG GTGGTGTCGG CGGCGCTGTC GACCGGGGAG CTGCTGTCCT GTCCCGGTCT GCGACCCGGG GAACGGGCAC GGCTGCTCCG GTTGCAGCGC CGACTGTCGA AAGCCAGGCG CGGGTCGCGG CGGCGCGGCC GCGTCAAGGT TCAGATCGCA CGGTTGCGTG CCCGGGAGGT TGACCGGCGC AAGGACTGGG TCGAGAAGAC CAGCACGGAT CTCGCCCGCC GGTTCGACGT GATCCGCGTC GAGGACCTGA AGATCAGGGG GATGACCCGC TCGGCCCGGG GCACCGTCGA GGCGCCTGGG ACAAACGTGC GGCAGAAAGC CGGGTTGAAC CGGGGCATCC TCGCCCACGG CTGGGGTCTG CTCGTCCACC GGTTGGAGCA GAAGGCTCCC GGCCGGGTGG AGAAGATCCC AGCCGCGTAC ACGAGCCAGC GTTGCTCGGC CTGCGGGCAG GTGGCACCGG GGAACCGTGA GAGCCAAGCG GTCTTCCGGT GCGTTGCCTG CGGACACACG GCCAACGCCG ACGTCAACGC GGCGATGAAC ATCGCGGTTG GGTACATCGC GGCCGGACGG GCCGTGACCG CGCGGGAGGC ACGGCGTCGG CCGGGCCCGC GAACCGCGAA CCTCAACACC GCGCACCCCT TCCAGCGGGT GTGTAGGAGT TGGAATCCCC CGCGGTTCAC GCGGGGGAGG ACGTCAAGAG CACGTTTCGC GGCAGGGCAC GACTGGCGGA GCGAGCGTCG TCCTGCACGT GCCGCCCCCT TCGACCCGAA CGTCCGCCGA GCCCGACCTT GCTCTCCGTC CGTGGATGCG AAGATAGCAG CTCGTTTCAC CCGCATGTGC GGTCTTTTCG AAGTTCGGGG CCGGCTGGAG CGGAACGGCT CGGGTGACGC GGAACGGGCC GGGTGGCGCG GAACGGGCCG GATGCGCGCG GCGGATCGGT TCCGGCCGGG TGCCGCCGCG GATCGCCTCC GGTCGGGTCC GCGCCGGCGG CGGCCGGCTG TCCCCGCGGC CCGGAGGCCG CGGGGAGGTG TACCCGACGA GATGGGGGCA CGCGGGCCGC CCTGGCTCGT CAGGCGGCGG CGACGGCCAC CTCGGTCGAC TTGA
|
Protein sequence | MAVYRPWRGP APGYAEQNRQ LTEARADNPW LAAGNIIVQQ QALRDFATAM ANFFRGSHRR PTFRRRGRGE GFRIVAVKPG DVRRVNRRWA RVRVPKVGWV RFRWSRAVPG ARSYRVTRDR AGRWHVAFAV APDPIPAPGM SGVVGVDRGV VVSAALSTGE LLSCPGLRPG ERARLLRLQR RLSKARRGSR RRGRVKVQIA RLRAREVDRR KDWVEKTSTD LARRFDVIRV EDLKIRGMTR SARGTVEAPG TNVRQKAGLN RGILAHGWGL LVHRLEQKAP GRVEKIPAAY TSQRCSACGQ VAPGNRESQA VFRCVACGHT ANADVNAAMN IAVGYIAAGR AVTAREARRR PGPRTANLNT AHPFQRVCRS WNPPRFTRGR TSRARFAAGH DWRSERRPAR AAPFDPNVRR ARPCSPSVDA KIAARFTRMC GLFEVRGRLE RNGSGDAERA GWRGTGRMRA ADRFRPGAAA DRLRSGPRRR RPAVPAARRP RGGVPDEMGA RGPPWLVRRR RRPPRST
|
| |
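The gene sequence starts with GTG while the protein sequence starts with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) would give for an alternative start codon. The following is a minimal sketch of how the Sequence rows could be checked against the GC content and Protein sequence fields. It assumes that table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in the permitted start codons; the set of start codons listed and all function names are assumptions made for illustration, not something stated in the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for table 11; each is read as Met in the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence from the record.
prefix = "GTGGCCGTGT ACCGGCCATG"
print(translate_cds(prefix))          # MAVYRP
print(gc_content("GTGGCCGTGT"))       # 0.7
```

The translated prefix matches the first six residues of the Protein sequence row (MAVYRP). Passing the full Gene sequence string to the same functions would allow the whole protein and the reported GC content to be compared in the same way.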