Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1550 |
Symbol | |
ID | 5669953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1852692 |
End bp | 1853897 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240469 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001505895 |
Protein GI | 158313387 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.734191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.524569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGAGGT TTCGGTTGTA TCCCGATGCG GTGCAGGAAC AGGCCCTGTT GGTGCACTGT GGGCATGCTC GGTTCGTGTG GAATCTCGCG GTCGAGCAGC AGTCGTGGTA TCGGCCATGG CGGGGGCGGG CGCCGGGCTA TGCGGAGCAG AACCGGCAGT TGACCGAGGC CCGGTCGGCC AGTCCGTGGC TGGCGGCGGG CAGTGTCGTC GTGCAGCAGC AGGCTTTGCG TGACTTCGCG ACGGCGATGG GGAACTTCTT CCGTGGTTCG CATCGCAGAC CCACTTTCCG GAGGCGTGGC CATCACGAGG GGTTCCGGAT TGTGGCGGTG AAACCGGGCG ACGTGCGGCG GGTGAATCGC CGGTGGGCGC GGGTGCGTGT CCCGAAGGTG GGCTGGGTGA GGTTCCGCTG GTCCCGTGCT GTGCCGGGCG CGCGGTCGTA TCGGGTGACG CGGGATCGTG CGGGCCGCTG GCATGTGGCG TTCGCCGTGA CCCCCGCTCC GATCCCCGCG CCGGGCACCC ATGCGGTCGT CGGGGTGGAC CGTGGGGTGG TTGTGTCGGC GGCGCTGTCG ACCGGGGAAC TGCTGTCCTG TCCCGGTCTG AGAGCTGGGG AGCGGGCGCG GCTGGTCCGG TTGCAACGCC GGTTGTCCAG GGCCAGGTGT GGGTCCCAGC GGCGCCAGCG CCTCAAGGTG CGGATCGCAC GATCGCGGGC CCGGGAGGTT GACCGGCGCA AGGACTGGGT CGAGAAGACC AGCACCGACC TCGCCCGCCG GTTCGACGTG ATCCGCGTCG AGGACCTGAA GATCAGGCGG ATGACCCGCT CGGCTCGGGG CACCGTCGAG GCGCCGGGAA GCAATGTCCG GCAGAAAGCC GGATTGAACC GGGGCATCCT CGCCCAGGGC TGGGGTCTGC TCGTCCGCCG GTTGGAGGAG AAGGCCCCCG GCCGGGTCGA GAAGGTCCCC GCCGCGTACA CGAGTCAGCG TTGTTCGGCC TGCGGGCAGG TGGCGTCCGG GAACCGTGAG AGCCAAGCGG TCTTCTGGTG CGTGGTCTGC GGGCACACGG CCAACGCCGA CGTCAACGCG GCGGTGAACA TCGCGGTTGG GTACATCGCG GCTGGACGGG CCGTGACCGC GCGGGGAGGC GCGGCATTGG CCGGGCCCGT GAACCGCGAA CCTCAACACT GCGCACCTCT TCTGGTGGGT GTGTAG
|
Protein sequence | MSRFRLYPDA VQEQALLVHC GHARFVWNLA VEQQSWYRPW RGRAPGYAEQ NRQLTEARSA SPWLAAGSVV VQQQALRDFA TAMGNFFRGS HRRPTFRRRG HHEGFRIVAV KPGDVRRVNR RWARVRVPKV GWVRFRWSRA VPGARSYRVT RDRAGRWHVA FAVTPAPIPA PGTHAVVGVD RGVVVSAALS TGELLSCPGL RAGERARLVR LQRRLSRARC GSQRRQRLKV RIARSRAREV DRRKDWVEKT STDLARRFDV IRVEDLKIRR MTRSARGTVE APGSNVRQKA GLNRGILAQG WGLLVRRLEE KAPGRVEKVP AAYTSQRCSA CGQVASGNRE SQAVFWCVVC GHTANADVNA AVNIAVGYIA AGRAVTARGG AALAGPVNRE PQHCAPLLVG V
|
| |