Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3354 |
Symbol | |
ID | 5671725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3970817 |
End bp | 3972028 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242242 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001507662 |
Protein GI | 158315154 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGTGT CCAGGTTCCG GTTGTATCCC GACGCGGCGC AGGAAGAGGC TCTGCTGGTG CACTGTGGGC ACGCCCGGTT CGTGTGGAAC CTCGCGGTCG AGCAACAGTC GTGGTACCGG CTGTGGTGTG GTCGGGCGCC AGGTTATGTG GAGCAGAACC GGCAGTTGAC CGAAGCCCGG TCGGATAATC CATGGCTGGC GGCGGGCAGT GTCATCGTGC AGCAGCAGGC TTTGCGTGAC TTCGCGACGG CGATGGCGAA CTTCTTCCGC GGTTCGCATC GCAGGCCCAC CTTCCGTAGG CGTGGGCGTG GTGAGGGGTT CCGGATTGTG GCGGTGAGAC CGGGCGACGT CCGGCGGGTG AATCGTCGGT GGGCGTGGGT GCGTGTCCCG AAGGTGGGCT GGGTGCGGTT TCGCTGGACT CGTGAGGTGT CGGGTGCGCG GTCGTATCGG GTGACGCGGG ATCGTGCGGG CCGTTGGCAT GTCGCGTTCG CCGTGGCCCC GAATCCGATT CCCGCGCCGG GAACCGAGAA GGTTGTCGGT GTGGACCGTG GGGTGGTGGT GTCGGCGGCG CTGTCGACCG GGGAGCTGCT GTCCTGTCCC GGTCTGCGAG CCGGGGAGCA GGGGCGGCTG GTCCGGTTGC AGCGCCGATT GTCGAGGGCC AGGCGTGGGT CGCGGCGGCG CGGGCGCGTC AAGGCCCGGA TCGCACGGCT GCGTGCCCGG GAGGTTGACC GGCGCAAGGA CTGGGTCGAG AAGACCAGCA CGGATCTCGC TCGTCGGTTC GACGTGATCC GGGGCGAGGA CCTGAAGATC AGGGGGATGA CCCGCTCTGC CCGGGGCACC GTCGAGGCGC CGGGAAGCAA CGTCCGGCAG AAGGCCGGGT TGAACCGGGG CATCCTCGCC CACGGTTGGG GTCTGCTCGT CGCACGGTTG GAGCAGAAGG CCCCCGGCCG GGTGGAGAAA GTCCCCGCCG CGTACACGAG CCAGCGTTGC TCGGCCTGCG GGCATAGGGC GCCCGGGAAC CGCGAGAGCC AAGCGGTCTT CCGGTGCCTG GCCTGCGGGC ACACGGCCAA CGCCGACGTC AACGCGGCTA TGAACATCGC GGTTGGGAAC ATCGCGGCCG GACGGGCCGT GACCGCGCGG GGAGGCACGG CGCTGGCCGT GCCCGCGAAC CGCGAACCTC AACACCGCGT ACCCCTTCCG GTGGGTGTGT AG
|
Protein sequence | MVVSRFRLYP DAAQEEALLV HCGHARFVWN LAVEQQSWYR LWCGRAPGYV EQNRQLTEAR SDNPWLAAGS VIVQQQALRD FATAMANFFR GSHRRPTFRR RGRGEGFRIV AVRPGDVRRV NRRWAWVRVP KVGWVRFRWT REVSGARSYR VTRDRAGRWH VAFAVAPNPI PAPGTEKVVG VDRGVVVSAA LSTGELLSCP GLRAGEQGRL VRLQRRLSRA RRGSRRRGRV KARIARLRAR EVDRRKDWVE KTSTDLARRF DVIRGEDLKI RGMTRSARGT VEAPGSNVRQ KAGLNRGILA HGWGLLVARL EQKAPGRVEK VPAAYTSQRC SACGHRAPGN RESQAVFRCL ACGHTANADV NAAMNIAVGN IAAGRAVTAR GGTALAVPAN REPQHRVPLP VGV
|
| |