Gene Information |
Locus tag | Franean1_3502 |
Symbol | |
ID | 5671873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4164164 |
End bp | 4165384 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242390 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001507810 |
Protein GI | 158315302 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
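The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete, unspliced, and ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4164164
END_BP = 4165384
REPORTED_GENE_LENGTH = 1221     # bp
REPORTED_PROTEIN_LENGTH = 406   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length for a complete, unspliced CDS with one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches:", length == REPORTED_GENE_LENGTH)
    print("protein length matches:", protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

The strand does not affect either calculation: a gene on the minus strand spans the same interval, and only the reading direction changes.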
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGTACG GAGCGGGCCC GGCGAGGCGG GCGTTCAGGT TCCTGCTGCG CCCGACCATG CGGCAGGCCG CCGCGCTGAC GGCGATGCTC GATGACCATC GGGCGCTGTA CAACGCCGCG TTGCAGGAAC GACGCGACGC CTACCGGCAT CCGTCGAGGG TGACGGTTCG CTACGGCGAC CAGTCCGCGC AGCTCAAGGA GATCCGCGCC TGCGACCCGG ATCAGGGGCG CTGGTCGTTC TCCTCCCAGC AGGCCACCCT GCGCCGGCTC GACAAGGCGT TCGCCGGCTT CTTTCGCCGC GTCAGGGCAG GCCAGACCCC CGGCTACCCG CGGTTCAAGG GCGCGGGCCG GTTCGACACG GTCGAGTGGC CGAAGGACGG GGACGGCTGC CGGTGGAACT CCCAGCCCGA GCATCCCACC CGGACCCGGG TTCGGCTTCA AGGTGTCGGT CACGTCAAGG TTCACCAGCA TCGGCCGGTG GCGGGCACGG TCAAGACGGT CTCGGTGCGG CGGGAAGGCC GCCGCTGGTA TGTGGTCCTC TCCTGCGACG ACGTGCCCGC GCGGCCGCTG CCTGCCACCG GGGTGGTGGT GGGGGTGGAT GTGGGTGTGG CGTCGCTGGT GACCCTCTCT GATGGCCGTC AGGTCGGTAA CCCGCGTTTT CTCGCCGCGG CGGCCGGTCG GCTCGCGCGT GCGCAACGGG AACTGGCCCG TAAGAAGCGG GGGTCGACCC GGCGCCGGAA GGCCGTCGCG AAGGTCGCCG CGCTGCACGG CAGGGTTCGC CGGCAGCGCC TCGACCTCGC GCACACGGTC GCCCGCGATC TTGTCCGCGA CCATGATCTG ATCGCTGTGG AAGCGTTGCG GGTCGTGAAC ATGACTCGCC GGGCCGCGCC GAGACCCGAC CCCGACCGGC CCGGAGTGTT CGTGGCGAAC GGGCAGGCGG CGAAGTCCGG GCTGAACAGA AGCGTTCTCG ACGCGGGATG GGGGGTGTTC CTCGCTGTGC TGCGTGCCAA GGCTGAAAGT GCCGGACGGG TGGTCGTCGA GGTCAACCCC GCCAACACCT CCCGCACCTG CGCGGTCTGC GGGCACTGCC ACGCCGACAA CCGCAGAACA CAGGCCGCGT TCACCTGCGT CGCGTGCGGG CATGCCGCGC ACGCCGACGT GAACGCGGCG GTCAACATCC TTCGGGCCGG GCTGGCCCGT CAGGCCACCG AAGCGGCCTG A
|
Protein sequence | MAYGAGPARR AFRFLLRPTM RQAAALTAML DDHRALYNAA LQERRDAYRH PSRVTVRYGD QSAQLKEIRA CDPDQGRWSF SSQQATLRRL DKAFAGFFRR VRAGQTPGYP RFKGAGRFDT VEWPKDGDGC RWNSQPEHPT RTRVRLQGVG HVKVHQHRPV AGTVKTVSVR REGRRWYVVL SCDDVPARPL PATGVVVGVD VGVASLVTLS DGRQVGNPRF LAAAAGRLAR AQRELARKKR GSTRRRKAVA KVAALHGRVR RQRLDLAHTV ARDLVRDHDL IAVEALRVVN MTRRAAPRPD PDRPGVFVAN GQAAKSGLNR SVLDAGWGVF LAVLRAKAES AGRVVVEVNP ANTSRTCAVC GHCHADNRRT QAAFTCVACG HAAHADVNAA VNILRAGLAR QATEAA