Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6840 |
Symbol | |
ID | 5675153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8340332 |
End bp | 8341522 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245689 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001511080 |
Protein GI | 158318572 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.643066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCT ACCGGCTCGA CCCGGCACCC GAACAGGTCG CTGGAATCGA GGAGCACTGC GGGCACGCGC GTTTTGTTTG GAATCTTGCG GTCGAACAGC AGTCGTGGTG GAAGCCCGGC CAGGGAAGGG CGCCGAACCA TGCGGAGCGT TGCCGGCAGT TGACCGAAGC GCGGGCAGAG TTCGAGTGGC TGCGAGCCGG TTCGCAGACC GTTCAGCAGC AGGCGCTTCG AGACTTCGAC CAGGCTATAC GGAACTTCTT CAACGGCTCG CACCGCCACC CGACCTTTCG GAAACGTGGC CGGTCCGAGG GTTTCCAGAT CGTAGGGAAG AACGCACGGG TCGAGAAGCT GAACCGGAAG TGGTCCCAAT GCTGGATCCC GAAGGTCGGC TGGGTGAAGT TCCGCGTGTC GCGGATAATC CCGGACTTCA GGTCGTACCG GGTGACCCGG GACCGGGCGG GCCGCTGGCA TGTGGCGTTC GCCGTAGCAC CCGACCCGGT CCCCGCGCCA GGAACCGGTG AGGTCGGCGA GGTCGTCGGT GTGGACCGGG GTGTCGCCGT GTCCGCGGCG CTGTCCACCG GGGAGCGGCT GTCCTGCCCG ACGCTGAGGC CGAAGGAAGC CGAACGACTC CGCCGGCTCC AGCGCCGGCT GGCGAAAGCC ACACGCGGGT CGAACCGGCG CGGCCGGCTG AGGACCCAGA TCGCCCGGGT GAAGGCCCGG GAGGCCGACC GGCGGAAAGA CTGGGTGGAG AAAACCTCCA CCGATCTTGC CCGCCGGTTC GACGTCATCC GGGTGGAAGA TCTCCGCATC ACGAACATGA CCCGGTCGGC CCGAGGCACT GTCGAACAGC CGGGCCGGAA TGTTCGGCAG AAAGCCGGAC TGAACCGGGG CATCCTCGCC AACGGCTGGG GTCTGCTTGC CCGGCGGTTG GAACAGAAAG CACCCGGCCG GGTGGAGAAG ATCCCGGCCG CCTACACCAG TCAGTGTTGC TCGTCCTGCG GGCATGTGGC GCCCGGGAAC CGCGAGAGCC AAGCGGTGTT CCGGTGCGTC GCCTGCGGAC ACACGGCCAA CGCGGACGTG AACGCTGCAT GCAACATCGC GGCTGGACGG GCCGTGACCG CGCGGGGAGG CGCAGCATTG GCCGCGAACC GCGAACCTCA ACACTCCACG CCTCCTCTGG TGGATGGGTA G
|
Protein sequence | MSRYRLDPAP EQVAGIEEHC GHARFVWNLA VEQQSWWKPG QGRAPNHAER CRQLTEARAE FEWLRAGSQT VQQQALRDFD QAIRNFFNGS HRHPTFRKRG RSEGFQIVGK NARVEKLNRK WSQCWIPKVG WVKFRVSRII PDFRSYRVTR DRAGRWHVAF AVAPDPVPAP GTGEVGEVVG VDRGVAVSAA LSTGERLSCP TLRPKEAERL RRLQRRLAKA TRGSNRRGRL RTQIARVKAR EADRRKDWVE KTSTDLARRF DVIRVEDLRI TNMTRSARGT VEQPGRNVRQ KAGLNRGILA NGWGLLARRL EQKAPGRVEK IPAAYTSQCC SSCGHVAPGN RESQAVFRCV ACGHTANADV NAACNIAAGR AVTARGGAAL AANREPQHST PPLVDG
|
| |