Gene Information |
Locus tag | Franean1_3509 |
Symbol | |
ID | 5675715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4170166 |
End bp | 4171362 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242396 |
Product | DNA (cytosine-5-)-methyltransferase |
Protein accession | YP_001507816 |
Protein GI | 158315308 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
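The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record: the variable names are illustrative, and it assumes the Start bp and End bp coordinates are 1-based and inclusive, which is the convention under which they reproduce the listed gene length.

```python
# Values copied from the Gene Information table.
start_bp = 4170166
end_bp = 4171362
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons. The final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span (bp):", span_bp)
print("codons including stop:", codons)
print("codons excluding stop:", coding_codons)
print("consistent with record:",
      span_bp == gene_length_bp
      and remainder == 0
      and coding_codons == protein_length_aa)
```

Because the strand is "-", the gene lies on the reverse strand of the replicon; the arithmetic on the span is the same for either strand.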
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCT ACCGGCTCGA CCCGGCAGCC GAACAGGCGG CCGGAATGGA GGAGCACTGC GGGCACGCGC GTTTTGTCTG GAATCTCGCG GTCGAACAGC AGTCGTGGTG GAAACCCGGC CGGGGGAACG CGCCGAACCA TGCGGAGCGT TGCCGGCAGT TGACCGAAGC GCGGGCGGAG TTCAAGTGGC TGCGGGCCGG TTCGCAGACC TTTCAGCAGC AGGCGCTTCG AGACTTCGAC CAGGCTATGC GGAACTTCTT CAACGGCTCG CACCGCCGCC CGATCTTCCG GAAACGTGGC CGGTCCGATG GTTTCCAGAT CGTGGGGAAG AACGCACGGG TCGAGAATCT GAACCGGAAG TGGTCCCGAT GCTGGATCCC GAAAGTCGGC TGGGTGAGGT TCCGCGTGTC GCGGACGATC CCGGACTTCA GGTCGTACCG GGTGACACGG GATCGGGCGG GCCGCTGGCA TGTGGCGTTC GCCGCGGCAC CTGACCCGAT CCCCGCGCCG GGAACCGGCG AGGTCGGCGA GGTCGTCGGT GTGGATCGGG GTGTTGCCGT GTCCGCGGCG CTGTCCACCG GGGAGCTGCT GTCCTGCCCG AAACTGAGAC CGAAGGAAGC GGAACGGCTC CGCCGGCTCC AGCGCCGGTT GGCGAAAGCC ACACGCGGGT CGAACCGGCG TGGCCGGCTG AAGACCCAGA TTGCCCGGGT GAAAGTCCGG GAGGCCGACC GGCGGAAAGA CTGGGTGGAG AAAACCTCCA CCGATCTTGC CCGCCGGTTC GACGTCATCC GGGTGGAGGA TCTCCGTATC AAGAACATGA CCCGGTCGGC CCGAGGTACT GTCGACCAGC CGGGCCGGAA TGTTCGACAG AAAGCCGGGC TGAACCGGGG CATCCTCGCC AACGGGTGGG GTCTCCTTGC CCGGCGGTTG GAACAGAAAG CACCCGGCCG GGTGGAGAAG ATCCCGGCCG CCTACACCAG TCAGTGTTGC TCGTCCTGCG GGCATGTGGC GCCCGGGAAC CGCGAGAGCC AAGCGATGTT CCGGTGCGTC GCCTGCGGGC ACACGGCCAA CGCGGACGTG AACGCGGCAC GCAACATCGC GGCTGGACGG GCCGTGACCG CGCGGGGAGG CACGGTGTTG GCCGCGCCCG CGAACCGCGA ACCTCAACAT TCCACACCTC CTCTGGTGGG TGTGTAG
|
Protein sequence | MSRYRLDPAA EQAAGMEEHC GHARFVWNLA VEQQSWWKPG RGNAPNHAER CRQLTEARAE FKWLRAGSQT FQQQALRDFD QAMRNFFNGS HRRPIFRKRG RSDGFQIVGK NARVENLNRK WSRCWIPKVG WVRFRVSRTI PDFRSYRVTR DRAGRWHVAF AAAPDPIPAP GTGEVGEVVG VDRGVAVSAA LSTGELLSCP KLRPKEAERL RRLQRRLAKA TRGSNRRGRL KTQIARVKVR EADRRKDWVE KTSTDLARRF DVIRVEDLRI KNMTRSARGT VDQPGRNVRQ KAGLNRGILA NGWGLLARRL EQKAPGRVEK IPAAYTSQCC SSCGHVAPGN RESQAMFRCV ACGHTANADV NAARNIAAGR AVTARGGTVL AAPANREPQH STPPLVGV
|
| |
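The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below is illustrative and not part of the original record: the helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical, and only the first three and last three blocks of the gene sequence are used as input to keep the example short. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three blocks of the gene sequence in the record.
# Both fragments start on a codon boundary.
first_blocks = clean_sequence("ATGTCCCGCT ACCGGCTCGA CCCGGCAGCC")
last_blocks = clean_sequence("TCCACACCTC CTCTGGTGGG TGTGTAG")

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_blocks))   # compare with the end of the protein sequence
print(round(gc_fraction(first_blocks), 2))  # GC fraction of this fragment only
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared with the GC content, Protein Length and Protein sequence fields of the record. The GC fraction of a short fragment will generally differ from the whole-gene figure.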