Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2953 |
Symbol | |
ID | 3903768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3493925 |
End bp | 3497644 |
Gene Length | 3720 bp |
Protein Length | 1239 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637880274 |
Product | putative DNA methylase |
Protein accession | YP_482040 |
Protein GI | 86741640 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.468848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.142335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGAGG CGAGACGCCT GTTGGCCGAT CTCCAGCGGC AGGTCCGCGG CCTGGAGGCG GACCTGCGCG CGCGGGCCCG GTCGGACCGG GACGTCGACG ATCGGCTGCA CAAGCGGTAC CAGGAGGCGA AGACCGGCTG TCGCACCGGC GTCGGCTACG AGACGTGGCT CGACCAGCAG CTCACCCAGG TGGCGGTCGG TTGGGTGCTG GCCTGCGTGT TCACCCGATT CTGCGAGGAC AACACGCTGC TGGGACGTCC GATGCTGGCC GGCCCGGTCC GCCCCGAGCA GGAGGCCGAC GAGACAGGCG AGGCCAGGGA ACGGGTCGAC GGGGTGGCCG AGGCCCGGGA ATGGCAGACT TTCTACTTCC AGGCGGAGGA CCACAAGAAT GACTCCGACC TGGACTACCT GCGGGCGGCG GTCGCCCGGC TGGAGGCGTA CGACGCAACC CGGGATCTGG TCAACCGGCA CAACCCGCTG CACCTGGTGG ACATCTCACC GGACGCGGCG ACCGGCCTGC TGGAGTTCTG GCGGCGGATC GACCCGGAAA CCGGCGCTCT CGTCCACGAC TTCTCCGATC CGGCGTCCTC CACCCGCTTC CTCGGCGATC TGTACCAGGA TCTTTCCGAA CAGGCGCGTA AGGACTACGC GCTGCTCCAG ACCCCCGAGT TCGTCGAGGA GTTCATCCTC GGCCGGACCC TTGCCCCGGC GATCGACGAG TTCGGCCTCG CCGAGGTTCG GATGATCGAC CCGGCCTGCG GCTCCGGGCA CTTCCTGCTC GGCGCCTTCG ACCTCCTGCT CGACCGGTGG CAGAAGCAGG AACATGGGAT TGACGTCAAG GTCCACGTGG AGCGCGTACT CGGCCAGGTC CACGGTGTCG ACATCAACCC GTTCGCGGCG GCGATCGCCC GGTTCCGCCT CGTCATCGCC GCACTGGCCG TCTGCGGGAT CACCCGGCTC GTCGACGCCC CCGTCTGGCG CATCCGCATC GCCATCGGTG ACTCCCTGCT GTGGGGCACC GAGGATCAGC AGGACGAACT CGACGGCGTC GCCACCCACG CCCTGACGGG TTCCACCGGT GAGGGTGAGT TCACCTACGA GTACGAGGAC GCTCGCGAGC TGAAGGAGAT CCTGGAAGCC CGCTACCACG CGGTTGTCGG CAACCCGCCG TACATCACGG TCAAGGACCG GGCCCCCAGC AACGCCTATC GGGTGCGCTG GAGCGCATGC CATCGCCAGT ACGCGCTCAG CGTGCCGTTC GCGCAGCGGT TCTTCCGACT CGCGGTCAAC GGCGGCTTCA CCGGCCAGAT CACCGCGAAC TCCTTCATGA AGCGCGAGTT CGGCAAGAAA CTCATCGAGG AGTTCTTCCC GAGCGTCGAC CTCACCCTGC TCGTCGACAC CAGCGGCGCC TACATCCCCG GCCACGGAAC CCCGACCGTC CTCATCTTCG GCCGCCACCG CCGCCCCTCC CTGACCACGG TGCAGACTGT CCTTGGGATC CGGGGCGAGC CCAGCGCACC GGCCGACCCG GCCAAAGGCC TCGTGTGGAC ACGAATAGTA GAGAACTTCG CGAATGCTGA CCACACGGAT GCGTATATCA GCACCCTCGG AATCAAGCGG ATAGATCTTG CGAGGCATCC CTGGTCACTC GCTGGCGGTG GAGCGGCTCA GTTGCAAGCT CAGATAGACA CAACCAGCGC GCGTCTACAT AATCTCGGAG TTGACATCGG ACGCACCGTC CAGCTCGGCG AAGATGACGC TTGGATACTT GCCCATAGTT CTCCGGCGGT GACCAGGTTT CAGGACAACA TGGTCCCCCA TGTTTTCGGC GAGCTAGTAC GTGACTACAC GATCGAACAA CCTCCCATTG CAATTAATCC GTATGCCGAC ATAACCAAAG GAATCCCCCT TTTAAAAGAT CAGGAAGTTG TACAAAATCT CCTCTGGCCA AACCGAACAA TCCTGTCTGC TCGCACAATA TTCGGTAGAA GTCTTGCCGA AAACGATCGA CCATGGTATG CATATTTAGA AATATATGCA AACCGACTGC ACACCCCGCT GGCGATCGCC TTTCCGTTCG TCTCGACCCA CAACCACTTC ACACTCGACC GAGGCGGCAA GATATTCAAT CGTACCGCAA CCGTTCTCAA GCTGCCCGAG GGAGCCACGG AGGAGAAACA TCTGCGGCTG GTCGGGGTGC TGAACTCGTC GGCGGCGTGC TTCTGGCTCA AGCAGGTGAG CCACGACAAA GGAATCCGTG GCCAGGGAGG AGGCTTCACC AGTGACGACT GGGAACACTT CTACGAGTTC ACCGGGACCA AGCTGAAGGA GTTTCCGCTG CCGGACGGGG CGCCACTGGT GTTGGCGACG CGGCTGGACG GGTTGGCTCA GGAGCTCCAG CGGGTCGCGC CGGCCGCGGT GGCCAGGGAC GCTGTTCCGA CCCGGGAGGC ACTCGTGCAC GCGCGGGCGG AGTGGGAGCG GATCCGGGCG GAGATGATCT CGGCGCAGGA GGAGCTGGAC TGGGAGGTCT ACGGTCTCTA CGGGCTGCTC GGCGACGACG CGGACGGGTT GATCGGTTCG AGTGTGACGA AGCCGCCGCT GGCGCTTGGC GAACGGGCGT TCGAGATCGT GCTGGCGCGG CAGCACGATG CCGGCGAGAC CGAGACGGAA TGGTTCACCC GGCACCGCTC GACGCCGATC ATCCAGCTGC CCGCGCACTG GCCGCAGGAC TACCGGGCCC TCGTCGAGCG GCGGCTGGCG AAGATCGACG ATGATCCGTA CCTTCACCTC ATCGAGCGGC CGGAATGCAA GCGGCGCTGG GCGAGCCGGC CGTGGGCGGA GATGGAGGCC GAGGCGTTGC GCGCCTGGCT GCTCGACCGG CTGGAGGCCC GCGAACTGTG GCACCGACCG GAACCGACCC CGCGCACCGT CGCCCAGCTC GCCGACGAGC TGCGGACCGA CGCCGAGTTC ACCGCCGTCG CCAGGCTCTA CGCCCGCGAC ACCGCTCTTG GTGACGTGGT CGCCGATCTC GTGCGCGACG AGCATGTACC GTTCCTCGCC GCCTGGCGGT ACACCGACAT GGGGCTGCGG GTCCGGGCGC AGTGGGAACG CACCTGGGAT CTCCAGCGGG AGCAGGATGC CGAGGACGAG CGGATCCGGG CCGAGGAGGA GCGGCGTAAG GAGTCCGACG AGCCGCTGCC ACCGGCCCCA CCGCGTAAAA TCATCGACAT CAAGGTGCCG CCGAAGTACA AACAGACCGA CTTCCGCGAG ACGTCCTACT GGCGCAGCCG CGGGAAGCTG GACGTGCCCA AGGAGCGGTT CATCTCGTAC CCGGATGCTT CCCGGGACGG GACCCTGCTG CTGGGCTGGG CCGGCTGGGA TCATCTCCAG CAGGCGCAGG CGCTGGCCAC CTATATCGCC GATCGGCGCG AGGTCGACGC CTGGGACGCC GAGAAAACCA AGCCGTTGCT CGCCGGGCTG CTGGAACTCC TGCCGTGGGT CGCGCAATGG CATTCGGAAC TGGACCCGGA GTTCGGTATT CGGCCCGCGG ACGCCTACAC CGGCTTCCTC GACGAGCAGG TGCGCCAGCT CGGGCTGACC CGTGACGATC TCACCGGCTG GCTCCCCGCG GCCAGGAAGA CGAGAACCAC GGTGAAGAAG CCGGCGAAGA AGCCGGCGAA TAAGCCCGCC ACGACTCCGG CGAAGGCGGC CCGGTCATGA
|
Protein sequence | MIEARRLLAD LQRQVRGLEA DLRARARSDR DVDDRLHKRY QEAKTGCRTG VGYETWLDQQ LTQVAVGWVL ACVFTRFCED NTLLGRPMLA GPVRPEQEAD ETGEARERVD GVAEAREWQT FYFQAEDHKN DSDLDYLRAA VARLEAYDAT RDLVNRHNPL HLVDISPDAA TGLLEFWRRI DPETGALVHD FSDPASSTRF LGDLYQDLSE QARKDYALLQ TPEFVEEFIL GRTLAPAIDE FGLAEVRMID PACGSGHFLL GAFDLLLDRW QKQEHGIDVK VHVERVLGQV HGVDINPFAA AIARFRLVIA ALAVCGITRL VDAPVWRIRI AIGDSLLWGT EDQQDELDGV ATHALTGSTG EGEFTYEYED ARELKEILEA RYHAVVGNPP YITVKDRAPS NAYRVRWSAC HRQYALSVPF AQRFFRLAVN GGFTGQITAN SFMKREFGKK LIEEFFPSVD LTLLVDTSGA YIPGHGTPTV LIFGRHRRPS LTTVQTVLGI RGEPSAPADP AKGLVWTRIV ENFANADHTD AYISTLGIKR IDLARHPWSL AGGGAAQLQA QIDTTSARLH NLGVDIGRTV QLGEDDAWIL AHSSPAVTRF QDNMVPHVFG ELVRDYTIEQ PPIAINPYAD ITKGIPLLKD QEVVQNLLWP NRTILSARTI FGRSLAENDR PWYAYLEIYA NRLHTPLAIA FPFVSTHNHF TLDRGGKIFN RTATVLKLPE GATEEKHLRL VGVLNSSAAC FWLKQVSHDK GIRGQGGGFT SDDWEHFYEF TGTKLKEFPL PDGAPLVLAT RLDGLAQELQ RVAPAAVARD AVPTREALVH ARAEWERIRA EMISAQEELD WEVYGLYGLL GDDADGLIGS SVTKPPLALG ERAFEIVLAR QHDAGETETE WFTRHRSTPI IQLPAHWPQD YRALVERRLA KIDDDPYLHL IERPECKRRW ASRPWAEMEA EALRAWLLDR LEARELWHRP EPTPRTVAQL ADELRTDAEF TAVARLYARD TALGDVVADL VRDEHVPFLA AWRYTDMGLR VRAQWERTWD LQREQDAEDE RIRAEEERRK ESDEPLPPAP PRKIIDIKVP PKYKQTDFRE TSYWRSRGKL DVPKERFISY PDASRDGTLL LGWAGWDHLQ QAQALATYIA DRREVDAWDA EKTKPLLAGL LELLPWVAQW HSELDPEFGI RPADAYTGFL DEQVRQLGLT RDDLTGWLPA ARKTRTTVKK PAKKPANKPA TTPAKAARS
|
| |