Gene Francci3_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2953 
Symbol 
ID3903768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3493925 
End bp3497644 
Gene Length3720 bp 
Protein Length1239 aa 
Translation table11 
GC content66% 
IMG OID637880274 
Productputative DNA methylase 
Protein accessionYP_482040 
Protein GI86741640 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.468848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.142335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAGG CGAGACGCCT GTTGGCCGAT CTCCAGCGGC AGGTCCGCGG CCTGGAGGCG 
GACCTGCGCG CGCGGGCCCG GTCGGACCGG GACGTCGACG ATCGGCTGCA CAAGCGGTAC
CAGGAGGCGA AGACCGGCTG TCGCACCGGC GTCGGCTACG AGACGTGGCT CGACCAGCAG
CTCACCCAGG TGGCGGTCGG TTGGGTGCTG GCCTGCGTGT TCACCCGATT CTGCGAGGAC
AACACGCTGC TGGGACGTCC GATGCTGGCC GGCCCGGTCC GCCCCGAGCA GGAGGCCGAC
GAGACAGGCG AGGCCAGGGA ACGGGTCGAC GGGGTGGCCG AGGCCCGGGA ATGGCAGACT
TTCTACTTCC AGGCGGAGGA CCACAAGAAT GACTCCGACC TGGACTACCT GCGGGCGGCG
GTCGCCCGGC TGGAGGCGTA CGACGCAACC CGGGATCTGG TCAACCGGCA CAACCCGCTG
CACCTGGTGG ACATCTCACC GGACGCGGCG ACCGGCCTGC TGGAGTTCTG GCGGCGGATC
GACCCGGAAA CCGGCGCTCT CGTCCACGAC TTCTCCGATC CGGCGTCCTC CACCCGCTTC
CTCGGCGATC TGTACCAGGA TCTTTCCGAA CAGGCGCGTA AGGACTACGC GCTGCTCCAG
ACCCCCGAGT TCGTCGAGGA GTTCATCCTC GGCCGGACCC TTGCCCCGGC GATCGACGAG
TTCGGCCTCG CCGAGGTTCG GATGATCGAC CCGGCCTGCG GCTCCGGGCA CTTCCTGCTC
GGCGCCTTCG ACCTCCTGCT CGACCGGTGG CAGAAGCAGG AACATGGGAT TGACGTCAAG
GTCCACGTGG AGCGCGTACT CGGCCAGGTC CACGGTGTCG ACATCAACCC GTTCGCGGCG
GCGATCGCCC GGTTCCGCCT CGTCATCGCC GCACTGGCCG TCTGCGGGAT CACCCGGCTC
GTCGACGCCC CCGTCTGGCG CATCCGCATC GCCATCGGTG ACTCCCTGCT GTGGGGCACC
GAGGATCAGC AGGACGAACT CGACGGCGTC GCCACCCACG CCCTGACGGG TTCCACCGGT
GAGGGTGAGT TCACCTACGA GTACGAGGAC GCTCGCGAGC TGAAGGAGAT CCTGGAAGCC
CGCTACCACG CGGTTGTCGG CAACCCGCCG TACATCACGG TCAAGGACCG GGCCCCCAGC
AACGCCTATC GGGTGCGCTG GAGCGCATGC CATCGCCAGT ACGCGCTCAG CGTGCCGTTC
GCGCAGCGGT TCTTCCGACT CGCGGTCAAC GGCGGCTTCA CCGGCCAGAT CACCGCGAAC
TCCTTCATGA AGCGCGAGTT CGGCAAGAAA CTCATCGAGG AGTTCTTCCC GAGCGTCGAC
CTCACCCTGC TCGTCGACAC CAGCGGCGCC TACATCCCCG GCCACGGAAC CCCGACCGTC
CTCATCTTCG GCCGCCACCG CCGCCCCTCC CTGACCACGG TGCAGACTGT CCTTGGGATC
CGGGGCGAGC CCAGCGCACC GGCCGACCCG GCCAAAGGCC TCGTGTGGAC ACGAATAGTA
GAGAACTTCG CGAATGCTGA CCACACGGAT GCGTATATCA GCACCCTCGG AATCAAGCGG
ATAGATCTTG CGAGGCATCC CTGGTCACTC GCTGGCGGTG GAGCGGCTCA GTTGCAAGCT
CAGATAGACA CAACCAGCGC GCGTCTACAT AATCTCGGAG TTGACATCGG ACGCACCGTC
CAGCTCGGCG AAGATGACGC TTGGATACTT GCCCATAGTT CTCCGGCGGT GACCAGGTTT
CAGGACAACA TGGTCCCCCA TGTTTTCGGC GAGCTAGTAC GTGACTACAC GATCGAACAA
CCTCCCATTG CAATTAATCC GTATGCCGAC ATAACCAAAG GAATCCCCCT TTTAAAAGAT
CAGGAAGTTG TACAAAATCT CCTCTGGCCA AACCGAACAA TCCTGTCTGC TCGCACAATA
TTCGGTAGAA GTCTTGCCGA AAACGATCGA CCATGGTATG CATATTTAGA AATATATGCA
AACCGACTGC ACACCCCGCT GGCGATCGCC TTTCCGTTCG TCTCGACCCA CAACCACTTC
ACACTCGACC GAGGCGGCAA GATATTCAAT CGTACCGCAA CCGTTCTCAA GCTGCCCGAG
GGAGCCACGG AGGAGAAACA TCTGCGGCTG GTCGGGGTGC TGAACTCGTC GGCGGCGTGC
TTCTGGCTCA AGCAGGTGAG CCACGACAAA GGAATCCGTG GCCAGGGAGG AGGCTTCACC
AGTGACGACT GGGAACACTT CTACGAGTTC ACCGGGACCA AGCTGAAGGA GTTTCCGCTG
CCGGACGGGG CGCCACTGGT GTTGGCGACG CGGCTGGACG GGTTGGCTCA GGAGCTCCAG
CGGGTCGCGC CGGCCGCGGT GGCCAGGGAC GCTGTTCCGA CCCGGGAGGC ACTCGTGCAC
GCGCGGGCGG AGTGGGAGCG GATCCGGGCG GAGATGATCT CGGCGCAGGA GGAGCTGGAC
TGGGAGGTCT ACGGTCTCTA CGGGCTGCTC GGCGACGACG CGGACGGGTT GATCGGTTCG
AGTGTGACGA AGCCGCCGCT GGCGCTTGGC GAACGGGCGT TCGAGATCGT GCTGGCGCGG
CAGCACGATG CCGGCGAGAC CGAGACGGAA TGGTTCACCC GGCACCGCTC GACGCCGATC
ATCCAGCTGC CCGCGCACTG GCCGCAGGAC TACCGGGCCC TCGTCGAGCG GCGGCTGGCG
AAGATCGACG ATGATCCGTA CCTTCACCTC ATCGAGCGGC CGGAATGCAA GCGGCGCTGG
GCGAGCCGGC CGTGGGCGGA GATGGAGGCC GAGGCGTTGC GCGCCTGGCT GCTCGACCGG
CTGGAGGCCC GCGAACTGTG GCACCGACCG GAACCGACCC CGCGCACCGT CGCCCAGCTC
GCCGACGAGC TGCGGACCGA CGCCGAGTTC ACCGCCGTCG CCAGGCTCTA CGCCCGCGAC
ACCGCTCTTG GTGACGTGGT CGCCGATCTC GTGCGCGACG AGCATGTACC GTTCCTCGCC
GCCTGGCGGT ACACCGACAT GGGGCTGCGG GTCCGGGCGC AGTGGGAACG CACCTGGGAT
CTCCAGCGGG AGCAGGATGC CGAGGACGAG CGGATCCGGG CCGAGGAGGA GCGGCGTAAG
GAGTCCGACG AGCCGCTGCC ACCGGCCCCA CCGCGTAAAA TCATCGACAT CAAGGTGCCG
CCGAAGTACA AACAGACCGA CTTCCGCGAG ACGTCCTACT GGCGCAGCCG CGGGAAGCTG
GACGTGCCCA AGGAGCGGTT CATCTCGTAC CCGGATGCTT CCCGGGACGG GACCCTGCTG
CTGGGCTGGG CCGGCTGGGA TCATCTCCAG CAGGCGCAGG CGCTGGCCAC CTATATCGCC
GATCGGCGCG AGGTCGACGC CTGGGACGCC GAGAAAACCA AGCCGTTGCT CGCCGGGCTG
CTGGAACTCC TGCCGTGGGT CGCGCAATGG CATTCGGAAC TGGACCCGGA GTTCGGTATT
CGGCCCGCGG ACGCCTACAC CGGCTTCCTC GACGAGCAGG TGCGCCAGCT CGGGCTGACC
CGTGACGATC TCACCGGCTG GCTCCCCGCG GCCAGGAAGA CGAGAACCAC GGTGAAGAAG
CCGGCGAAGA AGCCGGCGAA TAAGCCCGCC ACGACTCCGG CGAAGGCGGC CCGGTCATGA
 
Protein sequence
MIEARRLLAD LQRQVRGLEA DLRARARSDR DVDDRLHKRY QEAKTGCRTG VGYETWLDQQ 
LTQVAVGWVL ACVFTRFCED NTLLGRPMLA GPVRPEQEAD ETGEARERVD GVAEAREWQT
FYFQAEDHKN DSDLDYLRAA VARLEAYDAT RDLVNRHNPL HLVDISPDAA TGLLEFWRRI
DPETGALVHD FSDPASSTRF LGDLYQDLSE QARKDYALLQ TPEFVEEFIL GRTLAPAIDE
FGLAEVRMID PACGSGHFLL GAFDLLLDRW QKQEHGIDVK VHVERVLGQV HGVDINPFAA
AIARFRLVIA ALAVCGITRL VDAPVWRIRI AIGDSLLWGT EDQQDELDGV ATHALTGSTG
EGEFTYEYED ARELKEILEA RYHAVVGNPP YITVKDRAPS NAYRVRWSAC HRQYALSVPF
AQRFFRLAVN GGFTGQITAN SFMKREFGKK LIEEFFPSVD LTLLVDTSGA YIPGHGTPTV
LIFGRHRRPS LTTVQTVLGI RGEPSAPADP AKGLVWTRIV ENFANADHTD AYISTLGIKR
IDLARHPWSL AGGGAAQLQA QIDTTSARLH NLGVDIGRTV QLGEDDAWIL AHSSPAVTRF
QDNMVPHVFG ELVRDYTIEQ PPIAINPYAD ITKGIPLLKD QEVVQNLLWP NRTILSARTI
FGRSLAENDR PWYAYLEIYA NRLHTPLAIA FPFVSTHNHF TLDRGGKIFN RTATVLKLPE
GATEEKHLRL VGVLNSSAAC FWLKQVSHDK GIRGQGGGFT SDDWEHFYEF TGTKLKEFPL
PDGAPLVLAT RLDGLAQELQ RVAPAAVARD AVPTREALVH ARAEWERIRA EMISAQEELD
WEVYGLYGLL GDDADGLIGS SVTKPPLALG ERAFEIVLAR QHDAGETETE WFTRHRSTPI
IQLPAHWPQD YRALVERRLA KIDDDPYLHL IERPECKRRW ASRPWAEMEA EALRAWLLDR
LEARELWHRP EPTPRTVAQL ADELRTDAEF TAVARLYARD TALGDVVADL VRDEHVPFLA
AWRYTDMGLR VRAQWERTWD LQREQDAEDE RIRAEEERRK ESDEPLPPAP PRKIIDIKVP
PKYKQTDFRE TSYWRSRGKL DVPKERFISY PDASRDGTLL LGWAGWDHLQ QAQALATYIA
DRREVDAWDA EKTKPLLAGL LELLPWVAQW HSELDPEFGI RPADAYTGFL DEQVRQLGLT
RDDLTGWLPA ARKTRTTVKK PAKKPANKPA TTPAKAARS