Gene Franean1_0945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0945 
Symbol 
ID5669359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1104870 
End bp1107977 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content69% 
IMG OID641239872 
Producthypothetical protein 
Protein accessionYP_001505307 
Protein GI158312799 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.203937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.591607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGGAA CCAGTCGCAG ACCGCTGCTG ACACGCACAC GCCTGCTCGT GCCGGTGCTG 
GTCATTCTGG TCCTGCTCGT CGTGTTTCTC GGCGTCTTCA CTCGCCTCTA CACAGATCTT
CTGTTCTACC GCTCGGTGGA CTTCAGCAAC GTCTTCGGCA CCGTCGTATT CACACGAATA
CTTCTGTTCG TGCTCTTCGG TGCGGTCATG GCAATTGCGG TGGGAACGAA CATAGTACTC
GCCTACAAGT TGCGGCCGCC CATCCGGCCC CTCTCGACCG AGCAGCAGAA CCTCGAGCGG
TACCGCGTTG CGATCGAACC GTACATGCTG CTCGTCCTGC TGGCGGTCTC GACCCTGTTC
GGGCTGATCG CGGGGCTGTC GGCGTCCGGG CGCTGGCGGA CCTGGCTGCT GTGGATCAAC
AGCGAGCCGT TCAATCAGAC CGACGCCCAG TTCGGCCGGG ACATCAGCTA CTACACCTTC
AGCTATCCGT TCCAGCGCTT CCTGCTTGCC TTCCTGCTCA CCGCGGTCAT GCTCTCGCTG
CTGGTCGCGG TGCTCACCCA CTACCTGTTC GGCGGAATCC GGCTGCAGTC CGCCGGCGAG
CGGGTGGCCC CGGCGGCGAA GGCGCACATC TCCGTGCTGC TCGGGCTGGT CGCGCTACTC
AAGGCGTGGG CCTACTACCT GGACCGCTTC GGCCTGGTGT TCTCGGCCCG GGGCGTGTCG
ACCGGAGCGT CCTACACCGA CGTCCACGCG GTGCTGCCGG CGAAGCTCAT CCTGCTGTTC
ATCTCGCTGG CCTGCGCCGT GCTGTTCATC TACAACATCT TCCAGCGGGG CTGGACGCTG
CCGTTGCTCG GTGCCGGCAT CCTCGTGCTG TCCTCGGTGG TCATCGGCGG GATCTACCCG
GCGATCGTCC AGCAGTTCCA GGTCCGGCCG AACGAGGCGT CGCGCGAGGA GCCCTACATC
GCGCGCAACA TAGCCGCGAC CAGGTCCGCG TACGACATCC AGGACGTCAA ACCGGTCCCG
TATCCGGCGA CCACGGGCGC GACGGCGCAG CAGATAGCCG ACGACAAGGG CACCGTCCCG
AACATCCGCC TGCTCGACCC GAGCAAGCTG TCCACCACGT TCCAGCAGCT CCAGCAGATC
CGTGGTTACT ACGGGTTCCC GCAGACCCTC GACGTCGACC GCTACACCAC GACCTCGGAC
GGGAAGACCA CGACCCGGGA CTATGTGGTG TCCGTCCGTG AGCTCAACCA GGACGGCCTC
GGTGAGGACC AGCGGAACTG GATCAACCAG CACCTGACCT ACACCCACGG GCGGGGCTTC
GTCGCGGCGC CGTCCAACAC CGCCGATGAG GGCCGCCCCG CGTTCACCGA GCGCAATCTG
CCGGAGACCG GTGACTTCGG CGTCACCGAG AACCGGATCT ACTTCGGTGA GATGTCCCCG
CAGTACTCGA TCGTCGGCAC CCGCCAAGCG GAGATCGACG GCCCGGGGCC GAACGACACC
CAGCTCACGA CCAGCTACAC CGGCGACGGC GGCGTCTCGG TCGGCTCGAC CTTCCGGCAG
GCGCTGTTCG CGCTGCGCTT CGGGGAGCCG AACATCCTGC TCTCCGGCGA CATCACCGGC
CAGTCGCGCA TCCTCTACGA GCGCAACCCG CGGGACCGGG TCAGCAAGGT GGCGCCCTGG
CTCACCCTGG ACGGCGACCC CTACCCGGCG GTGGTCGACG GGCGGGTCAC CTGGATCCTC
GACGGCTACA CCACCTCGGA CGGCTACCCC TACTCGGCCC GGCGGACCTT CGGTGACGTC
ACGGCCGACG CCGTGACCAC CCAGAGCCGC AACCGCACCC AGCAGCCGGA GAACCAGGTC
AACTACATCC GGAACTCGGT GAAGGCGACT GTCGACGCCT ACAACGGCAC GGTGACGCTC
TACGCCTGGG ACGAGCAGGA CCCGGTGCTG CGGACCTGGA TGAAGGCCTT CCCCGACACG
GTGCGCCCCA AGGCCGACAT CCCGCCGGCG CTGATGGAGC ACTTCCGCTA CCCGGAGGAC
ATGTTCAAGG TCCAGCGGGA CCTCCTCGGG CAGTACCACG TCTCCAACCC GCGGGACTTC
TACTCCCAGG AGGACTTCTG GACGGTCTCC GAGTCCCCGG ACGACACCCG CGAGCCGCAG
CCGCCGTTCT ACGTCTACAG CCAGCTCCCC GGGCGCAAGG AACCGTCCTA CAACCTGACC
TCGCCGCTGA TCTCGGCGCG GTCGTCCAAG CTCGCGGCCT ACATGGCCGT CTCGATGGAT
TCGGACAACT ACGGCCAGTT CACCCTGCTG CAGCTGCCGC CGGGCGACAC GATCAACGGT
CCTGTCCAGG TGCAGGCCGC CATCGAGTCG AACGGCGACG TCGTCCGGCA GCTCAACCTC
TGGCGCGGCG CGGGCTCACA GACGATCGAG GGCAACCTGC TGACGCTGCC CGTGGCCGGC
GGGCTGCTCT ACGTGGAGCC GTACTACGTC CAGGCCCGCG GATCGACGGG ATATCCGACC
CTGCAGGGGG TGGCGGCAGC CTTCGGTGAA CGGATCGGGT TCGGTTCCTC GCTCGCGGAG
GCGCTGAACG CCGTCTTCGG GGCGGGGGCC GGCGCGTCCG CGGCCGGGGC GGGCACCTCG
GCCGCGCCCT CCACGGGCAC CGAACCGGGG ACAACGTCCC CACCGTCGAG CCCCGCGCCG
TCGGCACCGC CGGCCGGGGA TCTGGCGGGC GCCGTCGCCG AAGCCGAACG CGCCTACCAG
GCGGGCCAGG ACGCACTCGG CAAGAACCCG CCGGACTTCG CCGCCTACGG CAGGGCCCAG
ACGGACCTCC AGGCGGCGTT GGACAGGCTG CGCCAGCTGT CACCGACGAC CCCGGCGCCG
ACGACGGCAC CTCCGGCGAC GGCACCGACG AAACCGCCGG CGACGACGGC GCCCGCCTCC
GCGCCGCCGG CCACCGCCGA GCCCGCCTCG ACGACCCGCG CGGCGCCAGC CGCCGCGCCC
GCCGCCATGG ACGGGGCGGG GGATCCGGGC CCACCGGTGC GGCCGGCCGT CGCCGAGAAC
CCACCGGCGG TCCGCCCGGC CGTCGCGCGG ACCGGGGATC CCGGCTGA
 
Protein sequence
MAGTSRRPLL TRTRLLVPVL VILVLLVVFL GVFTRLYTDL LFYRSVDFSN VFGTVVFTRI 
LLFVLFGAVM AIAVGTNIVL AYKLRPPIRP LSTEQQNLER YRVAIEPYML LVLLAVSTLF
GLIAGLSASG RWRTWLLWIN SEPFNQTDAQ FGRDISYYTF SYPFQRFLLA FLLTAVMLSL
LVAVLTHYLF GGIRLQSAGE RVAPAAKAHI SVLLGLVALL KAWAYYLDRF GLVFSARGVS
TGASYTDVHA VLPAKLILLF ISLACAVLFI YNIFQRGWTL PLLGAGILVL SSVVIGGIYP
AIVQQFQVRP NEASREEPYI ARNIAATRSA YDIQDVKPVP YPATTGATAQ QIADDKGTVP
NIRLLDPSKL STTFQQLQQI RGYYGFPQTL DVDRYTTTSD GKTTTRDYVV SVRELNQDGL
GEDQRNWINQ HLTYTHGRGF VAAPSNTADE GRPAFTERNL PETGDFGVTE NRIYFGEMSP
QYSIVGTRQA EIDGPGPNDT QLTTSYTGDG GVSVGSTFRQ ALFALRFGEP NILLSGDITG
QSRILYERNP RDRVSKVAPW LTLDGDPYPA VVDGRVTWIL DGYTTSDGYP YSARRTFGDV
TADAVTTQSR NRTQQPENQV NYIRNSVKAT VDAYNGTVTL YAWDEQDPVL RTWMKAFPDT
VRPKADIPPA LMEHFRYPED MFKVQRDLLG QYHVSNPRDF YSQEDFWTVS ESPDDTREPQ
PPFYVYSQLP GRKEPSYNLT SPLISARSSK LAAYMAVSMD SDNYGQFTLL QLPPGDTING
PVQVQAAIES NGDVVRQLNL WRGAGSQTIE GNLLTLPVAG GLLYVEPYYV QARGSTGYPT
LQGVAAAFGE RIGFGSSLAE ALNAVFGAGA GASAAGAGTS AAPSTGTEPG TTSPPSSPAP
SAPPAGDLAG AVAEAERAYQ AGQDALGKNP PDFAAYGRAQ TDLQAALDRL RQLSPTTPAP
TTAPPATAPT KPPATTAPAS APPATAEPAS TTRAAPAAAP AAMDGAGDPG PPVRPAVAEN
PPAVRPAVAR TGDPG