Gene Franean1_7125 details

Gene Information

Locus tag: Franean1_7125
Symbol:
ID: 5675801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8696761
End bp: 8700966
Gene Length: 4206 bp
Protein Length: 1401 aa
Translation table: 11
GC content: 72%
IMG OID: 641245963
Product: WD-40 repeat-containing protein
Protein accession: YP_001511352
Protein GI: 158318844
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
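
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(8696761, 8700966)
print(length)                  # 4206
print(protein_length(length))  # 1401
```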


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCCG AGCGTTCGGA GCGCGGCGGA CCGCACGGTT CCGGTGAGGA TTCGCCGGGC 
GCCGCGGCCC GGCCGGACCA GCGGACCTCT CGGACTGTGG CCGGGAACGC GGCGGAGATG
CGTACGCCGC CGCTGGTCGA GTCCCAGGCG GCGCCCACCC AGATCATCGA CGGATACCCG
TCGAAGCCAC CGACCAGCGT CCCGCCGGCC GGTGCGTACT CGACGGCCTG CCCGTACCCG
GGGCTGCGCG GCTTCGACGA GTCGAGTGAG CAGTGGTTCT TCGGGCGTGA GCGGATGGTC
GCCGACCTGG TGGCGAGGGT CTCCTCGGAG GCGTCGAGGG TGGGGCCGCT GGTGCTCGTC
GGCGCCTCGG GGTCGGGGAA GTCGTCACTG CTGCGTGCCG GGCTCCTGCC CGCCCTCGCG
CGCGGAGCGC TTCCCGGCTC GCGTGGCTGG CCACGGCTGG TGATGACACC GGGCGAGCAC
CCGCTCGAGG CGCTCGTCCA GCGTGTTGTC GAGGCGACCG GGATGCCGAC GATGGCGCGG
ATGCTGGGTG ACAGCCTGCG CCGGGAGCCG GAGCGGCTCA GCGAGATCGT CCGTGAGCTG
CTCGCCGCGC GCGCCACGCC TCGGTCTGCC CCAGAGGGGG GCGAGACCAC GATCCGCCGC
GGCCCACGAA ACGACGCCAC CACGCTCTAC AGGCCAGGCA ACCCCCACGA GCCAGCCGAT
CCCCGCGAGC CGCGTGAATC CGCCGCGCCG CACGAACCCC GCGAGCTCTC CGCGTCACGC
GCGCCCCTCA AACCCGCCGT GCCCCACGAA CCCCCCACAC CGCGCAAAGC CCACGAACCA
CGCAAAGCTC ACGAACCACA CGCTCCGCAC GAGCCGCGGA TGGTGATCGC GGTCGACCAG
TTCGAGGAGG TTTTCACGCT CTGCGCGGAC CAGGCCGAGC GGGAGGCTTT CGTCCGGGCC
CTGTGCGCGA CCGCGGCGGG CAGTGCGGTC GTCGTGATCG GCCTGCGGGC GGACTTCTAC
GGGGCGTGCG CCTCCTTCCC CGAGCTCGTC GAGGTCCTGC AGGTCAATCA GGTGGTGGTC
GGGCCTATGG CGGCCGCCGA CATCCGCGAG ATCGTGGTGA ATCCGGCTCG TGCCGCGGGT
GGGGACGTTG AGCCCGGGCT CGTCGAGCTC GTCCTGCGCG ATCTCGGCGC CGCCGCTGGC
ACGGGGATCA CCGGAAGCGA CCAGGGCACC GGCCACGGCA GCAGTCGCAG CCCCGAGCTC
GTCTCCAACC TCAGCTCCGG GTTCGTGGCT GATCCGGGCT CGCTGCCACT GCTCGCCCAC
GCGCTGCGCG CGACCTGGTT CGCCCGTGCG GGCGAGGCGC TCACCGTCGC CGATTATCTG
CGCGTCGGCG GGCTGACCGG GGCCATCGCC CAGACCGCCG AGGCGGCCTA CACGAGTCTG
GACGGGGCCG CCCAGCAGGC CGTCCGACCA CTACTGATGC GCATGATCCG GCTTGGTGAG
AACGGCGCGG ACACCCGGCG GCGGGTACGG CGGGCGGCGC TGCTCGCGGA GGTGCCGGGG
CCGGAGTCGG CGACCGTCCT GGATGCGCTG GTAGCCGCGC GTCTCGTCGA GGCCGATGCC
GATGGGGATC AGGACGGTCT GCAGATCGCG CACGAGGCGT TGCTGCGTTC CTGGCCACGG
CTGCGCGAGT GGATGGACAT GGACCGCGCC GGCGCCGTGG CGTTGCAGCA GCTGCGCGAC
GCCGCTGAGG TGTGGGAGCG GGGCGGGCGG GATCCGTCCT ACCTTTTCGC CGGTTCGCGT
CTCGCCGCGG CCCGCGACTG GATGGACGAC GACCTGGACG TCGATGCGAC GACCAGGCAG
TTCTTCGACG AGAGTGTCCG CGCAGAGGCC GAGCAGCAGC GGGCGGCGGC GCGCCGGACC
CGTAGGCTGC GCCAGCTCGT GGCGGCGCTG GCCGTCCTGC TGCTCGTCGC CGCGTCGCTG
GCGGGCCTGA CGTTCCAGCA GAGCGCCTCG TCGGGGCGGG CTCGGGACCA GGCGCTCTCG
CAGCGGATCG CGACGCAGGC GGAGGCTGCC AGGCGTAACA ATCCGGCGCT GGCCGCCCAG
CTCAGCCTGG TCGCGCTGCG CACGGCGGAC ACGCCGGAGG CACGCGGGGC CGTCCTGTCG
TCGTTCAACG GCGGTAGCGG GGTGCCGACT CGGTACCAGG CGCATACCAA GTCGGTCGGG
ACGGTCGCCT ACAGCAGGGA CGGCCGCCTG CTCGCGACGG GCAGCGACGA CTGGACAGCC
GCGGTCTGGG ACGCGGCCGA TCCGCGGCGG CTCACGCCAC TCGCACGCAT CCCGGACGAA
CGCGGTGGCG GGCATGGCAG GGCGGTGAAG GCGGTGGCCT TCAGCCGGGA CGGGACGGTG
CTGGCGACCG GCGGCGCCGA CGGCCTGGCC AAGCTGTGGA ACATCACCGA CCGGGCCCGA
CCGCGCCTGC TGGCGACCCT GCCGAAGGCG GATTCGGAGG TCTACGGCCT GGCGTTCGAT
CCGACGTCTG ATCGTCTCGC CGTCGGTGGT TACGGCAAGT CGGCGTACAT CTACGACGTC
TCCGATCCGG CGCGCCCGGA GAAGAAGGGG CAGCTCTTCC TGCACCTGGC GCAGGTGGTG
GCGCTGGAGT TCAGCCCGGA CGGCGCGTTC CTGATCGCCG GGGACGAGGG CGGTTCCGCC
CTGTTGTGGT CGGTCAGTGA TCCGAGCAAC CCGAGGCCGC TCAAGGTTCT CGTCGAGGAC
GGCGGCCCGA GCTCGGACGG CGCGGGCGCG ATCCGGTCGA TCACCTTCGG CGGCGACGGC
CACACGGTCT ACACCGCCGG GGACGGCGGC TACGTCCGCA AGTTCTCCGG GCCGGACCTG
CCGCGTCTGG AGTACGACGG CCGGGCGGGG ACCGGGGACG CGCCGATGAC CGGCCTCGCC
GTGGACCCGG TCAGCGGCCT GGTCGGCGTG GGCGGGTTCC GGTACGTCGG AGTTCCGATT
TTCGATGTGG ACGTCGACCA GTACAGCCTG ACCTTCCTCG ACGAGGGCGC CACGGTCTGG
GATGTGGCCT TCAGCCCGGA CGGCCGCCGG CTCGCGTCGG TCTCGGTGGA CGGCTCGCTG
CGGGTCTGGG AGATGCCCGG CCCGGCGCTG ATCGGGCGCA ACGGCGCGCA GGAGGACGCC
GTCGTCAACC CGGTCACCGG CATCGTAGCG ATCACCACCG ACAAGGCGGT CGAGCTGTGG
GACGTCGATG ATCCTTACGC GCCCCGCCGA CTGCACGTGC TCACCGACGT GATCGTGGAC
GAGTACGACC CGACGGGCTC GTCGGCCTTC AGCCCGGACG GGAACGTGCT CGCGGTGGGC
ACGGGCAAGA ACATCGTCTT CTACGACGTC CGCGACCCTG CGAAGCCGTC CCGGATCTCG
GACGTGCCGG GGCCGGCCGG GGGCACCGCG GAGCTGTTGT TCAGCCCGGA CGGGAGGACC
CTGGCGCTCG GCGGCCTGAA CTCCCCGCCG GAGCCGGCAT TCCAGGCCAG GGTCGAGACC
TGGGACGTCA CCGATCTCTC CCGGCCGCGG CGGCTCGCGT CGCTGATCGC GCACCGCTCA
TCCGTCCGTG ACCTGACCTT CTCACCGGAC GGCCGGACCC TGGTGAGCGC GGCCGAACGA
TCGGTGAAGC TCTGGGATGT GACGGACCCG CGACGGCTCC GGCTGGTCTC GGAGCTGCCC
GAGTTCCCCG GCGGGGTCTG GGAGGTGCGC TTCTCCCCCG ATGGCAGGAC GCTCGCGGCC
GGCGGGGCCA ACCCGTTCGC GACGCTGTGG GACGTGACCC GCATGGACGC GCCGCGCCAG
ATCGCGGACC TGCCGGGCCA TTCGGCGTCC GTGACCAGCG TCGCGTTCAG CCCGGACGGC
ACGCAGCTCG CCACTGGCAG CAACGACAAC ACCGTACGGA TATGGGACGT TACGGAACAC
GACTCCCCGA CGCTCATCGA GAAGCTCGCA CGCTCGGCGG GCAGCGAGGC CGGTATCGAG
GAGATCCTCT ATACCCGGGA CGGCGAGAAG CTGGTCGGCG TGATCTTCAC CGTGCCCGCG
GTGGTGTGGG ACCTCGATGT CAACCGGGTG CGGGCCCGGA TCTGCGAGCG GGCGGGCGTG
GGCATCACGG CCGCGGAGTG GCGCCGTTTC CTACCGGATC TGCCCTACGA TCCGGTCTGT
GACTGA
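
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind summarised by the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = (
    "ATGACATCCG AGCGTTCGGA GCGCGGCGGA CCGCACGGTT CCGGTGAGGA TTCGCCGGGC"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full sequence would give the whole-gene value.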
 
Protein sequence
MTSERSERGG PHGSGEDSPG AAARPDQRTS RTVAGNAAEM RTPPLVESQA APTQIIDGYP 
SKPPTSVPPA GAYSTACPYP GLRGFDESSE QWFFGRERMV ADLVARVSSE ASRVGPLVLV
GASGSGKSSL LRAGLLPALA RGALPGSRGW PRLVMTPGEH PLEALVQRVV EATGMPTMAR
MLGDSLRREP ERLSEIVREL LAARATPRSA PEGGETTIRR GPRNDATTLY RPGNPHEPAD
PREPRESAAP HEPRELSASR APLKPAVPHE PPTPRKAHEP RKAHEPHAPH EPRMVIAVDQ
FEEVFTLCAD QAEREAFVRA LCATAAGSAV VVIGLRADFY GACASFPELV EVLQVNQVVV
GPMAAADIRE IVVNPARAAG GDVEPGLVEL VLRDLGAAAG TGITGSDQGT GHGSSRSPEL
VSNLSSGFVA DPGSLPLLAH ALRATWFARA GEALTVADYL RVGGLTGAIA QTAEAAYTSL
DGAAQQAVRP LLMRMIRLGE NGADTRRRVR RAALLAEVPG PESATVLDAL VAARLVEADA
DGDQDGLQIA HEALLRSWPR LREWMDMDRA GAVALQQLRD AAEVWERGGR DPSYLFAGSR
LAAARDWMDD DLDVDATTRQ FFDESVRAEA EQQRAAARRT RRLRQLVAAL AVLLLVAASL
AGLTFQQSAS SGRARDQALS QRIATQAEAA RRNNPALAAQ LSLVALRTAD TPEARGAVLS
SFNGGSGVPT RYQAHTKSVG TVAYSRDGRL LATGSDDWTA AVWDAADPRR LTPLARIPDE
RGGGHGRAVK AVAFSRDGTV LATGGADGLA KLWNITDRAR PRLLATLPKA DSEVYGLAFD
PTSDRLAVGG YGKSAYIYDV SDPARPEKKG QLFLHLAQVV ALEFSPDGAF LIAGDEGGSA
LLWSVSDPSN PRPLKVLVED GGPSSDGAGA IRSITFGGDG HTVYTAGDGG YVRKFSGPDL
PRLEYDGRAG TGDAPMTGLA VDPVSGLVGV GGFRYVGVPI FDVDVDQYSL TFLDEGATVW
DVAFSPDGRR LASVSVDGSL RVWEMPGPAL IGRNGAQEDA VVNPVTGIVA ITTDKAVELW
DVDDPYAPRR LHVLTDVIVD EYDPTGSSAF SPDGNVLAVG TGKNIVFYDV RDPAKPSRIS
DVPGPAGGTA ELLFSPDGRT LALGGLNSPP EPAFQARVET WDVTDLSRPR RLASLIAHRS
SVRDLTFSPD GRTLVSAAER SVKLWDVTDP RRLRLVSELP EFPGGVWEVR FSPDGRTLAA
GGANPFATLW DVTRMDAPRQ IADLPGHSAS VTSVAFSPDG TQLATGSNDN TVRIWDVTEH
DSPTLIEKLA RSAGSEAGIE EILYTRDGEK LVGVIFTVPA VVWDLDVNRV RARICERAGV
GITAAEWRRF LPDLPYDPVC D
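
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch, the codon table below is built from the standard amino-acid string in TCAG codon order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. The `translate` helper is illustrative only.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon (TTT, TTC, TTA, ...) to its amino acid; '*' marks a stop.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases and last 6 bases of the gene sequence shown above.
print(translate("ATGACATCCG AGCGTTCGGA GCGCGGCGGA"))  # MTSERSERGG
print(translate("GACTGA"))                            # D*
```

Both fragments agree with the start (MTSERSERGG) and end (D, followed by the stop codon) of the listed protein sequence.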