Gene Franean1_4252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4252 
Symbol 
ID5672607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5068674 
End bp5072948 
Gene Length4275 bp 
Protein Length1424 aa 
Translation table11 
GC content70% 
IMG OID641243125 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001508542 
Protein GI158316034 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.871737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGGA TTTTTGTCAG CTTTCACGCG GCCGAGGGCG GGGCGCTGGC TGACCGGGTA 
GTGCGTGCGC TCGAAGCGGC CGGCTACACC GACTACTACA ACTATCGGGT GCCGGGGCAC
CGGACTGCTC CGGGGGCGGC ATGGGCGCCG GACATCAGGA GAAACCTTCT GGTGAGCGAG
GCGTTCATCG CCATCACGAC CCCGAAAGCG CTTGACGAGT GGTGTTCAAC CGAGGTCGCG
ATCTTCCGTG AACGCAAGCC GCGTGCCCCG TTCCTCGAGA TCGTCGTGGG CGAGGCCTCC
CGACGTGCGC TGACCGGAGC CCTGCAGGGA ATCCGGGTCC GCCCGGACAA CGAGGAGTCC
GTCGCGGCCG CGTTAGCCAC GGTGATCGCG TTCCTGCAGG CGCGCGGGGT CGGGACCGGG
CCCGGTCGAA CCTCGCCCTA CCCCGGCCTT GCGGCATTCG GGGAGCGGGA CGCCGCGGTC
TTCTTCGGCC GGGATGACGA CATCGGCAGG CTGGCGGCCG CCATCGAGCC GCGTGCGCAC
GGCGCCATGG CGGTCGTCGG CCCTTCCGGG GTTGGAAAGT CGTCGCTGGT TCTGGCCGGC
CTGGTGCCTC GGCTGCGTGG TATGCCCTCG TCCGCCGCAC GGACAGCGCG CTGGCATGTC
CTCGGCCCGG TCACGCCGAG CACGGGCGCG GTTCGCGGGC TCGCCGAGGC GCTGTCGATC
CAGCGCGCCG AGGTCGGCCT GCCCCCGGTC CCCGCGGCGA CACTCGAGGC GCGCATCCGC
GCCGACCGCG TTGCTCTGGC CGCCATGCTG GACGAACTCA TAGCCGCGGC GTCCGCCCGA
CCGCCCGGCG GACAGATACG TGTGCTCATT GTCCTCGACC AGGCCGAGGA GCTCGTGTTG
GCGGCATCCG AGCCGGTCCG GGGGCGGCCC GAGAAAAGGC CGGATGAGCG GTGGAACGAG
TCGTCCGGCG GACGGTCCGA CGACCTGTCA GTCCAGCAGG CGGAGGCCGT GGTGCTGACC
GAGGCGCTCG TCGAGGCTTC GTACCGCAAC GCCTGGCTGG TATACACGGT GCGGTCCGAC
TATCTCGACG ACCTGCTTCG CGGGACGACG TTTCGTGCGC TGATGCAGCA GGCCCATCTG
GTCCGTCCTC TGGATCGGCG CGAGTTGCTC GCTGTGGTCA GTGAGCCCGT CCATCAGCTG
GGCTGGTCGT ATGAGACGGA TGCGCTGAGC GCGATCGTCG AGGACGGTGC CCGCGGCTCG
CTACCAATGC TGGCGTACGC CTTGGCGCGA CTGTGGGACC GGGTCAACCC GGGCGGGGAC
CGGGAGCCGC GCGCGATCCG CCGGGGCGAG TACGACCGCG CCGGCCGGGT GCAGGACGTG
CTGCGCGAAC AGGCCGAAGC GGCGCTGGCG GAGGCGCTCG AGCAGATCGG CCGGGCCGTC
CTCGGCCGTT CAGGCGCGAC GGTCGACCGG CGCGCCGCAG AGCATCGCAT GCTCGACCTC
CTCAGCAGGC TGGCGTCGGT CGGCCCGGAC CGGGGCTTTA CCCGGCGGCC GTTGATGATC
ACGGAAATTG GCGACGAGGA GCTTGGACTG CTAAGGCCGT TCGTCGAGCG CCGAGTGCTG
ACCACCACCC GGTCGCTGTT TGTCGCCCAG GGCACGGAAC AACGGGCTGG GGAGCTCGCG
GAACGGGTCC TCGGCATCGC GGACGTCGGC GACAACGCTC TGGAGGTCGC ACACGAGAGC
CTGTTCGGCG ACTGGCCGCG GCTGCGTGCC CACCTCGGTC GGCAGCGTGA GGCCCTGCGG
GCCCGCGGTG AGCTGGAGGA ACTCGCGGCC GCCTGGCTGG CGGGTGGTAA GCGGGACGAG
GACCTCATAG GCCAGTCCCG TGTCCAGTTC CTGCTCGCTG ACCTCCTGCC GTCCCGGGAC
GTGCGGGGCG ATCAGACGGT GACGGCGTGG GGATGGCCGA ACCTGGCGGA CGCACTGGTC
ACGGCTGGCA TCTCCGCTGA CGCCGTCCAG CTGGCTGGGC TGTCAATCCA GCGGGTTCTG
GGCCGGCTCG TCCAGGAGGC GGTGCTGCTG GAGCGGCCGG AGGACGGCCT GCGTCTTCTA
GTCGGCGCGG GAACCAACTC GGACCCCGAG ACACTGGCTG TTCAGCTGTC CGCGCCGAGC
CTCGAGGGCT GGCGGGCAGC GGTGCACCGA GCCGAGGCTC AGCTGCGGAG CACCAGGCTG
CTGACAGACC GTTCCGTCGG CCGTTCCGCG GGTATGTGGG GAGTCGCCTG GTCGCCGGAC
GGATCGCGGG TGGCCACGGG CTCACGCGAT GGCGTCATCC GGATCTGGGA TCTGAGCGCC
GGCGACCTCG TCAACGCGTT CGCGCACGGC GAGGACCAGA TTGAGCAGGT CGATGGATGG
GTGCGTTCGG TTGCCTGGTC ACCGTCCGCG GATCTGGTGG CATCCGCCTC GACCGACGAG
ACGACACGAG TGTGGTCACT TGACACGGGG CGAGAGGTCC GGTCGTTCCG CCTGCCGGAC
CGTCCCTGGG CAGTCCGGTT CAGCCCGGCC GGCGACCAGG TGCTGACCGC CTGTGCCTCG
GGCCATGCGG TGCTCTGGTC CACCGAGCGG CGCAGTCGTC GCCCGGACTA CGAGCTTCTC
AGCAAGCCAT CCGAGGCGGC CACGGCGGAG GCCGGTCCGA CCGAGGGGGG CCAGTCTGAT
GGAGGCTCTC GGGGTGGGGG CCCTCGGAGC CGGTCGGACC CGGAACGCGG CATCCGAATC
TGGGACGCCG ACATCGTGAC CGGGGACGTC ACGCGGATCG CGACGGCCTG CGAGGACGGG
GTGGTCGATC TGTGGACCCT GCCCTCGGAT CGGGACGCCG ATGTCCCGGC GCCCCGCCGC
ATCGCGGCGC ATCCCGGTGT AACCGTTCGG TCTGTGCGCT TCAGCCCGGG CGGCGGGCGG
ATTGCGACCG CGGGGCAGGA CAACATCATC AAGGTTTTCG AGATCGAAGG CGGAGTCGAG
CTGCGCCGGA TGTCCGGCCA CACGGACCAG GTCCGCAGGG TCGCGTGGTC ACCGCTCGGC
AATCGGGTTG CCAGCGCGTC GGCCGATACC TCCATCGGAG TGTGGGACGT CCATTCCGGC
CGGAGGGTCC TCACTCTGCG CGGTCACCGG CAGGGCGTCT GCGACGTGGC CTGGTCCGCC
GCCGGCGACC AGCTGATCTC CGTCTCGGAC GACGGGACCG CTCGGGTGTG GAAAATCGGC
GCGGTCGAGA CGGCGTTGGT CCTGGTCGGG GCGCCGGCCA GTGCGATGGA CCGGTCACGG
GTCACGGGTG AAATCGCGGT GGCCGTTGCC GCGGGCCCGA CCAGCGCTAC CGGCAACAGT
TTTGATGTCG TCGTGCTGCG GCCGGACGGC ACGCGGACGC GGACGCTCTC CGCCGCGCAC
CGGAACACGA TTCGGTCTGT GTCGTGGTCT CCGTCAGGCA CCCGGCTGCT GACCGCGTCA
CGGGACAACA CCGCCCGGAT CTGGCGCGTC GACAACGTGG ACTACGCGCT GGAAAACACG
CTGTACGCCC GCGAGGGCGT GGAGGACGCC CGGTGGTCGC CGGACGGCGT CGAACTGGTC
ACCGCTGCCC GCGACCGGGT GGTCCGTAGA TACCGTGCCG ACGGTGAGTG GCTCGACCGG
GACGAGCGGC GCCCGCACCC GAACTTCCTG CGAGCGGTGG CATGGCACCC GAGCCGTCCC
GAGTTCGCAT TGGCGGCGGA GGACAACCGG CTCACCATCA ACACCGCGGA GGCGACCGTG
GACGAGCATC GCACTGACAA GGTGCTCACC TCCGTTGCCT GGAGCCCGAC CGGGGAGAAG
CTCGCCCTTG GCGCGACAGA CGGCACGGTG CTGGTGCTCG GCGCCGCGGA CGGAAGGATA
GCCGGGGAGC TGCACCGACT CGTCGGTCAT GAAGGTGAGA TCAGTACGGT CGACTGGTCC
ACGGACGGCG AGCGCATCGT CACCTCGTCC GCGGACCACA CGGCGTCGGT ATGGGACTCA
CACACCGGAG CACGGTTGAC CTCGCTGGTC GGGCATACTG GCCCGGTTGT CGGGGCGGTG
TGGGCCGTCG ATGACGCCAG CACCACGAAC ATGGTCGTCA CCACCGCGTC CGCCGACGGG
ACGATTCGGA CCTGGGACGT CTCCGATATC AGGCGGACGC CGCTCAGCGG CCTGCCGGAC
ACGGAGCCCG GCGAGCCCTC GGCCCACGCT ACGGACCAGC TGATACGCGA CGCGCGCGTG
CGCCTCGGTC GATAG
 
Protein sequence
MGRIFVSFHA AEGGALADRV VRALEAAGYT DYYNYRVPGH RTAPGAAWAP DIRRNLLVSE 
AFIAITTPKA LDEWCSTEVA IFRERKPRAP FLEIVVGEAS RRALTGALQG IRVRPDNEES
VAAALATVIA FLQARGVGTG PGRTSPYPGL AAFGERDAAV FFGRDDDIGR LAAAIEPRAH
GAMAVVGPSG VGKSSLVLAG LVPRLRGMPS SAARTARWHV LGPVTPSTGA VRGLAEALSI
QRAEVGLPPV PAATLEARIR ADRVALAAML DELIAAASAR PPGGQIRVLI VLDQAEELVL
AASEPVRGRP EKRPDERWNE SSGGRSDDLS VQQAEAVVLT EALVEASYRN AWLVYTVRSD
YLDDLLRGTT FRALMQQAHL VRPLDRRELL AVVSEPVHQL GWSYETDALS AIVEDGARGS
LPMLAYALAR LWDRVNPGGD REPRAIRRGE YDRAGRVQDV LREQAEAALA EALEQIGRAV
LGRSGATVDR RAAEHRMLDL LSRLASVGPD RGFTRRPLMI TEIGDEELGL LRPFVERRVL
TTTRSLFVAQ GTEQRAGELA ERVLGIADVG DNALEVAHES LFGDWPRLRA HLGRQREALR
ARGELEELAA AWLAGGKRDE DLIGQSRVQF LLADLLPSRD VRGDQTVTAW GWPNLADALV
TAGISADAVQ LAGLSIQRVL GRLVQEAVLL ERPEDGLRLL VGAGTNSDPE TLAVQLSAPS
LEGWRAAVHR AEAQLRSTRL LTDRSVGRSA GMWGVAWSPD GSRVATGSRD GVIRIWDLSA
GDLVNAFAHG EDQIEQVDGW VRSVAWSPSA DLVASASTDE TTRVWSLDTG REVRSFRLPD
RPWAVRFSPA GDQVLTACAS GHAVLWSTER RSRRPDYELL SKPSEAATAE AGPTEGGQSD
GGSRGGGPRS RSDPERGIRI WDADIVTGDV TRIATACEDG VVDLWTLPSD RDADVPAPRR
IAAHPGVTVR SVRFSPGGGR IATAGQDNII KVFEIEGGVE LRRMSGHTDQ VRRVAWSPLG
NRVASASADT SIGVWDVHSG RRVLTLRGHR QGVCDVAWSA AGDQLISVSD DGTARVWKIG
AVETALVLVG APASAMDRSR VTGEIAVAVA AGPTSATGNS FDVVVLRPDG TRTRTLSAAH
RNTIRSVSWS PSGTRLLTAS RDNTARIWRV DNVDYALENT LYAREGVEDA RWSPDGVELV
TAARDRVVRR YRADGEWLDR DERRPHPNFL RAVAWHPSRP EFALAAEDNR LTINTAEATV
DEHRTDKVLT SVAWSPTGEK LALGATDGTV LVLGAADGRI AGELHRLVGH EGEISTVDWS
TDGERIVTSS ADHTASVWDS HTGARLTSLV GHTGPVVGAV WAVDDASTTN MVVTTASADG
TIRTWDVSDI RRTPLSGLPD TEPGEPSAHA TDQLIRDARV RLGR