Gene Franean1_2751 details


Gene Information

Locus tag: Franean1_2751
Symbol:
ID: 5671142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3255833
End bp: 3259900
Gene Length: 4068 bp
Protein Length: 1355 aa
Translation table: 11
GC content: 70%
IMG OID: 641241663
Product: WD-40 repeat-containing protein
Protein accession: YP_001507083
Protein GI: 158314575
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
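
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length fits that convention) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length_bp = gene_length(3255833, 3259900)
print(length_bp)                  # 4068
print(protein_length(length_bp))  # 1355
```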


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.865283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCACGG GGTACGGCCA ACCGACTACG GACCGGACCG GTCACACGAC CGATTTTTTC 
GTGTCGTACG CGGCGGCGGA TCAGGAGTGG GCGGAGTGGA TCGGCTGGCA GCTGGAGGCC
GCCGGCTACC GGATCAGACT GCGGGCCTGG GATTTCACCA GCGGTTCGAA CATCGTGACC
GAGACGCAGC GGGTGCTCGC AACCTCGGCG AAGATGATCG CGGTGATGTC GTCGGCGTAT
CTGGTCTCGG CGATGGAGAG CGCGCAGTGG CAGGCTGTCT GGGTTGACGA CCCGACCGGA
GCGAAGCGCC GACTGGTGGT CGTGCAGGTG CAGGACTGTC CGCAGCCAGG GCTGCTGCGT
CCGCTGGTCG GGGTGCCCCT GTTCGGCTTG GACGAGGACA CCGCCCGCGA GCGGCTGCTC
GGCGCGGCTG CGGCTTCCCG GCAGAAGCCG ACCGGTCCGC CGCCGTTTCC CGCCAGCGCA
GCGCCTGATT TCCCCGACCC GTATGTCACC GGCGGCGGTG AGCCGCGGCT GTCGCCGTTT
CCGGGACTGG CGGCGTTCGA CACGAATCGC GCTGCGGTGT TCCGGGGTCG GGAAGCGGCG
ACCCGTCATC TGGTGGACCG AACGCTCGCC CAGGCTGAGA CAGGCGGCCT GATCGTGGTG
GTCGGCCCGT CGGGGTGTGG TAAGTCGTCG CTGGTTGCTG CCGGCTTGGC CCCGCGCATG
GCGGGCGAGG CGGACTGGTT GGTGACGGCG CCGATGACCC CTGGCGACCA GCCGATACGG
GCGTTGGCGG TGGTGCTGGC GGACGCTGGT CGACGCACCG GTCTGGACTG GGACGCCGAG
ACGCTGACGC GCAGGCTGGC CGACCCCGCC GACGTCGGCG GGGTCGCCGC CGAGCTCCTC
ACCGCTGCCC GGCCGGCACG CTGGCTACTC CTGCTCATCG ACCAGGTCGA GGAGCTACTC
GTGCGTGCCA GCCCCGCTGA CCGGGACCGG TTTCTGATCC TGCTGGCCGC TGCGGACCGT
GCCCGGGTGC GAGTGGTGGC GACGCTGCGC TCGGAGTACC TCGACGCCCT CCTCGACGCG
ACCGCCCCGA TCGGGTTGTC CGTTCCTGCC GAGACGCTCC AGCCGCTGTC GCGGGACCTG
CTTCCGCTGG TGATCGCCGA ACCGGCACGG ATGTCCGGGC TGCACATCGA GGATGAACTC
GTCGCCCATA TGGTCGCCGA CACCGGCGAT GGCCTCGCCC TGCCTCTGCT CGCCTACACC
CTCCAACGCC TGCACCTTGC TGCCCAGGCT GTGGCGACCC ACGTGCTTTC CGCGGCGCTG
TACGAGCAGA TCGGCGGAGT TCAGCAAGCG CTCGTCGAGC ACGCCGAGGC CGCGTTGGCC
GCCGCGGCCG AGGCCACTGG CCGCACTCGA CAGCAGGTCT TGGCCGGGCT GCTGGGGCTC
GTCACCGTCG ACACCAGCGG GCGGCCTACG CGCAGGCGCG TCCCGCTCGG TCAGCTGTCC
GACGCCACGC GGGCCGCACT GGTCCCGTTC GTCACCGGCC GGCTGCTCGT TCTCGATGCC
ACCCCTGACG GGCCGGTCAC TGTCGAGGTC GCCCACGAGC GGCTCCTCGC CGCCTGGCCG
CCGCTCGCTC AGGCCATCGC CGACGACGCC GAACGCCTAC GCCAGCGCGG CCAGGTCGAG
ACCGCCGCCC GAGACTGGCA GCAGGCTGGC CGGCGACCGG CGCTGCTGTG GAGCTACAGC
AGAGCAACAT CCGTCCTCGA CGTCCTCATC CACGACGACC TCACCCTGGC CGGCCGGCAG
TTCCTGACCA TCAGCCGGCG TCAAGCCCGC CGCCGCTTCT TCACCGGGTT CACGCTCTTG
ACCGTCCTGG CTATGACGGC CACGGGCCTC GGGATCGCCG CCTACCTACA GAGGCAGACC
GCTGACGACC GTCGACGCAC CGCCGTCGCC GAGCGGCTTC TCACCCTCGC CGACAACAGG
CGCGACACCG ATCCCACTAC CGCCATGCGC CTCGCCGTCG CCGCCCACGC CATCGTGCCC
ACCCACGAGC AGGCCGCCCG CCGCAACCTC CTCCAGACCC TCATCGGCAC CCCCTACCTG
CGTACCACCC ACACGAGCGA CACCGGCCCC ATCGCAGCGT TGGCGGTCGG GCCAGGCGGG
TTGCTGGCCT CCGGCAGCGC CGACGGGACG ATGCGGCTGT GGGACACCAG CGACCCAGAC
TCGATTCGCC TCCTCGGTGC GCCGCCGCTC ACCGACACAG CCGGCACGCT CTCGCTAGCG
TCCGGTCCGG GCGGGTTGCT GGCCATTGGC GCCGGCGACG CGGTGCGGCT GTGGGATGCG
AGAACTCCGA CCGCGCGCCG CCTCCTCGGC ATCCCGCTCA CTGATTCCGA CGGGAGCGTG
GCGCTCGGTC CAGAGGGGTT GCTGGCTACC GGCGGCAGCG ACGGGGCGGT GCGGCTATGG
GATATCCACA ACCCGAACTC GGCCCGCCTC CTCGATACCC TCAGCCCTGC CGCCAGCGAC
GCCGCGCTCG AAGAGGGCGG CGGGCGCCTC GACGTGGAAC TGGCGTTCGG TCCGAGCGGA
CTGCTGGCCG CCAGCTTCGG CAGCGGGCAG GTGAGGCTTT GGGACATCAG TGACTCGTCC
TCGCCCCGCT CCCTCGGTAC GGCAGAACTG GTCGGCCCGG TCTACGCGGT GGCCTTCGGT
CCAGACGGTC TGCTGGCTAC CGGCGGCGGC GAGGGGACCG TGAGGCTCTG GGACACCAGC
GACCCGTCCT CGCCCCGCCT TCTCAACACA CTGCTCACGG GCGCGGTCAG CCTGGTCTCC
GCGGTGGCGT TCGGTCCAGA CGGACTGCTG GCCAGTGGCG ACAGCGACGG AGCCCTGCGG
CTGTGGGATA CCAAGCATCC GACCTCGCCT CGCCTTCTCG ACACCACGCT CACCGGCACG
GCCCGCGCAG TCACGATGAC CTACGGACCG GATGGCCTGC TGGCCACCGG CAGCGGTGAT
GGAATGGTCC AGCTGTGGAA CACCAGCCGA CATGGCCCGC CCCGCCTGCT TGGTACCCTC
CCCTCCGGCC TCACCGGCCG GTTCGCTGTC GACGCGGTGG CCTTCGGTCC AGACGGTCTG
CTGGCCACCA GCAGCTATGA CACGGGGGTC CGGCTGTGGG ACATCAGCGA CCCGACCTCG
CCCCGCCTCC TCAGCGCCCC GCGTACCGGC AGCGCCGACT CAATAGGCGA GGACAACGAG
GTTGCCTTCG GCCCAGGCGG TTTGTTGGCC AGCGTCTCCA GGTACGACGG GACGGTACGG
CTGTGGGATA CGAAAACTCC GGCCTCACCC CGCCTCCTCA GTACCTCGCG CGCCTCCACG
GAGCCGGGCT ACTTGTTCGG CGGCAGTTCT GACCTCGTAG CGTTCGGCCC AGACGGTTTG
CTTGCCAGCG GCGGCGGTAG TGACGGGGCG GTGCGGTTGT GGGACATTAG CGACCCGACC
TCGCCCCGCC TGCTCGGCAT CCCGCTCACC AGCCCCGTCA CCCCGGTCGA CCTGCGATCC
GGACCATCCA ACCCGGTCGG GGCAGTGGCG TTCGGCCCGG GCGGGCTGCT GGCCATCGGC
GCATCTGATG GGGCAGTGTG GCTGTGGGAC ACGAAGAACC CGACGTCGGC CCACCTCGTT
GGCGCCACGC TCTCATACTC CGCCTCGTAC CCCATGGACG GTTTCCCGAT CCGCACTGTA
TTGATCAGCC TAAACGGATT GTTGGCTATC GGCGGTGGTG ATGGAGTGGT GCGGTTGTGG
GACATTGGCG ACCCGACTTC GCCCCGCCAG CTCGGCACAC CGCTCACCGA CCCGAACGGT
GAGGCCATCT CGGCCAACGC AGCGGCCTTC GGTCCAGATG GGTTGTTGGC CACCGGCGGC
GACGACGGGG CAGTGCGGCT GTGGGACACC ACTGTGATCG AGCACATCCA AAGCGAGGCA
GTGCAGGTAG CCTGCCAACG TGCAGGCCGC GGTCTGGATC GACAAGAATG GGCCCGCTAC
CTGCCCGGCG AGCGATATCG GGAAACGTGC CCCACCGGCG CCCGGTAG
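
The gene sequence above is laid out in space-separated groups of ten bases, so it has to be joined before its length or GC content can be computed. A minimal sketch of that clean-up follows. The helper names are hypothetical, and the example runs only on the first two groups of the sequence; applying it to the full sequence would give values to compare with the Gene Length and GC content fields listed under Gene Information.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (spaces and line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base groups of the gene sequence, as displayed.
block = "ATGGTCACGG GGTACGGCCA"
seq = clean_sequence(block)
print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.65
```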
 
Protein sequence
MVTGYGQPTT DRTGHTTDFF VSYAAADQEW AEWIGWQLEA AGYRIRLRAW DFTSGSNIVT 
ETQRVLATSA KMIAVMSSAY LVSAMESAQW QAVWVDDPTG AKRRLVVVQV QDCPQPGLLR
PLVGVPLFGL DEDTARERLL GAAAASRQKP TGPPPFPASA APDFPDPYVT GGGEPRLSPF
PGLAAFDTNR AAVFRGREAA TRHLVDRTLA QAETGGLIVV VGPSGCGKSS LVAAGLAPRM
AGEADWLVTA PMTPGDQPIR ALAVVLADAG RRTGLDWDAE TLTRRLADPA DVGGVAAELL
TAARPARWLL LLIDQVEELL VRASPADRDR FLILLAAADR ARVRVVATLR SEYLDALLDA
TAPIGLSVPA ETLQPLSRDL LPLVIAEPAR MSGLHIEDEL VAHMVADTGD GLALPLLAYT
LQRLHLAAQA VATHVLSAAL YEQIGGVQQA LVEHAEAALA AAAEATGRTR QQVLAGLLGL
VTVDTSGRPT RRRVPLGQLS DATRAALVPF VTGRLLVLDA TPDGPVTVEV AHERLLAAWP
PLAQAIADDA ERLRQRGQVE TAARDWQQAG RRPALLWSYS RATSVLDVLI HDDLTLAGRQ
FLTISRRQAR RRFFTGFTLL TVLAMTATGL GIAAYLQRQT ADDRRRTAVA ERLLTLADNR
RDTDPTTAMR LAVAAHAIVP THEQAARRNL LQTLIGTPYL RTTHTSDTGP IAALAVGPGG
LLASGSADGT MRLWDTSDPD SIRLLGAPPL TDTAGTLSLA SGPGGLLAIG AGDAVRLWDA
RTPTARRLLG IPLTDSDGSV ALGPEGLLAT GGSDGAVRLW DIHNPNSARL LDTLSPAASD
AALEEGGGRL DVELAFGPSG LLAASFGSGQ VRLWDISDSS SPRSLGTAEL VGPVYAVAFG
PDGLLATGGG EGTVRLWDTS DPSSPRLLNT LLTGAVSLVS AVAFGPDGLL ASGDSDGALR
LWDTKHPTSP RLLDTTLTGT ARAVTMTYGP DGLLATGSGD GMVQLWNTSR HGPPRLLGTL
PSGLTGRFAV DAVAFGPDGL LATSSYDTGV RLWDISDPTS PRLLSAPRTG SADSIGEDNE
VAFGPGGLLA SVSRYDGTVR LWDTKTPASP RLLSTSRAST EPGYLFGGSS DLVAFGPDGL
LASGGGSDGA VRLWDISDPT SPRLLGIPLT SPVTPVDLRS GPSNPVGAVA FGPGGLLAIG
ASDGAVWLWD TKNPTSAHLV GATLSYSASY PMDGFPIRTV LISLNGLLAI GGGDGVVRLW
DIGDPTSPRQ LGTPLTDPNG EAISANAAAF GPDGLLATGG DDGAVRLWDT TVIEHIQSEA
VQVACQRAGR GLDRQEWARY LPGERYRETC PTGAR
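
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last few codons only. It builds the codon table from the usual NCBI ordering of bases (T, C, A, G); as far as the amino acid assignments go, translation table 11 matches the standard code, and that is the assumption made here. The `translate` helper is hypothetical and does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence, as displayed.
print(translate("ATGGTCACGG GGTACGGC"))  # MVTGYG
# Last 9 bases of the gene sequence: two codons followed by the stop codon TAG.
print(translate("GCCCGGTAG"))            # AR
```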