Gene Information |
Locus tag | Franean1_2751 |
Symbol | |
ID | 5671142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3255833 |
End bp | 3259900 |
Gene Length | 4068 bp |
Protein Length | 1355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241663 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001507083 |
Protein GI | 158314575 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
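The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3255833
END_BP = 3259900
GENE_LENGTH_BP = 4068
PROTEIN_LENGTH_AA = 1355


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported 4068 bp and 1355 aa agree with the coordinates: 4068 / 3 gives 1356 codons, one of which is the stop codon.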
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.865283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACGG GGTACGGCCA ACCGACTACG GACCGGACCG GTCACACGAC CGATTTTTTC GTGTCGTACG CGGCGGCGGA TCAGGAGTGG GCGGAGTGGA TCGGCTGGCA GCTGGAGGCC GCCGGCTACC GGATCAGACT GCGGGCCTGG GATTTCACCA GCGGTTCGAA CATCGTGACC GAGACGCAGC GGGTGCTCGC AACCTCGGCG AAGATGATCG CGGTGATGTC GTCGGCGTAT CTGGTCTCGG CGATGGAGAG CGCGCAGTGG CAGGCTGTCT GGGTTGACGA CCCGACCGGA GCGAAGCGCC GACTGGTGGT CGTGCAGGTG CAGGACTGTC CGCAGCCAGG GCTGCTGCGT CCGCTGGTCG GGGTGCCCCT GTTCGGCTTG GACGAGGACA CCGCCCGCGA GCGGCTGCTC GGCGCGGCTG CGGCTTCCCG GCAGAAGCCG ACCGGTCCGC CGCCGTTTCC CGCCAGCGCA GCGCCTGATT TCCCCGACCC GTATGTCACC GGCGGCGGTG AGCCGCGGCT GTCGCCGTTT CCGGGACTGG CGGCGTTCGA CACGAATCGC GCTGCGGTGT TCCGGGGTCG GGAAGCGGCG ACCCGTCATC TGGTGGACCG AACGCTCGCC CAGGCTGAGA CAGGCGGCCT GATCGTGGTG GTCGGCCCGT CGGGGTGTGG TAAGTCGTCG CTGGTTGCTG CCGGCTTGGC CCCGCGCATG GCGGGCGAGG CGGACTGGTT GGTGACGGCG CCGATGACCC CTGGCGACCA GCCGATACGG GCGTTGGCGG TGGTGCTGGC GGACGCTGGT CGACGCACCG GTCTGGACTG GGACGCCGAG ACGCTGACGC GCAGGCTGGC CGACCCCGCC GACGTCGGCG GGGTCGCCGC CGAGCTCCTC ACCGCTGCCC GGCCGGCACG CTGGCTACTC CTGCTCATCG ACCAGGTCGA GGAGCTACTC GTGCGTGCCA GCCCCGCTGA CCGGGACCGG TTTCTGATCC TGCTGGCCGC TGCGGACCGT GCCCGGGTGC GAGTGGTGGC GACGCTGCGC TCGGAGTACC TCGACGCCCT CCTCGACGCG ACCGCCCCGA TCGGGTTGTC CGTTCCTGCC GAGACGCTCC AGCCGCTGTC GCGGGACCTG CTTCCGCTGG TGATCGCCGA ACCGGCACGG ATGTCCGGGC TGCACATCGA GGATGAACTC GTCGCCCATA TGGTCGCCGA CACCGGCGAT GGCCTCGCCC TGCCTCTGCT CGCCTACACC CTCCAACGCC TGCACCTTGC TGCCCAGGCT GTGGCGACCC ACGTGCTTTC CGCGGCGCTG TACGAGCAGA TCGGCGGAGT TCAGCAAGCG CTCGTCGAGC ACGCCGAGGC CGCGTTGGCC GCCGCGGCCG AGGCCACTGG CCGCACTCGA CAGCAGGTCT TGGCCGGGCT GCTGGGGCTC GTCACCGTCG ACACCAGCGG GCGGCCTACG CGCAGGCGCG TCCCGCTCGG TCAGCTGTCC GACGCCACGC GGGCCGCACT GGTCCCGTTC GTCACCGGCC GGCTGCTCGT TCTCGATGCC ACCCCTGACG GGCCGGTCAC TGTCGAGGTC GCCCACGAGC GGCTCCTCGC CGCCTGGCCG CCGCTCGCTC AGGCCATCGC CGACGACGCC GAACGCCTAC GCCAGCGCGG CCAGGTCGAG ACCGCCGCCC GAGACTGGCA GCAGGCTGGC CGGCGACCGG CGCTGCTGTG GAGCTACAGC AGAGCAACAT CCGTCCTCGA CGTCCTCATC CACGACGACC TCACCCTGGC CGGCCGGCAG 
TTCCTGACCA TCAGCCGGCG TCAAGCCCGC CGCCGCTTCT TCACCGGGTT CACGCTCTTG ACCGTCCTGG CTATGACGGC CACGGGCCTC GGGATCGCCG CCTACCTACA GAGGCAGACC GCTGACGACC GTCGACGCAC CGCCGTCGCC GAGCGGCTTC TCACCCTCGC CGACAACAGG CGCGACACCG ATCCCACTAC CGCCATGCGC CTCGCCGTCG CCGCCCACGC CATCGTGCCC ACCCACGAGC AGGCCGCCCG CCGCAACCTC CTCCAGACCC TCATCGGCAC CCCCTACCTG CGTACCACCC ACACGAGCGA CACCGGCCCC ATCGCAGCGT TGGCGGTCGG GCCAGGCGGG TTGCTGGCCT CCGGCAGCGC CGACGGGACG ATGCGGCTGT GGGACACCAG CGACCCAGAC TCGATTCGCC TCCTCGGTGC GCCGCCGCTC ACCGACACAG CCGGCACGCT CTCGCTAGCG TCCGGTCCGG GCGGGTTGCT GGCCATTGGC GCCGGCGACG CGGTGCGGCT GTGGGATGCG AGAACTCCGA CCGCGCGCCG CCTCCTCGGC ATCCCGCTCA CTGATTCCGA CGGGAGCGTG GCGCTCGGTC CAGAGGGGTT GCTGGCTACC GGCGGCAGCG ACGGGGCGGT GCGGCTATGG GATATCCACA ACCCGAACTC GGCCCGCCTC CTCGATACCC TCAGCCCTGC CGCCAGCGAC GCCGCGCTCG AAGAGGGCGG CGGGCGCCTC GACGTGGAAC TGGCGTTCGG TCCGAGCGGA CTGCTGGCCG CCAGCTTCGG CAGCGGGCAG GTGAGGCTTT GGGACATCAG TGACTCGTCC TCGCCCCGCT CCCTCGGTAC GGCAGAACTG GTCGGCCCGG TCTACGCGGT GGCCTTCGGT CCAGACGGTC TGCTGGCTAC CGGCGGCGGC GAGGGGACCG TGAGGCTCTG GGACACCAGC GACCCGTCCT CGCCCCGCCT TCTCAACACA CTGCTCACGG GCGCGGTCAG CCTGGTCTCC GCGGTGGCGT TCGGTCCAGA CGGACTGCTG GCCAGTGGCG ACAGCGACGG AGCCCTGCGG CTGTGGGATA CCAAGCATCC GACCTCGCCT CGCCTTCTCG ACACCACGCT CACCGGCACG GCCCGCGCAG TCACGATGAC CTACGGACCG GATGGCCTGC TGGCCACCGG CAGCGGTGAT GGAATGGTCC AGCTGTGGAA CACCAGCCGA CATGGCCCGC CCCGCCTGCT TGGTACCCTC CCCTCCGGCC TCACCGGCCG GTTCGCTGTC GACGCGGTGG CCTTCGGTCC AGACGGTCTG CTGGCCACCA GCAGCTATGA CACGGGGGTC CGGCTGTGGG ACATCAGCGA CCCGACCTCG CCCCGCCTCC TCAGCGCCCC GCGTACCGGC AGCGCCGACT CAATAGGCGA GGACAACGAG GTTGCCTTCG GCCCAGGCGG TTTGTTGGCC AGCGTCTCCA GGTACGACGG GACGGTACGG CTGTGGGATA CGAAAACTCC GGCCTCACCC CGCCTCCTCA GTACCTCGCG CGCCTCCACG GAGCCGGGCT ACTTGTTCGG CGGCAGTTCT GACCTCGTAG CGTTCGGCCC AGACGGTTTG CTTGCCAGCG GCGGCGGTAG TGACGGGGCG GTGCGGTTGT GGGACATTAG CGACCCGACC TCGCCCCGCC TGCTCGGCAT CCCGCTCACC AGCCCCGTCA CCCCGGTCGA CCTGCGATCC GGACCATCCA ACCCGGTCGG GGCAGTGGCG TTCGGCCCGG GCGGGCTGCT GGCCATCGGC GCATCTGATG 
GGGCAGTGTG GCTGTGGGAC ACGAAGAACC CGACGTCGGC CCACCTCGTT GGCGCCACGC TCTCATACTC CGCCTCGTAC CCCATGGACG GTTTCCCGAT CCGCACTGTA TTGATCAGCC TAAACGGATT GTTGGCTATC GGCGGTGGTG ATGGAGTGGT GCGGTTGTGG GACATTGGCG ACCCGACTTC GCCCCGCCAG CTCGGCACAC CGCTCACCGA CCCGAACGGT GAGGCCATCT CGGCCAACGC AGCGGCCTTC GGTCCAGATG GGTTGTTGGC CACCGGCGGC GACGACGGGG CAGTGCGGCT GTGGGACACC ACTGTGATCG AGCACATCCA AAGCGAGGCA GTGCAGGTAG CCTGCCAACG TGCAGGCCGC GGTCTGGATC GACAAGAATG GGCCCGCTAC CTGCCCGGCG AGCGATATCG GGAAACGTGC CCCACCGGCG CCCGGTAG
Protein sequence | MVTGYGQPTT DRTGHTTDFF VSYAAADQEW AEWIGWQLEA AGYRIRLRAW DFTSGSNIVT ETQRVLATSA KMIAVMSSAY LVSAMESAQW QAVWVDDPTG AKRRLVVVQV QDCPQPGLLR PLVGVPLFGL DEDTARERLL GAAAASRQKP TGPPPFPASA APDFPDPYVT GGGEPRLSPF PGLAAFDTNR AAVFRGREAA TRHLVDRTLA QAETGGLIVV VGPSGCGKSS LVAAGLAPRM AGEADWLVTA PMTPGDQPIR ALAVVLADAG RRTGLDWDAE TLTRRLADPA DVGGVAAELL TAARPARWLL LLIDQVEELL VRASPADRDR FLILLAAADR ARVRVVATLR SEYLDALLDA TAPIGLSVPA ETLQPLSRDL LPLVIAEPAR MSGLHIEDEL VAHMVADTGD GLALPLLAYT LQRLHLAAQA VATHVLSAAL YEQIGGVQQA LVEHAEAALA AAAEATGRTR QQVLAGLLGL VTVDTSGRPT RRRVPLGQLS DATRAALVPF VTGRLLVLDA TPDGPVTVEV AHERLLAAWP PLAQAIADDA ERLRQRGQVE TAARDWQQAG RRPALLWSYS RATSVLDVLI HDDLTLAGRQ FLTISRRQAR RRFFTGFTLL TVLAMTATGL GIAAYLQRQT ADDRRRTAVA ERLLTLADNR RDTDPTTAMR LAVAAHAIVP THEQAARRNL LQTLIGTPYL RTTHTSDTGP IAALAVGPGG LLASGSADGT MRLWDTSDPD SIRLLGAPPL TDTAGTLSLA SGPGGLLAIG AGDAVRLWDA RTPTARRLLG IPLTDSDGSV ALGPEGLLAT GGSDGAVRLW DIHNPNSARL LDTLSPAASD AALEEGGGRL DVELAFGPSG LLAASFGSGQ VRLWDISDSS SPRSLGTAEL VGPVYAVAFG PDGLLATGGG EGTVRLWDTS DPSSPRLLNT LLTGAVSLVS AVAFGPDGLL ASGDSDGALR LWDTKHPTSP RLLDTTLTGT ARAVTMTYGP DGLLATGSGD GMVQLWNTSR HGPPRLLGTL PSGLTGRFAV DAVAFGPDGL LATSSYDTGV RLWDISDPTS PRLLSAPRTG SADSIGEDNE VAFGPGGLLA SVSRYDGTVR LWDTKTPASP RLLSTSRAST EPGYLFGGSS DLVAFGPDGL LASGGGSDGA VRLWDISDPT SPRLLGIPLT SPVTPVDLRS GPSNPVGAVA FGPGGLLAIG ASDGAVWLWD TKNPTSAHLV GATLSYSASY PMDGFPIRTV LISLNGLLAI GGGDGVVRLW DIGDPTSPRQ LGTPLTDPNG EAISANAAAF GPDGLLATGG DDGAVRLWDT TVIEHIQSEA VQVACQRAGR GLDRQEWARY LPGERYRETC PTGAR
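As a spot check that the gene and protein sequences correspond, the first and last few codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: it builds the codon table by hand in the usual TCAG ordering, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by
# translation tables 1 and 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 18 nucleotides, copied from the Gene sequence field.
GENE_START = "ATGGTCACGG GGTACGGCCA ACCGACTACG"
GENE_END = "CCCACCGGCG CCCGGTAG"

start_peptide = translate(GENE_START)
end_peptide = translate(GENE_END)

print(start_peptide)  # compare with the first 10 residues of the protein
print(end_peptide)    # compare with the last 5 residues, plus the stop
```

The translated start matches the leading `MVTGYGQPTT` of the protein sequence, and the translated end gives `PTGAR` followed by a stop codon (TAG), matching the protein's C-terminus.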