Gene Information |
Locus tag | Franean1_1551 |
Symbol | |
ID | 5669954 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1854025 |
End bp | 1857204 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240470 |
Product | zinc finger SWIM domain-containing protein |
Protein accession | YP_001505896 |
Protein GI | 158313388 |
COG category | |
COG ID | |
TIGRFAM ID | |
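The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Franean1_1551.
length = gene_length_bp(1854025, 1857204)
print(length)                              # 3180
print(expected_protein_length_aa(length))  # 1059
```

Both results match the "Gene Length" and "Protein Length" fields of the record.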
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.712758 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGGCCGC AGGCCTCGCT GTGCCTTCGA ACGCCGGTTC GAATGTGGGT GCGGGGCCGC GAGGACGGAA CGGAACTGTC GGACCCCGTA GGCAGAATCG CCGTCGTGTC GAACACCACC AGTCCGCCCG TCGAGCGCGG GCAGGCGCTC GCCCTGGCGC CGGATCAGGC GTCCGTGAAG GCGGGCGAGA GGCTCGCGGT CGCCGGGTCC TGGCCGCTGG CGGGCGGCGA CGCCGAGGCG CTGTGGGGTG AGTGCAGGGG CAGCGGAAAG TCGCCCTACC GGGTCGTCGT GGCCCTCGCG GATCATGCCT CGAAGTGCTC GTGCCCGAGC CGGAAGTTCC CCTGCAAGCA CGCCCTCGGC CTGATGCTGC TCGCCGCGGC CGGCGGCGTG GCCGGTGCCG GCGGCGGGCG TCCGCCGTGG GCCGCGGAAT GGCTCGACCG GCGGGTGGCG CGCTTCGCCG CGGCGGCGAG CGCGGGCACG CGGGCCGCCA CCGGGCCTGA GTCCGAGCGC TCCGCGGCGG GGGCGGCCCG GCGCGAGGCG AGCCGTGCGG CCAAGGTCGA CGCCGGTGTC GCGGAGCTGG CCCGCTGGCT GTCCGACCTG GCCGGGGAGG GGCTCGGAGC CGCCCAGGCC AGGCCGGCCG ACTGGTGGCG GGCCGCCGCC GCCCGCATGG TCGACGCGCA GGCACCCGGC CTCGCCGAGA TGATCCACGA GGCCGCGCAG ATCGCCGGGT CGGCGGCCCG GCGGCCTGAC TGGCCGTCCC GGCTGGTCGA CCGTGTCGGG CTGCTGCACC TGCTGTGCGA GGGCTGGGCC CGCCGCGCCG ACCTGCCCGC CGACGTCGTG GCGGTCCTGC GCGACCGGAT CGGTTTCACC GTTCCCGTGG CCACTGTGCT CGCCGGCGAG CACATCGTCG GCGAGGTCGA CGTGCTGGGG GCGCACGAGT TCGGCGCGGG GCGGGCGCGG GGGCGCCGGC AGTGGCTACG TCTGGTCGAG TCGGGGCGGC TGGCGGTGCT CGTCGACTTC GCGGTGAACG GCCAGGGGAT GCCACCGCCG CTGCCCCGGG GCTCGCGGGT CAGGGCCGAT CTCGCGCCGT ATCCCGGGCG CCGCCCGGCG CGGGTGGCGC CCGGCGCGCC GGGCGGCGCC GCGGCCGTCG TCGACCCCGT CGGCACAGAT CTCTTCGCGC CCACCTGGCG GGCCGCGCTG GCCTGCGTCG CGTCGGCCCT GGCGGTCGAC CCGTTCGCGG CCGTCGTCGC GCTGACCGTC CGCGGTGTCA CGGTGCTGCC GCCGGGCGCC GGGGCCGGTC CGCTGCCCAG GAGCGGTCCG TGGCTGCTGC GGGACTCCGC CGGCGAGGCG TTGCCGCTCG CCGACGAGGC CGTGGCGCAG TGGGGCTGGC ACATGCTCGC GGCCGGCGGG GGAGCGGGCC TCGACGTGGT CGCCGAATGG GACTCGTTCA CCCTGACACC CCTGGCCGCC ACGCCGTCGC GCACCGCGCG GGCGGAGGAC ATGCCCGGCC GGGCCCCCGT CCCGGTAGTC GCGGCACCGG TCGCCCCGGT GCCGACGCCG GGGTGGACGG ACCTCGTCGA CGCCGCGGTG ATCGGCGTGA GCCGCCGCCC GTCGCCGGTG ATTCCCGGGC TCCCTGATCC CGGCGGGCGT CCGGGCGAGG AGGAACGCCT GCTCCGGGTC GCGGCGCTGG CCGCGGTGAC CAGGCGGGCC GGTCAGCTGG CCGCCGATGC CTCCGCGGTG CCCGCGCCGC CTCCCGCCGC GGCGGATGAG CACCCACCGT GCCCCCCGAC GGCGGTCGTG 
CCCGTTCCCA CCGAGGTGGG CGCCGCCGAG ACAGCCGAGC TCGAGGAGTG GCTGGACCTG CTGGCCGAAG GAGGCTGGCG CCCGCCGGAC ACCCTGCTTC CCGCGCTGGT CGAGCTCGGC CGGCGGTCGA CCGAGCTGCG CCCCCGCCTG CTACGGGTCC TCGGCCCGCG CGGCCGCTGG TTCGCCGCGC TCCACCCGGA CGCCGGCTGG GCGGGGCCCG TCGGCGTCGC GGGCTGGCCG AGCGCGAGCG CGCAGCAGCG CCGATCCCTC ATCGCCGCGC TGCGGCACAC CGACCCGGCG GCCGCCGCCG AGCTGCTGCG CGGCGGCGCG GGCGAACCTC CCTTCGTCCC GTTCCGCCGG GCCGGCGGAG CCGAGCGCCT CGCCTTCGTC CAGGCGCTGC GCACCGGCCT CGGCCCGTAC GACGAGGAGC TGTTGGAGGC AGCCCTGGAC GACCGCCGGT CCGACGTGCG CGACGCGGCG GTCCAGGTTC TGCTGGAACT TCCCGGTACC CGTTTCGCGG CGCGGGCCGC CGCCCGCACC GAGCGGGCCT TCACGGTGCA TCGCGGCACC CTGCGCGTGC ACCCGCCGGC GGTGCTCACC GACGAGATGG CGCGGGACGG CGTCTCGCCT GACGGGCCGT CCCGCGCCAG GGCGCAGGCC GACGGGCCGG CCCGGATGCT GCTCGCGGAG ATCGCCCGGG TGGACCCGCG GCTGTGGCCG GAGCGGACCG GCCTGTCCCC GCAGAAGTTC CTGAGCGCCA GGGCGCTCTG GGAGCCCGCG CCGAGCTTCC CCGCGCTGAC CCTGGCGCCC TACCTCGTCG GCCCGGTCGT CCGGCACCGT GACCCGGAAT GGGCGCTCGC GCTGATCCCG AAGGTCGAAC CGCGCTCGCA GGGGGTGCTG ATCGGCTGCC TGCCGGAGCC CGACCGGCTG CGCGCGTTCG ACCTCGCCCT GAGCGGGACC GGCCATGCCA CGCAGCGCTG GGGCCTGCTC CTCGGCAGAT CCGGCGGGCA CGGCCTCGGC AACACCGCCC TGGGGAACCT CATCACGCTT CTCGCCGGGA TACCCGGGCC CTGGTCGGCG GAATTCACCC GGGCGGCCGG CCCCGCCATC ACGGCCATCG TCACCGCACC GGTCGACGCA CCGGGCTCCG ATCCCCAGCA GCGGGCAACG GCCCGCACGC TGCACGCCCG GGTGCGGGCG CTGCTCGCCA ACCTCGCCTG GCGGGTGGAG CCGTACGTCG GCATCCCCGA CCTGGACCCG TCCACCATGC CGGCCGAGAC GGTGTCCGGC TACCAGCGCC TCGCGGCCGT GCTCGCCCAG CGGCGAGCCC GCCGCGACAC CCTGCTCTCC CGTTCGAGCT CGAAGAAAGG CTCGTCATGA
Protein sequence | MWPQASLCLR TPVRMWVRGR EDGTELSDPV GRIAVVSNTT SPPVERGQAL ALAPDQASVK AGERLAVAGS WPLAGGDAEA LWGECRGSGK SPYRVVVALA DHASKCSCPS RKFPCKHALG LMLLAAAGGV AGAGGGRPPW AAEWLDRRVA RFAAAASAGT RAATGPESER SAAGAARREA SRAAKVDAGV AELARWLSDL AGEGLGAAQA RPADWWRAAA ARMVDAQAPG LAEMIHEAAQ IAGSAARRPD WPSRLVDRVG LLHLLCEGWA RRADLPADVV AVLRDRIGFT VPVATVLAGE HIVGEVDVLG AHEFGAGRAR GRRQWLRLVE SGRLAVLVDF AVNGQGMPPP LPRGSRVRAD LAPYPGRRPA RVAPGAPGGA AAVVDPVGTD LFAPTWRAAL ACVASALAVD PFAAVVALTV RGVTVLPPGA GAGPLPRSGP WLLRDSAGEA LPLADEAVAQ WGWHMLAAGG GAGLDVVAEW DSFTLTPLAA TPSRTARAED MPGRAPVPVV AAPVAPVPTP GWTDLVDAAV IGVSRRPSPV IPGLPDPGGR PGEEERLLRV AALAAVTRRA GQLAADASAV PAPPPAAADE HPPCPPTAVV PVPTEVGAAE TAELEEWLDL LAEGGWRPPD TLLPALVELG RRSTELRPRL LRVLGPRGRW FAALHPDAGW AGPVGVAGWP SASAQQRRSL IAALRHTDPA AAAELLRGGA GEPPFVPFRR AGGAERLAFV QALRTGLGPY DEELLEAALD DRRSDVRDAA VQVLLELPGT RFAARAAART ERAFTVHRGT LRVHPPAVLT DEMARDGVSP DGPSRARAQA DGPARMLLAE IARVDPRLWP ERTGLSPQKF LSARALWEPA PSFPALTLAP YLVGPVVRHR DPEWALALIP KVEPRSQGVL IGCLPEPDRL RAFDLALSGT GHATQRWGLL LGRSGGHGLG NTALGNLITL LAGIPGPWSA EFTRAAGPAI TAIVTAPVDA PGSDPQQRAT ARTLHARVRA LLANLAWRVE PYVGIPDLDP STMPAETVSG YQRLAAVLAQ RRARRDTLLS RSSSKKGSS
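The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first few codons. The sketch below is an illustration only: `translate` is a hypothetical helper, not a tool used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons), and it strips the spaces that separate the 10-base groups in the record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base group spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGTGGCCGC AGGCCTCGCT GTGCCTTCGA"))  # MWPQASLCLR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence from the record would allow the same comparison over the whole entry.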