Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3637 |
Symbol | |
ID | 5672004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4309272 |
End bp | 4313129 |
Gene Length | 3858 bp |
Protein Length | 1285 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242521 |
Product | TPR repeat-containing protein |
Protein accession | YP_001507941 |
Protein GI | 158315433 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGACG CAGAGCTGCT GGTGGATCTT GATTCTGGCA GGCTATCGAC CACGATCGGA CTGGGATCAC CGATCGATGC GGCGGAGCTG GAGGACCTGC GCTGGTATCT GGAGGATTAT CTCCAGACGC CGTTCGGGGT TTACTCTGAC CGCGGTTCCC GGATCGCGGG CCAGCTCGCC GACTGGGGCC GGGCGCTGTT CGACTCGGTG CTCGAGGCGG TGCGGGTAGG CCGCGAGGAC GCCGACTTTC CTGTCCGGGG ACTCTCGGAG ATCGTAGTGC GATCGGAAAT TCCCGAACGG CTCGGGCTGC CGTGGGAACT GATGCGGGCG CCGGGCGCGT CCGTGCCGCT CGTCCTGGAC GGAATCGGGG TAACCCGATG CCTGGTCGCG AAGTCGCCGG ATGATTCCGT TGACGCGGCC GGAGACCGGC TCCGCGTGCT CATGGTGATT TCCCGGCCCG CGAGTGTCAG GGATGTCGGC TATCAGATGA TAGCTCGCCC GCTGCTACGG TCCATGGCGT TGGCACACGG CGAGGTGGAC CTGGAAGTGC TGCGCCCGCC GACGCTGGAG GCGCTGGCAG CTCGGCTGCG GGGCGCGCGC GAGGCGGGGG CGCCCTTCCA GGTGGTGCAT TTCGACGGGC ACGGCGCCAC CGACCGAGGC GCGCTGGCGT TCGAGAGACC GGGTGGTGGC GCGGACTACG TACCGGCCGG GCGCCTCGCG GGTGTACTCG CGGCGGCGGA CGTCCCGGTC GCGGTGCTCA ACGCATGCCA GTCGGGGGCG ATCGGGAAGC GGCTGGAGGC GGCGGTCGCG ACTGGTCTGG TGTTGGGTGG CGTCGATGCC GTCGTGGCCA TGTCCTACCG GGTCTACGCG GTGGCCGCCG CCGAGTTCAT GACGGCGTTC TACGGTCAGC TGCTTACGGG CGGCACCATA AGCGAGGCAG TGCGCGCGGG CCGTTCGCGG ATGGCGCAGC ACCCGGGACG GCCCAGCCCC AAGGGCGAAC TGCCCCTGGA GGACTGGGCC GTCCCGGTCT ACTACCGTCG CCGCGAGGTC CGCTTTCCGC AACTGCGCGC GGTGCCGCCG GTGTCCGGCG GCGGCCCGGA CCGCGACGAG CTGAAATCGG AGGCGGAGTT CGTCGGCCGC GACGACCTGT TCTGCACCCT GGAAATGGCC GCGCGGACCG ACCGGGTGCT TGTACTGCAC GGCCCCGGCG GCACCGGGAA GACGGAGCTC GCCAAGGCGT TCGGACGATG GTGGCGCGAC ACCGGCGGAG TCGACCGCCC GGACGGCGTC TTCTGGCACT CGTTCGAACC GGGGGTCCCC TCCCTTGGCC TGGGCGGTGC CATCCGGGAG ACCGGCCTGC GCCTGTACGG TCCCGAGTTC GCACTAAGAG ATCCCGCTGC CCGCCGTGAC CTGGTCCTGG AGCAGCTGCG CGCGCGCCGG CTGCTGCTGA TATGGGATAA CTTCGAATCG GTCGCCACGA TGCCCGGCGA CGTCGCCTCG CCGCTCGACG AGACCGCATG CGTCGAGCTG AAGGACTTCC TGCGCGAGGC GGCTCACGGG CAAAGCACGA TCCTGGTGAC CAGCCGCACC CCTGAAAGCT GGCTAGGTGA CGTGCGCCGC CTCGCTGTCG AGGGCATGCT CCCGCACGAG GCCATCGAGT ACGCCGATCA GGTTCTCGCG CCCTTTCCCG CGGCTGCGGG CCGGCGGGCG GACCGGTCGT TCGGCGAATT GCTGGAATGG CTAGAGGGGC ACCCGCTGAG CATGCGGCTG ATCCTTCCGC ACCTGGCGAC CACGGAACCA GCCGCCCTGC TCGCGGGCCT CCATGGCACC CAGCCCCTGC CCATAGCGGA GGAGGGTGCC GCAGCAGGAG AAAACGCCAG AACGCGGTCC CTGACAGCCA GCGTCGGGTA TTCCGTCGTG CACCTTTCGC CCGCGGCCCG GCGCCTGCTC GTCGGGCTGA GCCTGCTGCG CGGCGTCGCC GACGCGGAGA TCCTCGGTGC CTTCTCCGCC CAGGCGGACA CGCCGCAGCG ATTCCGCGGC GTCAGCGCCG ACGAGTGGGA CATGGTGCTC GACGAGGCCG CTCGCCTGGG CCTGCTGACC ACCCGCGCCG ACGGCGGTTA CGGGATCCAC CCCGCGCTCC CCGCGTATCT CGCCGCTCAG TGGCAGGCCG AGGAACCCAT GATCTACCCT GGTGCCCGGG CCGCGGCCGA CAGTGCCCTG CTCGCCGCCT GGGCCGCGGC CTGCCAGCTC TGGGGCAAGA CGATGGTCGC CGAGGCCAGT GCGGAAGCGT ACGCGTTCAT CGACCGGAAC CGCCGGACGA TGGGACGCAT GCTCGGCTAC GCGCTCGACA ACCGCCGCTG GGCCGCGGCG GAGCCGATCT ACATCACGCT GTACCAGTAC CTGGACCACA GCGGTTTGGC GGAGGAGGCC CGTGGCTGGT CGGACCGCGC CCGGCGCGCG GTCGAGGGTC CCGGCGGGGC TCTGTCCGCC GGGAAACTGC CCACGCTCCG GCTGGAGGAC CTGCGAGACC CGCGGGCGCT GTCCCCGGCA AAGCTGGCGG AATACCGGTC GGTTCTTTCC CTGTGGAGGG CCGTCCTGCC GGCCTTAGCC TTGTGGAACG TGGTCGTGAT GTCGCAGGCG AACACGCTGA CCCGGGCCGG GGACCTCGAC GCCGCCGAAG CCCGCTACCA GGAGATTCTC CAGGTGCAGG AGAACACGCC GGGGCTGTCG CCGGCAGCAC GTGCCCTGAC CTACGGCAAG CTCGGATTCG TCGCAGAAGA GCGAGGGCGG TTCGCTCGGG CGGACGAGAT GTACCGCAGG GCCCTGGTCA TCCAGGAGGA GGGGGGCGAC CGGCAGGGGA TCGCGCAGTC ACTTCGTCAC CTGGGCAGCG TGGCGCTGGA TCAAGGACGC TGGGACGAAG CCGCGCAGCA CTACCGGCGG TCGCTCGCGA TCTTGAAGGA GACGCCCGGA GACCTGCACA GCACCCAGCT CGTCTATGGC AAGCTCGGCG ACCTGGCGCT GCTTCGCGGC CATCTCGACG AGGCGGAGGA CTGGCACGCC GAGTCGCTGG CCCTAGCGGA ACAGCGACAC GACTGGGACG GCATGGCGCG CTCCCTGCAC AGCCTTGGAA TGCTTTCCTT CAAGCGCGGT CAACTGGCGG AGGCGGAACG CTGGTACCTT CGCTGCCTGG CGGCGGCCGA GAGAGTCCGT GACCGGCCTG GCATGGCGGC CGTCTACCAT CAGCTCGGCA TGGTGGCACT GAGACGCGGC CAGGACGAGT CGCGGGAATG GTTCCGCCGG TCCCTGGCCC TCCACGATGA GCTGGGTGAC CGTAGCGGCG TTGCCAGGGA TCACCGCATG CTCGGGATGA CCGCGATCGC GGGTGGCCAG GCGGACGAGG CCAAACGGCT GCTGTTGGAG TCGCTCGCCG CCGTGGTGGA CATTGGTGAG CAGAAGAGCA TCATCGAGTG CTACCACCTG CTCGGAATGC TGGCGCACCA GCAGGGCTGG TGGGACGAAG CCGAACGGTG GCTGCGGAAG TGCCTCGCCC TGGAGGAGAA GATGGGGCAC GAGGCCGGCG TCGCAGTCTG CTGCCTGCAA TTGGGCGTAC TCGCACAAGC CCGGAACGAC GACGCCCGAG CTTTGGGATG GACGGTTCGC GCTCTGGCCG TGGCCCGGCG TTTCCCGCAG TTGGCGGTCC GACTGGCATC CGGCCCGGTG CTGAGCACAG TGACGAGTCG GCTCGGGATC GGGGCGGTGG AGGAGTGCTG GGAGCGCGTG ACCGGCGAGT CCTTGCCCGG CGATGTCCGC CTTCTTGCCC TGGAGCACGC CCACGACGAA CCTCTAGCGG ACAGGTGA
|
Protein sequence | MADAELLVDL DSGRLSTTIG LGSPIDAAEL EDLRWYLEDY LQTPFGVYSD RGSRIAGQLA DWGRALFDSV LEAVRVGRED ADFPVRGLSE IVVRSEIPER LGLPWELMRA PGASVPLVLD GIGVTRCLVA KSPDDSVDAA GDRLRVLMVI SRPASVRDVG YQMIARPLLR SMALAHGEVD LEVLRPPTLE ALAARLRGAR EAGAPFQVVH FDGHGATDRG ALAFERPGGG ADYVPAGRLA GVLAAADVPV AVLNACQSGA IGKRLEAAVA TGLVLGGVDA VVAMSYRVYA VAAAEFMTAF YGQLLTGGTI SEAVRAGRSR MAQHPGRPSP KGELPLEDWA VPVYYRRREV RFPQLRAVPP VSGGGPDRDE LKSEAEFVGR DDLFCTLEMA ARTDRVLVLH GPGGTGKTEL AKAFGRWWRD TGGVDRPDGV FWHSFEPGVP SLGLGGAIRE TGLRLYGPEF ALRDPAARRD LVLEQLRARR LLLIWDNFES VATMPGDVAS PLDETACVEL KDFLREAAHG QSTILVTSRT PESWLGDVRR LAVEGMLPHE AIEYADQVLA PFPAAAGRRA DRSFGELLEW LEGHPLSMRL ILPHLATTEP AALLAGLHGT QPLPIAEEGA AAGENARTRS LTASVGYSVV HLSPAARRLL VGLSLLRGVA DAEILGAFSA QADTPQRFRG VSADEWDMVL DEAARLGLLT TRADGGYGIH PALPAYLAAQ WQAEEPMIYP GARAAADSAL LAAWAAACQL WGKTMVAEAS AEAYAFIDRN RRTMGRMLGY ALDNRRWAAA EPIYITLYQY LDHSGLAEEA RGWSDRARRA VEGPGGALSA GKLPTLRLED LRDPRALSPA KLAEYRSVLS LWRAVLPALA LWNVVVMSQA NTLTRAGDLD AAEARYQEIL QVQENTPGLS PAARALTYGK LGFVAEERGR FARADEMYRR ALVIQEEGGD RQGIAQSLRH LGSVALDQGR WDEAAQHYRR SLAILKETPG DLHSTQLVYG KLGDLALLRG HLDEAEDWHA ESLALAEQRH DWDGMARSLH SLGMLSFKRG QLAEAERWYL RCLAAAERVR DRPGMAAVYH QLGMVALRRG QDESREWFRR SLALHDELGD RSGVARDHRM LGMTAIAGGQ ADEAKRLLLE SLAAVVDIGE QKSIIECYHL LGMLAHQQGW WDEAERWLRK CLALEEKMGH EAGVAVCCLQ LGVLAQARND DARALGWTVR ALAVARRFPQ LAVRLASGPV LSTVTSRLGI GAVEECWERV TGESLPGDVR LLALEHAHDE PLADR
|
| |