Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5501 |
Symbol | |
ID | 5673832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6660998 |
End bp | 6666016 |
Gene Length | 5019 bp |
Protein Length | 1672 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244356 |
Product | hypothetical protein |
Protein accession | YP_001509762 |
Protein GI | 158317254 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.627379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.180231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTTGA CCGCTCCGGC GTCCGGAGCC GCCCATCCCG ACGGGCCGGC CCCGGGCGCC ACGACGGGCC CCGGCGGCGC GCCGCCCTCC GATCACCCGC GCCGGCAGGG GCGGCTCGGC CGGGTCCGCC GGCTGCTGCC GTCCTGGCCC GCGCTGGTAC TCGCCGCGCT GGCCTACATC CCGCTGCTGG CGACGGCGCC GGGGCGGATC GGCGCCGACA CGAAGGCCTA CCTCTACCTC GACCCGGGCC GGATGCTGGC CCGCGCGGTG TCGATGTGGG ACCCCGACGT CGGCATGGGG ACCGTCACCC ACCAGAACAT CGGCTACCTG TTCCCGCAGG GCGCGTTCTA CTGGCTCGCC CAGCTCGCCG GGCTGCCCGA CTGGGTGGCG CAGCGGCTGT GGACGGGCTC GATCCTGTTC GGCGCGGGCG CCGGCGTGCT GTTCCTGCTG CGCACGTTCG GCTGGGCGAA CCGCTACGCG TTCATCGCGG CCCTCGGGTA CATGCTCACC CCGTACACGC TGGAGTACGA GGCGCGGATC TCGGCCATCC TGCTGCCGTA CGCGGGCCTG GGCTGGCTGA TCGGCATCAC CGTGCGGGGG CTGCGCGAAG CCGGCGCGGA CGACCGCTCC CGCGCCGCGG ACACCCCGCG CCTGGTGCGC TGGCGCTCGG GCTGGCGCTG GCCGGCGGCG TTCGCGCTGA TGGTCACCCT GATCGGCAGC ATCAACGCGT CCAGCCTGAT CTTCATCCTG TTCGCTCCGC TGCTGTGGGT GCCGTTCGCG GTGTGGGGCA CCCGGGAGGT CCGTCTCGGC ACCGCCCTGA CCCTGTGCGG GCGGGCGGTC GCGCTGGTCG TCGTGACGTC CGCCTGGTGG ATGGCCGGTC TGTACACCCA GGCCGGCTAC GGGCTGAACG TGCTCGCCTT CACCGAGACG GTGAAGACCG TGGCGAGCAG CTCGCAGGCG TCCGAGGTGC TGCGCGGCCT GGGTAACTGG TTCTTCTACG GCGAGGACGC GCTCGGCCTG TGGATCGGCC CGGCCAAGGA CTACACCGGC AGCCTGGTGA TCATCGCGAT CAGCTTCGCG GTGCCGATCC TGGCCCTGGT CGCCGCGTCC TGCCTGCGGT GGGGCCAGCG CGCGTACTTC GTGGCGCTGA TCGCGCTGGG CACGACGATC GCGGTCGGCG TCTACCCGTA CGACCACCCG TCCCCGCTGG GCCGGGTGTT CCGCGACTTC GCCGAGGGCT CCACCGCGGG CCTGGCGCTG CGGTCGCTGC CGCGTGCCGT CCCGATGGTC GTCCTGGGGC TCTCGGTGCT GCTCGCCGGT GGGCTCGCCG TCCTGGACCA GCGGTACGCG GCCCGCCGCG CACAGGCCCG GGCCGGCGCG TCGTCCACGA CCGATACCCC GCGGCCCGCG GCCGGTGCTC TACGGCTGCG CGGGCGCACG GTGCCGACGC TGGCGTTCGG CGGGGTGGCG CTGCTGCTCG TGCTGAACAT GTCACCGCTG TTCCGCGGGC ACTTCATCGA GCCGCTGCTG GACCGGCCCG AGGACATCCC CGGCTACGAG CAGGAGCTCG CCAGCGCGCT GGACGCCGTC GCCCCCGACG CGACCGGGGA ACAGACCCGG GTGCTGGAGC TGCCCGGCGC CGACTTCGCG CACTACCGCT GGGGCACGAC GCTCGACCCG GTCATGTCCG GGCTGATGGA CCGGCCGTCC GTGGTCCGCG AACTCATCCC CTACGGCGAC GCCGGCTCCG TGGACCTGCT GCGCTCGCTG GACCGGCGGA TGCAGGAGGG CGTGCTCGAC CCGGCTTCGA TCCCCGACAT CGCCCGGCTG ATGAGCGCCG GCGACGTCGT GCTGCGCAGC AACCTGGCCT ACGAGCGGTT CCGGACCCCG CGCCCGCGCG CGACCTGGGA CCTCGTGGCC AACCAGCGCC CCGCCGGGCT GGCCGAGCCG CGCACGTTCG GACCTCCGGT GCTGGAGGAC CCGGGCATCC CCTACACCGA CGAGATCACC CTGGGCACCA ACGCCGACGT CATCGACCCG CCGGCCCTCG CCGACTTCCC CGTCGACGAC CCGCACCCGA TCGTGCGTGC CGAGCGCACC GACGCCCCGC TGCTGGTCAG CGGCAACGGG GAGGCCCTCG TCGACGCGGC GGCCACCGGC CAGCTCGACG CGGTGCTCAA CGACGGCCGC ACCATCCTCT ACGCCGGTGA CCTCGCCAGC GACCCGCAGC GGCTGCGCCA GGCCCTGGAC GACGGCGCCG AGCTGCTGGT CAGCGACACC AACCGGCTGC GCGCCGAGCG CTGGACGGGC ATCCGGGAGA ACTTCGGCTA CGTCGAGCAG CCCGGCGTCA CCCCGCTGGC GAAGGACGCC AACGACAACC GGCTGGTGCT CTTCCCCGAC GCCGGCCACT CGGCGCAGAC CGTCACCCAG ATGCACGCAC CGGGCTCGGA GGCCCAGGTC GCGGACGTGC GGGCCACCGA TTACGGCAAC ACCTTCTCCT ACGGGGTCTC CGACCGCCCG GTGCTCGGCG TCGACGGCAG CCTGGACTCG GCCTGGCGGG TCGGCGCGTT CACCGACCCG GCGGGCGCCG CCTGGCAGGT CGACCTGGCG AAGCCGACGA CCACCGACCA CATCCGCGTC GTCCAGCCGC TCAACGGCCC GCGCAACCGG TGGATCACCA GGGCGACCCT GACCTTCGAC GGCGGCTCTC CGGTCACCGT CGACCTGCGG GACTCCTCCC GCACCCAGGC CGGCCAGACG ATCACCTTCC CGTCCCGGAC GTTCACCACA CTGCGCATCC ACGTCGATGC CACGAACTTC GGGGTGCGGC GGACCTACGA CGGGCTGTCC GCCGTCGGCT TCGCCGAGGT GGAGATCCCC GGCACCAACG GGAAGCCGCT GACCGCGGAG GAGGTGCTGC GCCTGCCCAC CGACACCCTC GACGCCGCCG GCGCCTCCTC CCTCGACCAC CGGCTGAGCC TGCAGATGTC CCGCGACCGG GCGAACCCCG CCGAGCCGTT CCGCCGCGAC CCCGAGCCGA CGATGGCACG CTCGTTCACG CTGCCCACGG CGCGCACGTT CGCGCTCACC GGCACCGCCC GGATCTCCGC CTACGCCTCC GACCAGCTCG TGGACGCGAC CCTCGGCCGG GCCGGAGCCG CGCCGACGGT CACCTCCTCC GGCCGGCTGC CCGGCGCGCT CGCCGCACGC GGGTCGAGCG CGTTCGACGG CGACACCACC ACGGCGTGGA GCCCCGGCAT CGGCGACCAG ATCGGCTCCT GGCTCCAGGT GACCTCGCCG ACCCCGGTCA CGTTCTCCTC GATGAACCTC GCCCTCGTCA CCGACGGGCG GCACTCCGTA CCCACGAAGA TCGGCATCAT GGTGGACGGG CGTCGGGTCG GCGCGGTCGC CGTCCCACCG GTCGCCGACA CGCCGGTCAG CTCGCAGACC CGCAACGCCG CCCAGAACGT CAGCCTCACC TTCCCCGCGG TCACCGGCAG CGCCATCCGG CTGGTCGTGG ACGACGTCCG CACCGTCACC AGCGTCGACA CGATCAGCGG CGCGGTGATG GACCTGCCGG TCGGCATCGC CGAGGTCGTC GTACCGGGCC TGGACGGCAC CGCCGGCGGA ACGGGCCGCG GCTCGGCCGC GACCACCGCT CCCGGCACCG CCGGCTCACC CGCCACCGCC ACGGCCACCG GTGCCGACAC CGGCGCTGTC ACCGGAGCCG CAGCCGGCCT GACGATCACC GCGGTTCCCG GGACCACCGG GACGGCGGCG ACCGGTACCG CGGACATTCC CGCCCCCTGC CGCACCGACC TGCTGACCAT CGACGGCGCA CCGGTCGGGA TCCAGGTCAC CGGAAGCATC AAGGACGCAG CCGGCCGCGC CGGACTGACG GTGCGGGCCT GCGGGACACC GGTCACCCTC AACGCGGGTG ACCACGTGGT GCGCACCTCC AACGGCGCGC TGACCGGGAT CGACGTCGAC CGGCTGCTGC TGGCCTCCGA CGCCGGCGGC GGAGCCTGGC TCGACGCCAC CGGGGCGGGC CCCGGGACTG CCGCCGTTCC GGGCTCGGCC GGCACCGGCG CGGGTGGCAC CGGTGTGGCC GCGGCCGCCA CGAGCGGCAG CGTGGGCGGG ACCGCACCGA CCGTGAAGGT CGAGGCGACC AGCGACACCT CGTTCCGCGT GGATGTCAGC GCCGCGACCC CGGGCAAACC GTTCTGGCTC GTGCTGGGCG AAAGCCGCTC CCCCGGCTGG ACGGCCCGCG TCAACGGCCA GGACCTCGGC GAATCCCACC TGGTCGACGG ATACGCGAAC GGCTGGCGGA TCGACCCGAG CGCCGGGAGC TTCACCGTCC TCCTCGACTG GGCACCGCAG CACATCGTCG GCATCGCGCT GAAACTGTCG GTGGTGACGG GCATCCTGAG CCTGGCGATC CTGCTGGTTG GGGCGTACCA GCGCCGCCGC GGCCGGAGCT GGCGCCATCT CGTCGGCTAC CAGTACATCC CCGCGGGCAC CGGCACCGGC GGAGTCGCCC GAGCGGGCAC CGCGGGTCCA GGCCGGGCTG ATCTGCCGAC GGCGGATCCC TGGCGGGCCG GGCCGGGCCC GGCCCTCCGG GTGTCGGGCC GGTCGACGCT GCTCACGGCC CTGGCCACCG GGGCGGCGAG CGTGGTGCTG GTCGCCCCGC TGGCGGGACT CGTGGTTGCC GTGCTGACCG CCGCCGCGCT GCGGATCCCC CGCGCCCGGC CGCTGCTGCG CGTCGCCCCC GCCGCGTGCC TGGCTCTCAG CGCGCTCTAC GTCCTCCAGG TGCAGGCCCG GCACGACCTG CCAGCCAACG GCTCCTGGGT GGCCGCGTTC GACCGAGTCG CCATGATCTC CTGGCTCACG GTCCTCCTCC TGGCCGCCGA CATGATCGTC TCGCTGGTCC GCGCCCGCGC CGCTGCCCGG ATCGCCTCGG GCACCGCCCC CGGGACCTCC ACCGCTCCCG CCGCACAGGG CTCCCCGTCC GAGCTGTGA
|
Protein sequence | MTLTAPASGA AHPDGPAPGA TTGPGGAPPS DHPRRQGRLG RVRRLLPSWP ALVLAALAYI PLLATAPGRI GADTKAYLYL DPGRMLARAV SMWDPDVGMG TVTHQNIGYL FPQGAFYWLA QLAGLPDWVA QRLWTGSILF GAGAGVLFLL RTFGWANRYA FIAALGYMLT PYTLEYEARI SAILLPYAGL GWLIGITVRG LREAGADDRS RAADTPRLVR WRSGWRWPAA FALMVTLIGS INASSLIFIL FAPLLWVPFA VWGTREVRLG TALTLCGRAV ALVVVTSAWW MAGLYTQAGY GLNVLAFTET VKTVASSSQA SEVLRGLGNW FFYGEDALGL WIGPAKDYTG SLVIIAISFA VPILALVAAS CLRWGQRAYF VALIALGTTI AVGVYPYDHP SPLGRVFRDF AEGSTAGLAL RSLPRAVPMV VLGLSVLLAG GLAVLDQRYA ARRAQARAGA SSTTDTPRPA AGALRLRGRT VPTLAFGGVA LLLVLNMSPL FRGHFIEPLL DRPEDIPGYE QELASALDAV APDATGEQTR VLELPGADFA HYRWGTTLDP VMSGLMDRPS VVRELIPYGD AGSVDLLRSL DRRMQEGVLD PASIPDIARL MSAGDVVLRS NLAYERFRTP RPRATWDLVA NQRPAGLAEP RTFGPPVLED PGIPYTDEIT LGTNADVIDP PALADFPVDD PHPIVRAERT DAPLLVSGNG EALVDAAATG QLDAVLNDGR TILYAGDLAS DPQRLRQALD DGAELLVSDT NRLRAERWTG IRENFGYVEQ PGVTPLAKDA NDNRLVLFPD AGHSAQTVTQ MHAPGSEAQV ADVRATDYGN TFSYGVSDRP VLGVDGSLDS AWRVGAFTDP AGAAWQVDLA KPTTTDHIRV VQPLNGPRNR WITRATLTFD GGSPVTVDLR DSSRTQAGQT ITFPSRTFTT LRIHVDATNF GVRRTYDGLS AVGFAEVEIP GTNGKPLTAE EVLRLPTDTL DAAGASSLDH RLSLQMSRDR ANPAEPFRRD PEPTMARSFT LPTARTFALT GTARISAYAS DQLVDATLGR AGAAPTVTSS GRLPGALAAR GSSAFDGDTT TAWSPGIGDQ IGSWLQVTSP TPVTFSSMNL ALVTDGRHSV PTKIGIMVDG RRVGAVAVPP VADTPVSSQT RNAAQNVSLT FPAVTGSAIR LVVDDVRTVT SVDTISGAVM DLPVGIAEVV VPGLDGTAGG TGRGSAATTA PGTAGSPATA TATGADTGAV TGAAAGLTIT AVPGTTGTAA TGTADIPAPC RTDLLTIDGA PVGIQVTGSI KDAAGRAGLT VRACGTPVTL NAGDHVVRTS NGALTGIDVD RLLLASDAGG GAWLDATGAG PGTAAVPGSA GTGAGGTGVA AAATSGSVGG TAPTVKVEAT SDTSFRVDVS AATPGKPFWL VLGESRSPGW TARVNGQDLG ESHLVDGYAN GWRIDPSAGS FTVLLDWAPQ HIVGIALKLS VVTGILSLAI LLVGAYQRRR GRSWRHLVGY QYIPAGTGTG GVARAGTAGP GRADLPTADP WRAGPGPALR VSGRSTLLTA LATGAASVVL VAPLAGLVVA VLTAAALRIP RARPLLRVAP AACLALSALY VLQVQARHDL PANGSWVAAF DRVAMISWLT VLLLAADMIV SLVRARAAAR IASGTAPGTS TAPAAQGSPS EL
|
| |