Gene Information |
Locus tag | Franean1_3977 |
Symbol | |
ID | 5672338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4762660 |
End bp | 4764357 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242856 |
Product | hypothetical protein |
Protein accession | YP_001508273 |
Protein GI | 158315765 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02677] conserved hypothetical protein TIGR02677 |
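The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record for Franean1_3977.
length = gene_length_bp(4762660, 4764357)
print(length)                     # 1698
print(protein_length_aa(length))  # 565
```

Both results agree with the Gene Length (1698 bp) and Protein Length (565 aa) fields.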
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.21248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.150808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGGGG TCGCGGCGTC GGAGCGGGCA GGGGGCCGCG ACCGGCCCGC GAACGTCTCT CGGTTCGGGC CCTTCGCCCA TGTGACAGTG GAGAAGGCCC CGCTGTACCG CGCCATCATG CGCACGTTCG TCCGGGCGAA GCGCAGGTTC AACGTCCACC TGCGCCCCGA GGACGTCCTG GAAATGCTCG CCCGCGAGCC GGGCCAGGAC GGGGACGGGC TGGACGGCCT GGACCTCGGC GGCATCACGG ATGTGAGCGC CGAGCTACGC AGCCTGGTCG ACTGGGGCAA CCTGCGCGGC GATCCGGACA CGAGCCGGGT CACCTCCGTC GAGGACTTCA ACCGCCCCCG ATTCCTCTAC CAGCTCACGC CCGCCGGCGA GGCCACCGAG ACCGCGCTGG AGGCTTTTGA CGAGGCGTTG GGCCGGCGCG GGGAGCTGCA GGCCGTCGCC CTGTCCGACA TCCACAGCCA GCTGCGGGCG CTGCTAGCCC TGCTGCCGGA ACCGGAGATC GATGCGGCGA AGGCGCACCA GCTGCTGCGT GACCTGGCCG GGGTCTTTCG CGGGCTGGCG GACAACGCCC AGGCGTTCAT GGGGTCGCTG CAGCGCACCA TCGATCTGCA CGACGCCGAC CTCGACGTCT TCCTGGCCTA CAAGGAACAG CTCATCGACT ATCTCCAGCG GTTCATCCGC GACCTCGTCG TCACCTCGGC GCGGATCGTC GACGTCCTGC GGGAGATCGA GGAGCACGAC CTGTCCCGCC TGCTGCGGGC GGTCGCCGAG CGCGAGGCCC GCGACGCGGC ACCCGACCTC GACCACCCGC CCCCGCCGAC ACCGTCGCAG CCGGCTCTGA CGTCGCAGCC GGAGCTGACG CAGGGGGAGC CGGCCGAGCC GCTGGCGAGC GGGGTCGTCG GCGATCGGCT GGCGTTGTGG ACGGAGCGGT GGGAGGGGTT ACGGCACTGG TTCGTCGGCG ATCGGACGCA TCCGCCTCAG TCACGCCTCC TGCGGGAGCG GGCTCGGGCC GCCGTGCCGG CGCTGCTGTC GGTGGTGCTG GTCCTCAACG AGCGCAGGTC GGGGCGCAGC GACCGGTCGG CGGACTTCCG TGCGCTCGCG CTCTGGTTCG CGCAGGCCCC GAACGACGAC GACGCCCACC GGCTATGGCG GGTGTGCTTC GGGCTTGCTC CGGCCCGCCA CCTCACGGTC GACGCGGCGA CCGTGGAGGC CCGTGACGCC GAGCCGGTGC CGGCGTCGAC GCCGTGGAGC GAGGCGCCTC CGATTCTCGT CGACATCCGG CTGCGGCGCA CCGGGCGGTA CGAACGCCGG GGCGCACCCA ACCGCGTCCG CGACCGCGCG GCCGAGCGCC GGCTGCTGGC CGAGAGACTG GCGGCCGAGG AGGATGTGGT GCGGGCCGCG CGCCGCCAGC TCGCCACCGG CGAACCCGTG CGCCTGTCCG AGCTGGCCGA GCTCGACGAC CCGACCTTCC GACTGTTCCT GACCGTCCTC GGGGACGCCC TCGCCCGCCG CCGGCCGGGG GAACTGAGCG TCGTGACCAG CAGCGCCGAC GGCACTCTGT CCATCTCGCT CACGCCCACC GACGACGGTG TGATCGCGAC CGTGCCCACC AGCGCGGGCC TGTTCACCGG CCCCGACCAT CTCCTGGAGA TCCACGACTT GCTGACAGCG ACGGCGGTGA CGGCGTGA
|
Protein sequence | MAGVAASERA GGRDRPANVS RFGPFAHVTV EKAPLYRAIM RTFVRAKRRF NVHLRPEDVL EMLAREPGQD GDGLDGLDLG GITDVSAELR SLVDWGNLRG DPDTSRVTSV EDFNRPRFLY QLTPAGEATE TALEAFDEAL GRRGELQAVA LSDIHSQLRA LLALLPEPEI DAAKAHQLLR DLAGVFRGLA DNAQAFMGSL QRTIDLHDAD LDVFLAYKEQ LIDYLQRFIR DLVVTSARIV DVLREIEEHD LSRLLRAVAE REARDAAPDL DHPPPPTPSQ PALTSQPELT QGEPAEPLAS GVVGDRLALW TERWEGLRHW FVGDRTHPPQ SRLLRERARA AVPALLSVVL VLNERRSGRS DRSADFRALA LWFAQAPNDD DAHRLWRVCF GLAPARHLTV DAATVEARDA EPVPASTPWS EAPPILVDIR LRRTGRYERR GAPNRVRDRA AERRLLAERL AAEEDVVRAA RRQLATGEPV RLSELAELDD PTFRLFLTVL GDALARRRPG ELSVVTSSAD GTLSISLTPT DDGVIATVPT SAGLFTGPDH LLEIHDLLTA TAVTA
|
| |