Gene Information |
Locus tag | Franean1_4639 |
Symbol | |
ID | 5672982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5532347 |
End bp | 5533531 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243497 |
Product | transposase IS4 family protein |
Protein accession | YP_001508913 |
Protein GI | 158316405 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5659] FOG: Transposase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGGGG TCGTGGCAGC GGAGGATGTC GTCGGGTGGG AGCGGGAGCT CGCGGCGTCG ACGGACGGGC TGGGTGGGTT GTTCAACCGG CCTGAGCCCA GGCGTGTGTT CGGTGACTTC GTGCGGGCGC TGCTGGCGGA CGTACCGAAG AAGAACTCGT GGGGGCTGGC CGAGCATGCG GGTTATGCAA CGCCGCGGCC GTTCGAGCAT CTGCTCGACG GGGCTGTGTG GGACGCCGAT CTGCTGCGCG ACGCGGTGCG GGAGTTCGTG GTCGACCGGC TCGGGTCGCC GGTGGGTGTG CTGGTCGTCG ATGACACGCA GGCGTTGAAG AAAGGTGACA AGTCGGTGGG GGTGGCTCCT CAGTACTACG GGCTGACCGG GGACGTCGCG AACGTGCAGA CCATGGTCAT GTGTACCTAT GCCTCGCCGG CCGGGCACGC GTTCGTGGAC CGGGAGTTGT ACCTGCCCGA GGTGTGGACC AGCGACCCGG CCCGCTGCCG GGCGGCCGGC GTGCCCACCG ACCGACAGTT CGCCACGAAA CCCCAGCTCG CGGTGGCGAT GCTGACCCGG GCGGTCGACG CCGGGGTGCC GTTTCGCTGG GTCGTCGCCG ACAGCGGCTA CGGCAAGGAC GCCCGGCTGC GGGGGTTCTG CCACGACCGG GGGCTGTCCT ACGTGCTGGC CGTCCCGAAG AACCTCGCCC TCCTCGACGC CCGGGGCCGG CCGACCCGCC CGGACCGGTT ACACGCCCGG CTGCCCGTGG GAGTGTTCGA GCGCCGTTCG TGCGGCGCCG GGTCGAAAGG CGCCCGCTGG TATGACTGGG CCGCCCACGC GGTCACCGTC GCCGGAGAGG ACCCGGCCAG CGGGCACGCT CACACCCTGC TGGTGCGTAA GTCCACCACC CCGCGTACTC GTGACGGCAA GACCTTCTAC GACGTCGAGT ACTTCCTCGC CCACGCCCCG ACCGCGACCG GCGTCCCCGA CCTGGTCGCC GCCGCCGGGA CGAGGTGGAC CATCGAGGAA AACAACGGCC AGGGCAAGGA CGTCCTCGGT CTCGACCAGT ACCAGGTCCG GAAATGGACC CCCTGGCACC GACACGTCAC CCTCAGCATG CTCGCCCAGG CGTTCCTCGC CGCGACCCGC GCCAACCCGG GAAAAGACCC CCGCATCCAG GAGGCCACCA GCTAA
|
Protein sequence | MVGVVAAEDV VGWERELAAS TDGLGGLFNR PEPRRVFGDF VRALLADVPK KNSWGLAEHA GYATPRPFEH LLDGAVWDAD LLRDAVREFV VDRLGSPVGV LVVDDTQALK KGDKSVGVAP QYYGLTGDVA NVQTMVMCTY ASPAGHAFVD RELYLPEVWT SDPARCRAAG VPTDRQFATK PQLAVAMLTR AVDAGVPFRW VVADSGYGKD ARLRGFCHDR GLSYVLAVPK NLALLDARGR PTRPDRLHAR LPVGVFERRS CGAGSKGARW YDWAAHAVTV AGEDPASGHA HTLLVRKSTT PRTRDGKTFY DVEYFLAHAP TATGVPDLVA AAGTRWTIEE NNGQGKDVLG LDQYQVRKWT PWHRHVTLSM LAQAFLAATR ANPGKDPRIQ EATS
|
| |
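The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 5532347
END_BP = 5533531
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS that ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1185
    print(expected_protein_length(GENE_LENGTH_BP))  # 394
```

Under these assumptions both derived values match the Gene Length and Protein Length fields. Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the rounded GC content field.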