Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1615 |
Symbol | |
ID | 5670018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1932554 |
End bp | 1933816 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240534 |
Product | IS891/IS1136/IS1341 family transposase |
Protein accession | YP_001505960 |
Protein GI | 158313452 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00623083 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGACGA CGCTGCAGGC GTACCGGTTC GCGCTCGACC CGAACAACGT CGGGCGTGCC GGCTTGCGCC GGCACGCGGG CGCGTGCAGG TTCGCGTTCA ACTGGGGCCT GGCTCGGGTG AAGGCCGCGC TGGCGCAGCG CGAGGCCGAG GAATCCTACG GGCTGGCCGG CGACCTGCTC ACCGCGGTTC CGTGGACGCT GCCCGCGCTG CGCCTGGCCT GGAACACGGT GAAGAACGAC ATCGCGCCGT GGTGGGCGGA GTGCTCGAAG GAGGCGTTCT CCGCCGGGCT GGCGCAGTTG GCCGCCGGGT TGAAGAACTT CTCCGACTCC CGCAAGGGCA AACGGAAAGG CCGCACGGCC GGTTTCCCCC GGTTCAAAAA GCGCGGGAAG GACCGTGACT CGTTCCGGTA CACCACCGGC TCCTACGGTC CGGACGGGGA CCGGCATGTG AAACTGCCCC GGATCGGCCG GGTGAAGGTC CACGAGCCGA TGGGCGCGCT CACCGCCCTG CTCGGCGACG GCCGTGCCCG CCTCCTCGGC GCGGCCGTGT CCCGCACGGC TGGCCGCTGG TTCGTGTCGT TCACCGTGCA GGTCGAGAAG AAGCTCCGCA GGACCTCCCG CGCCTACGCC CGGTCACAGC CGGGCAGCAG CGGCAGGCGC AAGCTCGCCA CCGACCTGGC GAAACAGCAT GCCCACACCG CCAACCAGCG GCGCGACGGG CTACACAAGG TCACCACCAA CCTCGCCCGG ACCCACCACA CGGTGGTCAT CGAGGATCTG CACGTCGCCG GCATGGTGCG TGACCACAGC CTGGCGAAGG CGGTCTCCGA CGTCGGGATG GGCGAACTGC GCCGCCAGTT GGAGTACAAG TGCGGCCGGT GGGTTCGCGA CCCGAAGACC AGGACGCAGG TGTACGTGCC CGGCTGGCAT GGCGCGCACC TGCACGTCGC GGACCGTTGG TACCCAAGCT CGAAGACCTG TTCCGGCTGT GGCTGGCGAA ACCCAAGCCT GACACTGTCG GACCGCACCT TCTCCTGCCC GTCCTGCGGG CTGGTGATCG ACCGCGACGA GAACGCGGCG GTCAACCTGG CTCGGCTCGT CGACCGCGAG TACATCGGCG ACGTTAAAAC AGCCCGTGGA GCCGACCGTA AGACCAACGC GCCAGCACCA CCGGCGCGGC GGCGGGTGGC TGTGAAGCGG GAACCGGGCA CGGCCAAGAC CGGTCAGACC CGGGGTGCCT CACCGAAAGG TGAAGCGGCA TGA
|
Protein sequence | MKTTLQAYRF ALDPNNVGRA GLRRHAGACR FAFNWGLARV KAALAQREAE ESYGLAGDLL TAVPWTLPAL RLAWNTVKND IAPWWAECSK EAFSAGLAQL AAGLKNFSDS RKGKRKGRTA GFPRFKKRGK DRDSFRYTTG SYGPDGDRHV KLPRIGRVKV HEPMGALTAL LGDGRARLLG AAVSRTAGRW FVSFTVQVEK KLRRTSRAYA RSQPGSSGRR KLATDLAKQH AHTANQRRDG LHKVTTNLAR THHTVVIEDL HVAGMVRDHS LAKAVSDVGM GELRRQLEYK CGRWVRDPKT RTQVYVPGWH GAHLHVADRW YPSSKTCSGC GWRNPSLTLS DRTFSCPSCG LVIDRDENAA VNLARLVDRE YIGDVKTARG ADRKTNAPAP PARRRVAVKR EPGTAKTGQT RGASPKGEAA
|
| |