Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2869 |
Symbol | |
ID | 5671258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3384116 |
End bp | 3385504 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241778 |
Product | transposase IS4 family protein |
Protein accession | YP_001507198 |
Protein GI | 158314690 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.338684 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCAGA GTACCGGTGT TTACCCCTCT GTCCGCATCG ACGCCGCCGG GCGGGGTGTC GTGTCGGCGG CGGGCAGCGT GCTGCTGACC GAGACGGTCC GCGTCAGCGG GCTGGGTCGG TTGCTGTCCG ACGCGCTCGC GCCGTGGCGG TCACCGCTGG CGACCCATGA CCCGGCGAAG GTACTGACCG ATCTGGCCGT GGCGCTGGCG TTGGGCGGGG ACTGCCTGGC CGACATCGCC CAGCTACGCG CGGAGCCGGA GTTGTTCGGG CTGGTGGCCT CGGACCCGAC GGTGTCACGC ACGATCGACC GGCTCGCCGG CGACGCCGAG AAGGTCCTGA CGGCACTCGA CCGGGCCCGC GCGGCAGCGC GGGGCCAGGT ATGGGCGGCG GCCGGGGAAC ACGCCCCCGA CCACACGGTG TCGGTGCACG ACCCGCTGAT CGTGGATCTC GACGCGACAC TGGTCACCGC CCATTCCGAG AAACAAGACG CGGCGCCGAC ATTCAAACGC GGCTTCGGGT TCCACCCGTT GTGGGCGTTC ATCGACCACG GCCAGGCCGG CACCGGGGAA CCCGCGACGG TCCTGCTGCG CCCCGGTAAC GCCGGGTCGA ACACCGCCGC CGACCACATC ACCGTCGCCA CCGGCGCCCT CGCCCAGCTC CCGGCCCGGC TACGCCGCTC CCGCAAGGTA CTGATCCGCG CCGACTCCGC CGGTGGGACC CACGCCTTCC TCGCCTGGGC GCACCAGCGC AGGCTGGCCT ACTCGGTCGG GTTCACCCTG CCCGACAACG CCGCCCAGCT GATCGGCGAG ATCCCCGCGA AGGCGTGGAC ACCGGCCTAC GACGCCGACC GCCAGCCCCG GCCCGGCGCG TTCGTCGCCG AACTGTCCGG GCTGATGGAC CTGACCGGCT GGCCACCGGG CATGCGTGTC CTCGTCCGCA AGGAACGCCC GCACCCCGGC GCGCAGCTGC GTGTCACCGA CGTCGACGGC AACCGGATCA CCGCGTTCGC GACCAACGCC GCCCGCGGGC AGCTCGCCGA CCTCGAACTA CGCCACCGCC GCCGGGCCCG CTGCGAGGAC CGTATCCGAA CCGCCAAGGA CACCGGTCTG ACCAACCTGC CGCTGCACGA CTTCACCCAG AACCAGATCT GGTGCGCCCT TGTCGCACTC GCTCTCGATC TGATCGCCTG GACCCAGATG CTCGCCCTCG CCGGCCAGCC CGCGCGACGC TGGGAACCGA AACGGCTACG CCACCGCCTG TTCTGGCTCG CAGGCCGGCT CGCCCACCAC GCCCGCCGCA GCACCCTGCA CCTGCCCCCG CACCACCCCT GGGCCCCCCT CGCCCTCCAG GCGATCACCA CACTGCGCGC ACTGCCCGCC CCCGGATAG
|
Protein sequence | MVQSTGVYPS VRIDAAGRGV VSAAGSVLLT ETVRVSGLGR LLSDALAPWR SPLATHDPAK VLTDLAVALA LGGDCLADIA QLRAEPELFG LVASDPTVSR TIDRLAGDAE KVLTALDRAR AAARGQVWAA AGEHAPDHTV SVHDPLIVDL DATLVTAHSE KQDAAPTFKR GFGFHPLWAF IDHGQAGTGE PATVLLRPGN AGSNTAADHI TVATGALAQL PARLRRSRKV LIRADSAGGT HAFLAWAHQR RLAYSVGFTL PDNAAQLIGE IPAKAWTPAY DADRQPRPGA FVAELSGLMD LTGWPPGMRV LVRKERPHPG AQLRVTDVDG NRITAFATNA ARGQLADLEL RHRRRARCED RIRTAKDTGL TNLPLHDFTQ NQIWCALVAL ALDLIAWTQM LALAGQPARR WEPKRLRHRL FWLAGRLAHH ARRSTLHLPP HHPWAPLALQ AITTLRALPA PG
|
| |