Gene Information |
Locus tag | Franean1_2409 |
Symbol | |
ID | 5670805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2862760 |
End bp | 2864148 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241326 |
Product | transposase IS4 family protein |
Protein accession | YP_001506747 |
Protein GI | 158314239 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.795016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGCAGA GTACCGGTGT CTACCCGTCT GTCCGCATCG ACGCCGCCGG GCGGGGTGTC GTGTCGGCGG CGGGCAGCGT GCTGCTGACC GAGACGGTCC GCGTCAGCGG GCTGGGTCGG TTGCTGTCCG ACGCGCTCGC GCCGTGGCGG TCGCCGCTGG CGACGCATGA CCCGGCGAAG GTACTGACCG ATCTGGCCGT GGCGCTGGCG TTGGGCGGGG ACTGCCTGGC CGATATCACG TTGCTGCGCG CGGAGCCGGC ACTGTTCGGG CTGGTGGCCT CGGACCCGAC GGTGTCACGG ACGATCGACC GGCTCGCCGG CGACGCCGAG AAGGTCCTGA CGGCACTCGA CCGGGCCCGC GCGGCAGCGC GGGGCCAGGT ATGGGCGGCG GCCGGGGAAC ACGCCCCCGA CCACACGGTG TCGGTGCACG ACCCGCTGAT CGTGGATCTC GACGCGACAC TGGTCACCGC CCATTCCGAG AAACAAGACG CGGCGCCGAC ATTCAAACGC GGCTTCGGGT TCCACCCGTT GTGGGCGTTC ATCGACCACG GCCAGGCCGG CACCGGGGAA CCCGCGACGG TCCTGCTGCG CCCCGGTAAC GCCGGGTCGA ACACCGCCGC CGACCACATC ACCGTCGCCA CCGGCGCCCT CGCCCAGCTC CCGGCCCGGC TACGCCGCTC CCGCAAGGTA CTGATCCGCG CCGACTCCGC CGGTGGGACC CACGCCTTCC TCGCCTGGGC GCACCAGCGC AGGCTGGCCT ACTCGGTCGG GTTCACCCTG CCCGACAACG CCACCCAGCT AATCGCCCGC CTCCCGAAGA AGGCGTGGAC ACCGGCCTAC GACGCCGACC GCCAGCCCCG AACCGGCGCT TTCGTCGCCG AACTGTCCGG CCTGATGGAC CTGACCGGCT GGCCACCGGG CATGCGTGTC CTCGTCCGCA AGGAACGCCC GCATCCCGGC GCGCAGCTGC GGATCACTGA CGTCGACGGC AACCGGATCA CCGCGTTCGC GACCAACAGC GTCCGCGGGC AGCTCGCCGA CCTCGAACTA CGCCACCGCC GCCGGGCCCG CTGCGAAGAC CGCATCCGAG CCGCCAAGGA CACCGGCCTG ACCAACCTGC CTCTGCACGA CCTCGACCAG AACAGGGTCT GGTGCCAGAT CGTCGCGCTC GCCTGCGACC TGATCGCCTG GACACAGATG CTCGCCCTCG CAGACCAGCC AGCCCGACGC TGGGAACCGA AACGGCTACG CCACAGCCTG TTCTGGCTCG CGGGACGGCT CGCCCACCAC GCCCGCCAGA GCGTGCTCCA CCTCGCCGCC CACCACCCCT GGACACCACT CGCCATCCAG GCGATCACCA CGCTGCGCGC CCTACCCGCT CCCGGATAG
Protein sequence | MVQSTGVYPS VRIDAAGRGV VSAAGSVLLT ETVRVSGLGR LLSDALAPWR SPLATHDPAK VLTDLAVALA LGGDCLADIT LLRAEPALFG LVASDPTVSR TIDRLAGDAE KVLTALDRAR AAARGQVWAA AGEHAPDHTV SVHDPLIVDL DATLVTAHSE KQDAAPTFKR GFGFHPLWAF IDHGQAGTGE PATVLLRPGN AGSNTAADHI TVATGALAQL PARLRRSRKV LIRADSAGGT HAFLAWAHQR RLAYSVGFTL PDNATQLIAR LPKKAWTPAY DADRQPRTGA FVAELSGLMD LTGWPPGMRV LVRKERPHPG AQLRITDVDG NRITAFATNS VRGQLADLEL RHRRRARCED RIRAAKDTGL TNLPLHDLDQ NRVWCQIVAL ACDLIAWTQM LALADQPARR WEPKRLRHSL FWLAGRLAHH ARQSVLHLAA HHPWTPLAIQ AITTLRALPA PG
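Note: the lengths in the Gene Information table follow from the coordinates. The sketch below is a minimal consistency check, not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon (the gene sequence above ends in TAG). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from coordinates, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(2862760, 2864148)
    print(length)                     # gene length in bp
    print(protein_length_aa(length))  # protein length in aa
    # First block of the gene sequence, used only as a small input example.
    print(gc_fraction("ATGGTGCAGA"))
```

Passing the full gene sequence (with its spaces) to `gc_fraction` gives a value that can be compared against the GC content field; the database's own rounding convention is not stated in the record.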