Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3100 |
Symbol | |
ID | 5671479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3658590 |
End bp | 3659618 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241998 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001507418 |
Protein GI | 158314910 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.185223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCCAT CGTATGAGGG TCGCCAGATT GTGGGAATCG ATCTCCATCG GCGGCGGTCG GTGATCGTGC GGATGACCCC GGACGGGTAT CCGCTGGGAA CGGTTCGGAT CAGCAACGAT GCTGCCACCC TCGCTGCTGA GATCAGCCGT GCCGGTGAGC ACCCCGACGT GGTGTTGGAG GCGACGTACG GCTGGTACTG GGCGGCGGAT GTGCTGGCCG CCGCTGGCGC GCAGGTTCAT CTCGCGCACC CGCTGGGCGT GAAGGGTTTC GCCTACCGGC GGGTGAAGAA CGACGTCCGT GACGCCGCGG ATCTCGCGGA CCTGCTGCGT ATGGGACGTC TGCCCGAGGC GTGGGTCGCT CCACCGCCGG TGCGGGAACT GCGGGAGACC GTCCGTCACC GGGCGGCGCT CGTCGCGATC CGGTCGGCGT GCAAGGCCCA GATCCACGCG GTCCTCGCGA AGAACGGTGT CGCGGTGCCC ATGACCGACC TGTTCGGCCA GGCCGGAACC GACCTCCTCG GCCAGGTCGA GCTCCCGAGC CCGTTCCACG CCCGCGTCAC GAGTCTGCGC CGCCTCATCG ACCTGCTCGA CTTCGAAATC GACGCGGCCG CCTGTCAGCT CGCCGGCCGG CTCGCGCGGG ATCCGGGCTA TCAGGCGCTG CTGGTCCTGC CCGGGGTCGG GAAGACCTTG GCCGCGGTGT TCCTCGCCGA GATCGGGGAC ATCACCCGCT TCCCCACGCC CGGTCACCTT GCCAGTTGGG CTGGTCTCAC TCCCCGGCAC CGTGAGTCCG ACACCACCGT GCACCGCGGC CACATCACCA AACAGGGCTC CTCCCTGATT CGCTGGGCCG CGATCGAAGC CGTGTCGATC CTGCCTCCGA CGACCCCGGT CCTGGGCCCG ACCAAGACCC GGGTCGCTGC CCGCCGCGGC ACCAACATCG GCAAGGTCGC CGCGGCCCGC AAGCTGCTCA CGTTCGTCTT CTACGCGCTG CGCGACGGTG AGGTCCGCGC GCTGCACACG GCGGCGTGA
|
Protein sequence | MSPSYEGRQI VGIDLHRRRS VIVRMTPDGY PLGTVRISND AATLAAEISR AGEHPDVVLE ATYGWYWAAD VLAAAGAQVH LAHPLGVKGF AYRRVKNDVR DAADLADLLR MGRLPEAWVA PPPVRELRET VRHRAALVAI RSACKAQIHA VLAKNGVAVP MTDLFGQAGT DLLGQVELPS PFHARVTSLR RLIDLLDFEI DAAACQLAGR LARDPGYQAL LVLPGVGKTL AAVFLAEIGD ITRFPTPGHL ASWAGLTPRH RESDTTVHRG HITKQGSSLI RWAAIEAVSI LPPTTPVLGP TKTRVAARRG TNIGKVAAAR KLLTFVFYAL RDGEVRALHT AA
|
| |