Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4623 |
Symbol | |
ID | 5672968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5513040 |
End bp | 5514068 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243484 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001508900 |
Protein GI | 158316392 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCCAT CGTATGAGGG TCGCCAGATT GTGGGAATCG ATCTCCATCG GCGGCGGTCG GTGATCGTGC GGATGACCCC GGACGGGTAT CCGCTGGGAA CGGTTCGGAT CAGCAACGAT GCTGCCACCC TCGCTGCTGA GATCAGCCGT GCCGGTGAGC ACCCCGATGT GGTGCTCGAA GCGACCTACG GCTGGTACTG GGCGGCGGAT GTGCTGGCCG CCGCTGGCGC GCAGGTTCAT CTCGCGCACC CGCTGGGCGT GAAGGGTTTC GCCTACCGGC GGGTGAAGAA CGACGTCCGT GACGCCGCGG ATCTCGCGGA CCTGCTGCGT ATGGGACGTC TGCCCGAGGC GTGGGTCGCT CCACCGCCGG TCCGGGAACT GCGGGAGACC GTCCGTCACC GGGCGGCGCT CGTCGCGATC CGGTCGGCGT GCAAGGCCCA GATCCACGCG GTCCTCGCCA AGAACGGTGT CGCGGTGCCC ATGACCGACC TGTTCGGCCA GGCCGGAACC GACCTCCTCG GCCAGGTCCA GCTCCCGAGC CCGTTCCACG CCCGCGTCAC GAGTCTGCGC CGCCTCATCG ACCTGCTCGA CTTCGAAATC GACGCGGCCG CCTGTCAGCT CGCCGGCCGG CTCGCGCGGG ATCCGGGCTA TCAGGCGCTG CTGGTCCTGC CCGGGGTCGG GAAGACCTTG GCCGCGGTGT TCCTCGCCGA GATCGGGGAC ATCACCCGCT TCCCCACGCC CGGTCACCTT GCCAGCTGGG CTGGTCTCAC CCCCCGGCAC CGTGAGTCCG ACACCACCGT GCACCGCGGC CACATCACCA AACAGGGCTC CTCCCTGATT CGCTGGGCCG CGATCGAAGC CGTGTCGATC CTGCCTCCGA CGACCCCGGT CCTGGGCCCG ACCAAGACCC GGGTCGCTGC CCGCCGCGGC ACCAACATCG GCAAGGTCGC CGCGGCCCGC AAGCTGCTCA CGTTCGTCTT CTACGCGCTG CGCGACGGTG AGGTCCGCGC GCTGCACACG GCGGCGTGA
|
Protein sequence | MSPSYEGRQI VGIDLHRRRS VIVRMTPDGY PLGTVRISND AATLAAEISR AGEHPDVVLE ATYGWYWAAD VLAAAGAQVH LAHPLGVKGF AYRRVKNDVR DAADLADLLR MGRLPEAWVA PPPVRELRET VRHRAALVAI RSACKAQIHA VLAKNGVAVP MTDLFGQAGT DLLGQVQLPS PFHARVTSLR RLIDLLDFEI DAAACQLAGR LARDPGYQAL LVLPGVGKTL AAVFLAEIGD ITRFPTPGHL ASWAGLTPRH RESDTTVHRG HITKQGSSLI RWAAIEAVSI LPPTTPVLGP TKTRVAARRG TNIGKVAAAR KLLTFVFYAL RDGEVRALHT AA
|
| |