Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2854 |
Symbol | |
ID | 5671243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3368763 |
End bp | 3370379 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241763 |
Product | transposase |
Protein accession | YP_001507183 |
Protein GI | 158314675 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGTCG CGATGGACTA CGGGCCGCCG AGCGTCGAGA AGCCGCTCGG CGCGCTCCCT GTCGTGCGTG ACTACCTGGC CCGCCTCGAT GTCGCGGGCA CGATCGATCG ACTCGCCCCG ATGCGTGACA AGGTCAACCG GGCCACCCAC GGCGACGTGA TCGCCGCACT GGTCGCGAAC CGGCTGACCT CGCCGACCCC GCTGCTGCAC GTCGAACGGT GGGCACGGGA CTGGGCGGTC GAAGAGATGT TCGGTATCGC GCCGGACGTG TTGAACGACG ACCGGGTCGG TCGGGCGCTT GACGCGCTCG CCCCGGTCTG CGAGGCCGTG GTCGGTTCGG TCGGTGCCGC GGCGATCGCC GCGTTCGACC TCGACCTGTC CCGTGTCCAC TGGGACATGA CATCGATCTC GCTGCACGGC GCCTATCCCG AGATCGACGG CGACTACGCG ACCCCGAAGT ACGGCCACCC GAAGGACCGC CGCCCCGACC TCAAACAGGT CCAGACCGGG CTCGCGGTCA CCGGTGACGG CGGGATCCCG CTGCTGCACC GCGCCTACGA CGGCGGCGCC GGTGAGGTCT GTCAGGTCAC CGGCGCGATG CGGGCCCTGG CCGGGCTCGC CGGCCCCCGA CAGTTCCTGC TGGTCGGCGA CAGCAAACTC GTCTCCTACG CGAATCTGAC CGCGTTGACG TCCACGCCCG GGGTCACGTT CCTCGCCCCG GCACCGAAAA CCGTCGTGCC CGGCCAGGTC CTGGCCGCCC AGGACTGGGC CACCGCCACA CTCGTCGGCT ATGTCGCGGC CCGTGACCAG GACAAACCCT TCCACCAGCG GGCGGCCTAC CGTGTCCGGG AAGGCGTCAC CACCCTGCGG GGACCCCGGA AGAAAGACCC GCCGGTCACC GTGCGCACGG TGTTCGTCTG GTCGTCGGCG AACGACCAGG CCGCGAAAGC CGCCCGCGCG CTGAAACTCG GCCGCGCCCG CACCGACCTC GACACCCTCA CCCGCGCCGC GGGCAGCCAC CACCTCTACC GCACCGAGGC CGCGGTGCAG GCCCGCCTGA CCCTCCTGGC GACGAAACAC CGCGTCACCC GCTACCTCCA CGCCACCACC AGCGTCGACC CGGACACCGG GAAACCGGCC CTGGCCTGGA ACTTCGACTC CGCCGCGCTC GACGCCGAGG CTGCCACCGA CGGCTGGTAC GCGCTGCTGA CCAACCTGCC TGACGACGTC GGCCCGGCTG AGGTCCTCGC CCGCTACAAA GGGCAGGAAG TCGTTGAACG TCGCTACGGC GCGTTCAAGG GCCCGCTCGC GGTCGCCCCG ATGTTCCTGC ACTCCAACCA GCGGATCCAC GCCCTCATCC ACGTGATCTG CCTGGCCCTG CTCGTCTTCT GCCTGGTCGA ACGCCAAGCC CGGCTCGGTA CCGGCCCTGA CGGGAAGATC CCCGGCATCT ACGCCGGTCG TCCCGCCCGC CCCACCGGAG CACTGGTCCT CGGCGCACTG AGCAGGCTGC GCCTCGTGCC CGCCCGCGGC GACCAGCCTG CCTACATCCC CCGCCCATCG GCGCTGCACC AGCACCTCCT CGACATTCTC GGCGTCGATC CCACCCGGCC ACCTTGA
|
Protein sequence | MTVAMDYGPP SVEKPLGALP VVRDYLARLD VAGTIDRLAP MRDKVNRATH GDVIAALVAN RLTSPTPLLH VERWARDWAV EEMFGIAPDV LNDDRVGRAL DALAPVCEAV VGSVGAAAIA AFDLDLSRVH WDMTSISLHG AYPEIDGDYA TPKYGHPKDR RPDLKQVQTG LAVTGDGGIP LLHRAYDGGA GEVCQVTGAM RALAGLAGPR QFLLVGDSKL VSYANLTALT STPGVTFLAP APKTVVPGQV LAAQDWATAT LVGYVAARDQ DKPFHQRAAY RVREGVTTLR GPRKKDPPVT VRTVFVWSSA NDQAAKAARA LKLGRARTDL DTLTRAAGSH HLYRTEAAVQ ARLTLLATKH RVTRYLHATT SVDPDTGKPA LAWNFDSAAL DAEAATDGWY ALLTNLPDDV GPAEVLARYK GQEVVERRYG AFKGPLAVAP MFLHSNQRIH ALIHVICLAL LVFCLVERQA RLGTGPDGKI PGIYAGRPAR PTGALVLGAL SRLRLVPARG DQPAYIPRPS ALHQHLLDIL GVDPTRPP
|
| |