Gene Information |
Locus tag | Franean1_4649 |
Symbol | |
ID | 5672992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5548222 |
End bp | 5549334 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243507 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001508923 |
Protein GI | 158316415 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
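The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is only a sanity check under two assumptions: the coordinates are 1-based and inclusive, and the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 5548222
end_bp = 5549334

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not
# counted in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1113 0 370
```

Under these assumptions the result agrees with the listed Gene Length (1113 bp) and Protein Length (370 aa).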
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAGCA TGATGCTTTC CCGTGACGAT GTCATCGTCG GCGTCGACAC TCACAAAGAT CAGCATGTTG CCGTGCTACT CGACGGTCTC GGAGGCCGTC TCGCGGACCT GGTGATCCCG GCGACCCCGC CCGGCTTCGA GATGCTGCTC GCATTCTGCC TGGCCCGGGT GCATTCTCCT GGGCGGCTTG TTGCCTTTGG GGTCGAGGGA ACCGGCTCCT ACGGTCTGGG ACTGGCGCGT TTTCTGCGCC GGCATGGCCA CGACGTCCGG GAAGTCAGCC GTCCACCACG GAAAGGTGAG CGACGTCAGG CGGGCAAGAC CGACACAATC GACGCCGAGC ATGCCGCGCG CCAGGTCGTC GCCGGCGTGC TGACCGCGAC GCCGAAGACG GCCGATGGGT CGGTCGAGGC GCTTCGTCTG ATCAAGGTTG CCCGGGACAC TGCGGTGAAG GCCCAGTCAG CAGCCATGAT CACGTTGAAG GCGACGTTGG TGACCGCCGA CGACGAGCTC CGGGGCGCGC TCGAACCTCT CACAGATCAC CGCCTGATCG AAGCCTGCGC GGCGCTCGAG TGCGTGGGGG CGCCGACGAC ACCCGCGAAG GCGATGAGGC ACGTGCTGGC GTCGCTGGCG CGGCGGTGGC TGAGTCTGCA CGAAGAAGTC AAGTCACTGA GCTGGCATCT CAAACATCAG ACAAAAACCG CCGCGCCACG GCTTGTCGAA GCAGTCGGTA TCGGCCCTGA CACGGCGGCT GAAATGCTGA TCGCCGCCGG GGACAACACC GACCGGATCC GCTCGGAATC GGCGTTCGCG AAGCTCTGCG GCGTGAGTCC GATCCCGGCG TCCTCCGGGA AGACGCACCG TCACAGGCTC AACCGAGGTG GGAACCGGCA GGCCAACGCC GCGCTCTACC GTACCGTCAT CGTGCGCATG CGATGGCATC AACCTACCAT CGATTACGTC GAGCGGCGCA CCGCAGAAGG ACTTACGAAG CGTGAGATCA TCCGCTGCCT GAAACGATAC GTCGCGCGGG AACTCTATCG CCTTCTACCG CCATCGAACG TGGTCGAGTA CAGCCGCGCC GCGGCTTCTG ATCCGTCGCC TCAGGCCGCT TGA
|
Protein sequence | MPSMMLSRDD VIVGVDTHKD QHVAVLLDGL GGRLADLVIP ATPPGFEMLL AFCLARVHSP GRLVAFGVEG TGSYGLGLAR FLRRHGHDVR EVSRPPRKGE RRQAGKTDTI DAEHAARQVV AGVLTATPKT ADGSVEALRL IKVARDTAVK AQSAAMITLK ATLVTADDEL RGALEPLTDH RLIEACAALE CVGAPTTPAK AMRHVLASLA RRWLSLHEEV KSLSWHLKHQ TKTAAPRLVE AVGIGPDTAA EMLIAAGDNT DRIRSESAFA KLCGVSPIPA SSGKTHRHRL NRGGNRQANA ALYRTVIVRM RWHQPTIDYV ERRTAEGLTK REIIRCLKRY VARELYRLLP PSNVVEYSRA AASDPSPQAA