Gene Information |
Locus tag | Franean1_2645 |
Symbol | |
ID | 5671038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3127470 |
End bp | 3129038 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241560 |
Product | transposase IS4 family protein |
Protein accession | YP_001506980 |
Protein GI | 158314472 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
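
The length fields in this record follow from its coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3127470, 3129038)
print(length_bp)                  # 1569
print(protein_length(length_bp))  # 522
```

Both values agree with the Gene Length and Protein Length fields listed above.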
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCTCCGT ACGTGCGGAC GGTGAAGACG GCGTCGGGTG CGCGGGCGGT CCAGATCGTG CACTCGCAGC ACAAAGGTTC GCGGGAGATC GAGCACGTCG GGTCGGCGCA CATCGACGCC GACCTGGAGC TGCTCAAGGC GGTGGCGCGG CAGCGGCTGG CCGCGGGGCA GGGCGAGCTC GACCTGCGGC TGCCCGGGAG CCCTGCGAAC TCCGGCGCCG CCTTGCCGAT CACGTCGACG CGGGCGGGCC ACCTGCTCGA CGCGCTGGCC CGAGGCTACG AGGCGCTGGG GTTCGGTGCC GTGACCGGGC GGGACGAGGT GTTCCAGGCG TTGGTGCTCG CACGGGTAGT CGAGCCGACC AGCAAGCTCG ACTCGCTGCG CGTGCTGGAA GAAGTCGGTG CCCCGGCGCC GTCCTACCGG ACCGTCCAGC GCCGGCTGCG TCGCTACGCC GGCGTCGACG AGGTCGACGC CGAGACCGGG CAGCCCGTTC CCGTCGACCC GGCGGGCGGG CCGTGGCGGG CACGGCTGTC GCGGGCCTGC GCGGACCACG TCAAGCTCGG CCCGGCGATT CTGTTGCTGT ACGACGTGAC GACGCTGTAC TTCGAGACCG ACCAGGGCGA CGGGTTCCGC GAGCCGGGGT TCAGCAATGA ACGGCGCCTG GAACCGCAGG TCACCGTCGG CCTGCTCACC GACGGGGCGG GGTTCCCGCT GACGGTGCAC GCCTTCGAGG GCAACCGCGC GGATACCACC ACGATGCTGC CCGTCCTGAC CGCCTTCCTG AAAGCGCACG ACCTGCGCGA CGTGACAGTC GTGGCGGACG CCGGCATGGT CTCGGAGGCG AACAAACGCG CGATCGAAGC AGCCGGGCTG TCGTTCGTCC TCGGCGCCCG CGTCCCCGAG GTGCCCTACC TGGTCAAGGC GTGGCGCGAA CGGCACCCGG ACACCGAAAT CCCCGACGGG CACGTGTTCG TCCAGCCCTG GCCCGCCGGC CCGTCCGACA ACCGGCGCGA CCACACGGTC TTCTACCAGT ACAAGGCCGA CCGAGCCCGT CGCACGCTGC GCGGCATCGA CCAGCAGGTC GCCAAGGCCG AGAACGCCGT CGCAGGCAAG ACCGCGGTGA AACGGAACCG GTACGTGCGC CTGACCGGTG CGAAGAAGTC GGTCAACCGG GCGCTCGAAG AGAAGAACCG CGCCCTGGCC GGGATCAAGG GCTACGTCAC CAACTTGCCG AACCCCGACC CCGACCAGGT CATCAGCACC TACAGCCAGC TGCTGAACGT CGAGAAGAGC TTCCGGATGA GCAAGTCCGA CCTCGCGGCC CGGCCGATCT ACCACCACAC CCGAGAGTCG ATCGAGGCGC ACCTGACAGT CGTGTTCGCC GCCCTCGCCG TCAGTCGCTG GATCGAGAAC ACCACCGGCT GGTCCGTCCG CAAGTTCGTC AAGACCGCGC GCCGCTACCG CACCGTCACG ATCCAGGCCG GAGAGCACAC CATCACCGCA GCCGATCCCG TTCCCGACGA CCTACGAGCC GCCCTCGACG CCGTCCACGG CACGCACCAA TTGGCCTAA
Protein sequence | MSPYVRTVKT ASGARAVQIV HSQHKGSREI EHVGSAHIDA DLELLKAVAR QRLAAGQGEL DLRLPGSPAN SGAALPITST RAGHLLDALA RGYEALGFGA VTGRDEVFQA LVLARVVEPT SKLDSLRVLE EVGAPAPSYR TVQRRLRRYA GVDEVDAETG QPVPVDPAGG PWRARLSRAC ADHVKLGPAI LLLYDVTTLY FETDQGDGFR EPGFSNERRL EPQVTVGLLT DGAGFPLTVH AFEGNRADTT TMLPVLTAFL KAHDLRDVTV VADAGMVSEA NKRAIEAAGL SFVLGARVPE VPYLVKAWRE RHPDTEIPDG HVFVQPWPAG PSDNRRDHTV FYQYKADRAR RTLRGIDQQV AKAENAVAGK TAVKRNRYVR LTGAKKSVNR ALEEKNRALA GIKGYVTNLP NPDPDQVIST YSQLLNVEKS FRMSKSDLAA RPIYHHTRES IEAHLTVVFA ALAVSRWIEN TTGWSVRKFV KTARRYRTVT IQAGEHTITA ADPVPDDLRA ALDAVHGTHQ LA
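
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below shows one way to strip the 10-base grouping, compute GC content, and translate such a sequence. It assumes the standard codon assignments, which table 11 shares, and treats only ATG, GTG and TTG as start codons (table 11 lists a few more). All names are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(seq: str) -> str:
    """Remove whitespace between the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence above.
print(translate("GTGTCTCCGT ACGTGCGGAC GGTGAAGACG"))  # MSPYVRTVKT
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked against the record.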