Gene Information |
Locus tag | Franean1_1461 |
Symbol | |
ID | 5669865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1757384 |
End bp | 1759003 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240381 |
Product | transposase IS4 family protein |
Protein accession | YP_001505807 |
Protein GI | 158313299 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
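The coordinate and length fields above are tied together by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default, that the
    final codon is a stop codon that is not counted as a residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(1757384, 1759003)  # Start bp and End bp from the record
    print(length, protein_length_aa(length))
```

Applied to this record, the sketch reproduces the listed Gene Length (1620 bp) and Protein Length (539 aa).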
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.646816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCGGACGG CCTCGGGTGC GACGGCGGTG CAGATCGCCG AGTACGTCGG TGGCCGTCGT CAGCGGATCG TGGCGCATGT GGGGTCCGCC CATACCGAAG CGGAGCTCGG GATCCTGCTG GCGCGGGCCG AGGAGATGCT CGCCGACAGC CAGCAGGCGG CGCTCGACCT CGGGATCGAG CCGGCCGTGC GCACGGCCAG GCTGCTCGGG TCGCCGCGTG AGCCGGCGCT GTTCGACCCC GAGCCCGCCG CCGGACCTGC CGCGGTGGTC GGGCCCGCGA AGGTGCTCAC GACCGCGTCG GTGCTGCTGT TCGACGCCCT GGCCTCGGTG TTCACCGATC TCGGGTTCGA CGCGTTGGGC GACCCGGTGT TCCGGGACCT GGTGATCGCC CGGGTCGTGG AGCCGACGTC GCTGCTCGAC ACCGGCCGGG TGCTCACCGA CTTGGGACGC AAGCCCGCGG CGTACGCGAC GATGAAACGC ACCCTGACCC GCTGCGCCTC CGGCGGCTAC CGCGACCAGG TCGCCGACCT GTGCTTTGCC CACGCCCTGG CCCACGGCGA CGTCTCCCTA TGCCTCTATG ACGTGACCAC GCTGTACTTC GAGGCCGAGA AGGAAGACGA CCTACGCAAG GTCGGCTACT CCAAGGAACG TCGCGTCGAC CCGCAGATCG TCGTCGGGCT GCTTGTCGAC CGTTACGGCT TCCCGCTGGA GATCGGCTGC TTCGAGGGCA ACCGGGCCGA GACCGCGACG ATCCTGCCGA TCATCCGCCA GTTCAAAGAC CGTCACCAGC TCGAGAACCT GGTCGTGGTC GCCGACGCCG GCATGCTGTC CGCGACCAAC CTGCGTGAAC TCGACGACGC CGGGTTCGGG TTCATCGTCG GCTCACGGGT CACCAAGGCG CCGATCGACC TGGCCTCCCA CTTCCGCTGG CACGGCGACG CCTTCACCGA CGGCCAGGTC ATCGACACGG TCACGCCCCG CACAGGGCGC AACCGTGACA ACGACACCGA CGTGAAGACC GAGCCTGTCT GGACCCGTGA CCAGCACCCC AGGTCGTGGC GAGCCGTCTG GGCCTACTCG GCCAAGCGCG CCGCCCGCGA CAACAAGACG CTGACCGCGC AGGAGAACCG CGCCCGCGCT GTCGTCGACG GCGAGAAGAC TACCCGCACG CCGAGATTCG TCACCGTCAA GGGCGACGCC GCCACCCTCG ACGAGGCCAG CCTCACCCGG GCCCGCCGGC TCGTCGGGCT GAAGGGCTAC GTCACCAACC TGCCGGTCAC CGTCCTGACC GCCGACCAGG TCATCTCGAA CTACCACGAC CTTTGGCACG TCGAGCAGTC GTTCCGGATG TCGAAGACCG ATCTCGCTGC CCGGCCCATG TTCGTCCGCA CGAAGGAGGC GATCGACGCC CACCTGACGA TCGTGTTCAC CGCGCTCGCC GTCGCCCGCA CCGTTCAGAA CCGCACCGGC CTCGCGGTCC GCAACGTGAT CCGACAGCTC CGCCCGCTGC GCTCCGCGAC CATCGCGATC AACGGCGCCA TCCAGACCTT CCCGCCCGCG ATCAACCCGG ACAAACAAGC GGTACTCGAC ACCCTCCACG CGGCGGCCGT CACGCACTAA
Protein sequence | MRTASGATAV QIAEYVGGRR QRIVAHVGSA HTEAELGILL ARAEEMLADS QQAALDLGIE PAVRTARLLG SPREPALFDP EPAAGPAAVV GPAKVLTTAS VLLFDALASV FTDLGFDALG DPVFRDLVIA RVVEPTSLLD TGRVLTDLGR KPAAYATMKR TLTRCASGGY RDQVADLCFA HALAHGDVSL CLYDVTTLYF EAEKEDDLRK VGYSKERRVD PQIVVGLLVD RYGFPLEIGC FEGNRAETAT ILPIIRQFKD RHQLENLVVV ADAGMLSATN LRELDDAGFG FIVGSRVTKA PIDLASHFRW HGDAFTDGQV IDTVTPRTGR NRDNDTDVKT EPVWTRDQHP RSWRAVWAYS AKRAARDNKT LTAQENRARA VVDGEKTTRT PRFVTVKGDA ATLDEASLTR ARRLVGLKGY VTNLPVTVLT ADQVISNYHD LWHVEQSFRM SKTDLAARPM FVRTKEAIDA HLTIVFTALA VARTVQNRTG LAVRNVIRQL RPLRSATIAI NGAIQTFPPA INPDKQAVLD TLHAAAVTH
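The two sequence fields can be related to each other and to the GC content field with a few lines of code. The sketch below is an illustration under stated assumptions, not the method used to build this record: it strips the spaces between the 10-character blocks, computes the G+C fraction, and translates a CDS using the amino acid assignments of translation table 11. It assumes the input is a complete CDS on the coding strand, so the first codon is always reported as M (the gene here starts with the alternative start codon GTG) and a trailing stop codon is dropped. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acid assignments of translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; assumes the sequence starts at the initiator codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any incomplete trailing codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: the initiator codon is read as methionine
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    return "".join(residues)


if __name__ == "__main__":
    first_blocks = "GTGCGGACGG CCTCGGGTGC GACGGCGGTG"  # first 30 bases of the gene sequence
    print(translate_cds(first_blocks))
```

Run on the first three blocks of the gene sequence, `translate_cds` returns `MRTASGATAV`, which matches the first block of the protein sequence. Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content field.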