Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2794 |
Symbol | |
ID | 5671183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3304957 |
End bp | 3306213 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241703 |
Product | transposase IS4 family protein |
Protein accession | YP_001507123 |
Protein GI | 158314615 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5659] FOG: Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTGA GTGAGATGGA CCGGGTGCGG CCGGTGATCG AACGGTTCGC GGGTGAGATG TTCGCGGATC TGCCGCGGCG GGATCAGCGG GGCAAGGGTG AGCTGTATGT GCGGGGGCTG CTGACCGACG GCAAGCGCAA GAGCATGGTC CCGATGGCCG CCCGCCTCGG CGTGGACCAT CAGCAGCTAC AACAGTTCGT GACCAGCTCG ACCTGGGACT ACCGCCAGGT GCGGCGGCGG CTGACGGGCT GGGCGACCGG GTTCCTCGAC CCGGTGGCGC TGGTGGTGGA CGACACCGGT TTCCCCAAGG ACGGGCCGGC CTCGCCCGGG GTGGCCCGGA TGTACTCCGG CACCCTGGGG AAGGTCGGGA ACTGTCAGAT CGGGGTGTCG GTGCACGCGG TGACCGACTG GGCGTCGGCC GCGGTGGACT GGCGGCTGTT CCTCCCGGCC TCCTGGGACG ACACCGCCCT GTCCGACCCG CAGGAGAGCG CCGCCGCGCG GGCCCGGCGG GCACACGCGG GGGTCCCGGA CGAGGCGCGG CACCGGGAGA AGTGGCGGCT GGCCCTGGAC ATGATCGACG AGCTGGCCGG CTGGGGTATG CCCGTCCGGC CGGTGGTCGC GGACGCCGGC TACGGTGACG CCGCCGAGTT CCGCCAGGGC CTGACCGACC GGAACATCCC CTACGTGCTG GCGGTGAAGC CGACCGCGAC CGCCTACCCC GCCGACGCCA CGCCGGTCAC CGCCCCGTAC TCCGGGAACG GCCGTCTGCC CGTGCCCGCC TACCCCGACC CACCCCGGGA TCTGAAATCC CTGGTCATGG CCGCCGGCCG CCGCGCGGGC CGGTACGTGA CCTGGCGTCA CGGCACCCAC AAGACCCCGG ACAACCCGAC CGCAGGGATG CGCTCCCGCT TCCTCGCACT CCGGGTCCGC CCCGCGGGCC GGAACATCAC CCGCAAGTCC GACCGGAGCC TGCCGGACTG CTGGCTGCTG GCCGAATGGC CCCCCGGCCA GCCCGAGCCC ACCGACTACT GGCTGTCCAC CCTGCCCACC GAGATCCCGA TCCGCGACCT CGTCCGTCTC GCGAAGATCC GCTGGCGGAT CGAACACGAC TACCGCGAAC TCAAAGACGG CCTCGGCCTC GACCACTTCG AAGGCCGGAC CTGGACCGGC TGGCACCACC ACGTGACCCT CGTCAGCATC GCCCAAGCCC TCTGCACCCA GCTGAGACGA ACCCCAAAAG TCCCTGCGCC GGCCTGA
|
Protein sequence | MELSEMDRVR PVIERFAGEM FADLPRRDQR GKGELYVRGL LTDGKRKSMV PMAARLGVDH QQLQQFVTSS TWDYRQVRRR LTGWATGFLD PVALVVDDTG FPKDGPASPG VARMYSGTLG KVGNCQIGVS VHAVTDWASA AVDWRLFLPA SWDDTALSDP QESAAARARR AHAGVPDEAR HREKWRLALD MIDELAGWGM PVRPVVADAG YGDAAEFRQG LTDRNIPYVL AVKPTATAYP ADATPVTAPY SGNGRLPVPA YPDPPRDLKS LVMAAGRRAG RYVTWRHGTH KTPDNPTAGM RSRFLALRVR PAGRNITRKS DRSLPDCWLL AEWPPGQPEP TDYWLSTLPT EIPIRDLVRL AKIRWRIEHD YRELKDGLGL DHFEGRTWTG WHHHVTLVSI AQALCTQLRR TPKVPAPA
|
| |