Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4441 |
Symbol | |
ID | 5672793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5306417 |
End bp | 5307358 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243310 |
Product | transposase, ISlxx5 |
Protein accession | YP_001508726 |
Protein GI | 158316218 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAT CCCGGTGGAA CGTCCGCAAG CGAGGCCCTG AGCCGCAGGC AGCCAAGCGT GATCACTACC TTCTGCTGAT GGCGCAGGGC ATGAGCAACT CGGTAGCGTG CCGGGAGGTC GGGATCAACC GCAGGACCGG AACACGGTGG CGATACGGGC GCACCAGCAC CGACCACGCC GGCCGTACCC GCTCCTACCC ACCGATCACC CAGCAGCCGC GTGTGGTGTC CTCGCGGTTC CTGTCCGAGG ACGAGCGGGT CCGCATCGCT GACCTGCTGC GCACCGGGAA CTCGATGCGG GAGGTCGCCC GCCAACTCGG CCGCAGCCCC GCCACGATCA GCCGCGAGAT CCGCCGCAAC CGCGACCCGC ACTCCGGGGT CTACCACCCT CACCAGGCCG AACAGCGCGC AGTGGCTCGC CGCACCCGGT TCACCGACGG GAAGCTGCGG TGTAACCCGC AGCTGCGCCA GTTCGTCCAG CAGCGCCTGG ACCAGCACTG GAGCCCCGAG CAGATCAGCG TCGCACTCCG GCAGGAATTC CCCGACGACC CGGACATGCA GGCCGCACCC GAGACCATCT ACCAGGCCCT CTACCGGCCC GAACGCGGCG GCCTGGACCG CGACACCGCC ACGAAACTAC GCACCCGCCG ACGGTCCCGC CGCCCGCGAC GCCGTCCTGA CCAGCGCGCC ACCCGCTTCG TCGGCCCCGG CACCCTGATC ACCCAGCGGC CGGCCGAAGT GCAGGACAGG ATCCAGCCCG GTCACTGGGA AGGCGACCTC ATCGTCGGCC AGGGCAACCG GTCCGCCATC GCCACCCTGG TCGAGCGCTC CACCCGCTAC CTGACGCTGC TGCACCTGCC CGGCGGCGCG GTGCCGACCA AGTCCTCGAC GCGCTCGTAC GCGAGATCTC CGGCCTGCCC ACCCCACTGG CGCGTTCGCT GA
|
Protein sequence | MTGSRWNVRK RGPEPQAAKR DHYLLLMAQG MSNSVACREV GINRRTGTRW RYGRTSTDHA GRTRSYPPIT QQPRVVSSRF LSEDERVRIA DLLRTGNSMR EVARQLGRSP ATISREIRRN RDPHSGVYHP HQAEQRAVAR RTRFTDGKLR CNPQLRQFVQ QRLDQHWSPE QISVALRQEF PDDPDMQAAP ETIYQALYRP ERGGLDRDTA TKLRTRRRSR RPRRRPDQRA TRFVGPGTLI TQRPAEVQDR IQPGHWEGDL IVGQGNRSAI ATLVERSTRY LTLLHLPGGA VPTKSSTRSY ARSPACPPHW RVR
|
| |