Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3903 |
Symbol | |
ID | 5672264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4668253 |
End bp | 4669761 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242782 |
Product | transposase |
Protein accession | YP_001508199 |
Protein GI | 158315691 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.199615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGGTG TCATGGGTGG GTTCCGTTGC GACTGCGGTA GTTGCCGTGG GCGAGGATGG CGGCACGGTA GGCGAGGATC CTGTCTGTAC GCCCTCGGCC TTGAGCTTGA CGACGCAGGC TTCGACTACT CCGTGCTCAG CGGGTTCCGG GCGCGGCTGG TCGAGCATGG CCTGGAAGCC ACGGTCCTCG ACCTGCTGCT CGCCCGGCTC TCGGAGCTGG GGCTGCTGCG CGCGGGCGGG CGAGCCCGCA CGGACTCCAC CCATGTGCTC GCGGCGGTGC GTTCGCTCAA CCGGCTGGAG TTCGTCGGGG AGACGATGCG CGCCGCGCTG GAGGCGCTGG CCGGGGCGGC CCCCGGCTGG CTGACCGGCG TCGTCGAGCC CGGCTGGGAC CGCCGCTACC AGGCACGGGT GGAGTCCTAC CGGCTGCCCG CCTCGGCGGC GGGGCGGGCC GAGCTGGCTG TCACGGTCGG CCGGGACGGG TTCGCCCTGC TCACTGCCCT CGGCGCGGCG GACGCGCCGG TGTGGTTGCG CCAGGTCCCC GCCGTCGACG TCCTGCGGAT CGCCTGGATC CAGCAGTACC ACCGCACCGA GAACGCCGAC GGCGGGACGG AGGTGGCCTG GCGGGAGGAC AAGGACCTCC CGCCAGGAAG GCTGCGGCTG GCCTCGCCCT ACGACCTGGC CGCCCGTTAC GGGGTCAAAC GCGGGTCCGG CTGGGTCGGC TACAAGGTCC ACCTGACCGA GACCTGCAAC GACGACACCC CCCATGTGAT CACGAACGTG GAGACGACCC ACGCCACCAT GACCGACCAG GAGACGACCC CGGTCGTGCA CACCCACCTC GCCAGCCGCG GGCTGCTGCC CGCCGAGCAC ACCGTCGACA CCGGCTACAC CTCCACCGAG CTGTTCCTGA CCAGCGCGGC CGAGTACGGC GTCGAACTGG TCGGCCCGCT CGGGGTGAAC ACCTCCTGGC AGGCCCGCAC CCCCGGCGCC TTCGACCTGT CCGACTTCGC CGTCGACTGG GACCGCGAGC AGGTCACCTG CCCGAACGGA GCGGTCAGCA GCAGCTGGCG CACCGAGAAG GCCCGAGGCA AGCCCGTCCT GAAGGTCGAC TTCCGCAGGA AGGACTGCCT GCCCTGCCCA CTGCGCGCCC GGTGCACCTC CTCGGCAACC AACACCCGCA AGCTCACCCT GCGACACCGG GAACAGCACG AACTGCTCGA ACGCCTCCGC GTCGAGCAGG CCACCGACGC CTGGAAGAAG CGCTACGACC GCCGGGCCGG CGTCGAGGGC ACCATGCGCC AGGCCACCGC CGTCACTGGT ATCCGCCACG CCCGCTACCA CGGCCAGGAC AAGAACCACC TCGCCCACGT CCTCGCGGCC ACGACGGTCA ACATCGTCCG CCTCGACGCG TGGTGGACCG GCACCCCGAC CGGACGAACC CGGACCAGCC ACTGCCCCTC GATCTCGGCG ATCCGACCAC GCCCCTCGGT CGGGCGACGC TGGCGCTGA
|
Protein sequence | MLGVMGGFRC DCGSCRGRGW RHGRRGSCLY ALGLELDDAG FDYSVLSGFR ARLVEHGLEA TVLDLLLARL SELGLLRAGG RARTDSTHVL AAVRSLNRLE FVGETMRAAL EALAGAAPGW LTGVVEPGWD RRYQARVESY RLPASAAGRA ELAVTVGRDG FALLTALGAA DAPVWLRQVP AVDVLRIAWI QQYHRTENAD GGTEVAWRED KDLPPGRLRL ASPYDLAARY GVKRGSGWVG YKVHLTETCN DDTPHVITNV ETTHATMTDQ ETTPVVHTHL ASRGLLPAEH TVDTGYTSTE LFLTSAAEYG VELVGPLGVN TSWQARTPGA FDLSDFAVDW DREQVTCPNG AVSSSWRTEK ARGKPVLKVD FRRKDCLPCP LRARCTSSAT NTRKLTLRHR EQHELLERLR VEQATDAWKK RYDRRAGVEG TMRQATAVTG IRHARYHGQD KNHLAHVLAA TTVNIVRLDA WWTGTPTGRT RTSHCPSISA IRPRPSVGRR WR
|
| |