Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5889 |
Symbol | |
ID | 5674211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7149587 |
End bp | 7152178 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244738 |
Product | HAD family hydrolase |
Protein accession | YP_001510140 |
Protein GI | 158317632 |
COG category | [R] General function prediction only |
COG ID | [COG5610] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0387615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.138972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCACCG ACGTCCGACT TCGCGAGGTC TGGTCGCATC TGGAGTCGGG CGGTGCGCAG GCCCTGACTC TGGACGTATT CGACACTCTG CTCTGGCGCA TGGTGCCCGA GCCGGTTCAC GCGTTCATCA CCCTGGGGCA CCGGCTGGCC GACCTCGGCC AGCTGCCGTC CGGTGTCACG CCGGGCGAGT TCGCCCGGCT GCGGGTCTTC GCCGAGCACA AGGCCAGGCT GCACTCGCAC GAGGTGCGCG GCACCCACGA GGTGCGCCTG GACGAGATCT GGCAGGTTCT CGTCCCCGCG CTGCCCGGCG CGGGCAGCAT CGGCGACCTG ATGGACGTCG AGCTCGCCGT CGAGCGGGAC CTGTGCCGCG CCGACCTGGC CGTGGTGGAG CTCGCCGAGC TCGCCATGAC CAAGCTCGGC CTGCCGGTCT ACCTGCTCTC CGACACATAC TTCTCGGCCG CCCAGCTCGA GCGGCTGCTC AGCCGCCCGG AGCTCGCCGG GGTGCCGTTC ACCCAGATCT TCACCTCCTC CGACTCGGGG ATCAGCAAGA GCGACGGGCT GTTCCGGCAC ATGCTGGCGG CCTCGAACCT GCAGCCCTCG CGGGTCGTCC ACCTCGGCGA CCACCCGGTC GCGGACGTGG AGAGCGCCCG CGAGCACGGG CTGGTCGCCA TCCACTACCC GAAGTACTCG GGTTCGCTGA AGGCGACGAT CGAGCTGGAG GGCCTGCTCA CCGGCCCCGG GGACGACACC CCCCTCGACG CCGCCAACGG CGACTACGGC ATGACCGCGC TGCGGGCCCG CTCGCTGCAC CGGGCGGACG CCGCCGCCGT CCCGCCGGGG CTGCGGCGCT ACTGGGAGTC CGGGGCGACG GTGTTCGGCC CGGTCTTCAC CGGGTTCGCC GACTGGGCGG TGGAACGCAC CGGGGACCAC GGCGCCGACC ACATCTACTG CCTGATGCGG GAGGGTGACT TCCTCTCCCG GCTGATCGCC GACCCGGGTG CCGACGCCGG GGTGACCACC TCGACGCTGT GGGCGTCCCG GCAGGTCTGC GCGCTGGCCA ACGTGTTCGA GGGCAGCCCC GAGGAGCTGC GCGGGTTCCT CGTCCGCCGG CACGCGCCCA GCGTCGGCCA GCTGCTGCGC CAGCTCGGGG TGGCCATCGA CAACGTCGCC GGCATCTCCT CGCTCACCGA CCGCCGGCTC GACGTCCCCG GCCTGCTCGA CGACACTCTC GAGGCGCTGT GCTCCGACGA GCGCATCCGC AGCGAGATCG TGCTGACCGC CGGCCGGCTG CGTGACCGCT ACGTCCAGTA CCTCGACACC CAGCTGCCCG ACTCGGGCCG CATCGTGCTG GTCGACCTGG GCTGGGGCGG CACCATCCAG GCGCTGCTCG CCCGGCTGCT CGCCTCCACC GGGCGCGAGT TCGACGTCGT CGGCCTGTAC ATGGCGACGA ACGCCGCCGC CGGCACGCAC CGCCTCGCCG GCCTGCAGAT CGAGGGTTAC GCGGCCTCCG GCGGGCAGCC CGAGCTGATG GCCAACCAGC TCATGCGCAG CCCCGAGGTG CTCGAACAGC TCTGTATGCC CGACATCGGC TCGCTGGTCT CCTTCGACGA CGACGGCCGG CCCGTCCTGA GCATCGACCG GACGTCGCGG ACCCAGGTCG CCCAGCGGAC CGCCGTCCAG GACGGCATCC TCGCCTTCCA GCGGGAATGG CTGCGCTACC GGCGCTCGGA GACGGCGATG CCGTCGCTGT CCGAGCCCGG GGCCCGGCAC GCGTCGCTGC GGACGCTGAC CCGGTTCGTC TCCCGTCCCA CCGCGGCCGA GGCGTCGGCG TTCGGGGCCT GGGCGCACGA CGACAACTTC GGCTCCGACT CCACCGAGGG CCTGCTCCCG CCCGAGCTGG TCCGCCGGAT GCCCTACCTG ACCCCCGCGG ACGTCGAGAA GATCACGATG CGGGAGCTGT ACTGGCCCGC GGGCGTGGCC GGAGTCGCGA ACCGGTCGCT CGCGGTGATC AGCGGCCTGG CGGCCGCGGC CGGGGTGCCC GCCGAGGAGG TCTCCCCGGA GGCCGCGGCC GGCCCCGTCG AGGTGTACGT CGACACCGGC ACCGACTTCG TCCACGGGCA GAAGGAGACC GCGGTCACCC GTTCCGGCCG GGACGGGATG TCGATCGTGC GCCTGCGCAT CGACGGGGTC GGCGTCCGGC GGGTGCGCAT CGACCCGGCC GGCCGGCGCG GCCTGCTCCG TCTCGACTGG CTGACGATCT CGTTCCACCT GCACAACGCC GTCGAGCCGT ACAAGGTGAC CGTCACGTCG CTGGACGACC TCGCCGGCCA GCAGATCGCG CTGGTCGGGG TCCGCCTGCT GCAGGCCAAC CTGCTGGAGA TCGTCGGCGA CGATCCCCAG ATCATCTACA CGATTGACTT CACCTCCCAG CCCACGCTGG GCGGAACGTA CGCGATCGAG GTCGAGATGG CTTTCGGCTG GTTGGGGATC CGGGCCGACA CGCTTCAGGT GCGCACGGCC GGTGTCGCCC GCACCGGCCT GCCCGTCCGC GCCGCCCGCA AGATCCGTCG CGAGCTGGGC GGCCTGCGAT GA
|
Protein sequence | MFTDVRLREV WSHLESGGAQ ALTLDVFDTL LWRMVPEPVH AFITLGHRLA DLGQLPSGVT PGEFARLRVF AEHKARLHSH EVRGTHEVRL DEIWQVLVPA LPGAGSIGDL MDVELAVERD LCRADLAVVE LAELAMTKLG LPVYLLSDTY FSAAQLERLL SRPELAGVPF TQIFTSSDSG ISKSDGLFRH MLAASNLQPS RVVHLGDHPV ADVESAREHG LVAIHYPKYS GSLKATIELE GLLTGPGDDT PLDAANGDYG MTALRARSLH RADAAAVPPG LRRYWESGAT VFGPVFTGFA DWAVERTGDH GADHIYCLMR EGDFLSRLIA DPGADAGVTT STLWASRQVC ALANVFEGSP EELRGFLVRR HAPSVGQLLR QLGVAIDNVA GISSLTDRRL DVPGLLDDTL EALCSDERIR SEIVLTAGRL RDRYVQYLDT QLPDSGRIVL VDLGWGGTIQ ALLARLLAST GREFDVVGLY MATNAAAGTH RLAGLQIEGY AASGGQPELM ANQLMRSPEV LEQLCMPDIG SLVSFDDDGR PVLSIDRTSR TQVAQRTAVQ DGILAFQREW LRYRRSETAM PSLSEPGARH ASLRTLTRFV SRPTAAEASA FGAWAHDDNF GSDSTEGLLP PELVRRMPYL TPADVEKITM RELYWPAGVA GVANRSLAVI SGLAAAAGVP AEEVSPEAAA GPVEVYVDTG TDFVHGQKET AVTRSGRDGM SIVRLRIDGV GVRRVRIDPA GRRGLLRLDW LTISFHLHNA VEPYKVTVTS LDDLAGQQIA LVGVRLLQAN LLEIVGDDPQ IIYTIDFTSQ PTLGGTYAIE VEMAFGWLGI RADTLQVRTA GVARTGLPVR AARKIRRELG GLR
|
| |