Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1515 |
Symbol | |
ID | 5669919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1818850 |
End bp | 1819539 |
Gene Length | 690 bp |
Protein Length | 229 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240435 |
Product | HAD family hydrolase |
Protein accession | YP_001505861 |
Protein GI | 158313353 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000770474 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTGCGCG CAATCCTGCT CGACTTCTAC GGCACCGTCG TGACCGAAGA CGACTACACC ATCGAGATCG TCTGCGAACA GGTCCGTGTC ACCGCCACCG GACGACCTGA CCTGACCGCC GCGGAGGTCG GCGCCTACTG GCGACAGGTG TTCCAGGAGG AGACGGGCAG AAGCATCGCG GAGGCGTTCC GCACCCAGCG GGACATCACC CTTTCGTCGC TGGCCCGCGC ACTACGACGC TTCGGCTCCA CCGCCGACCC GTACATGCTG TGCACGCCAC AGTTCGACCT CTGGCGCCAG CCGAAACTCT GCGCCGACAG CAGGGCCTTC CTGGACGCGC TCGACCTGCC GGTATGCGTC GTGTCCAACA TCGACCGGGC GGATCTGCGC ACCGCGATCG ACCATCACCA GCTGCCACTG GACCTGCTGG TCACCAGCGA GGACGCCCGC TGCTACAAAC CGCACCCGGC CATCTTCCAG ACCGCGACGC GACTACTCGG GCTGCCCCCC GACGCCGTGC TCCACATCGG CGACTCGCTG ACCTCCGACG TCGCCGGCGC CCACGCGCTG GGCATCCCCA CCATCTGGGT CAACCGGTCA GGACGGCCCC GCCCCGCCGA CCTGACCTCG ATCGCGGAGG TCGGTGCCCT CACCGAAGCG CTCCCGCTGC TGCAGCAGGC GCGCCGGTAG
|
Protein sequence | MLRAILLDFY GTVVTEDDYT IEIVCEQVRV TATGRPDLTA AEVGAYWRQV FQEETGRSIA EAFRTQRDIT LSSLARALRR FGSTADPYML CTPQFDLWRQ PKLCADSRAF LDALDLPVCV VSNIDRADLR TAIDHHQLPL DLLVTSEDAR CYKPHPAIFQ TATRLLGLPP DAVLHIGDSL TSDVAGAHAL GIPTIWVNRS GRPRPADLTS IAEVGALTEA LPLLQQARR
|
| |