Gene Franean1_1515 details


Gene Information

Locus tag: Franean1_1515
Symbol:
ID: 5669919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1818850
End bp: 1819539
Gene Length: 690 bp
Protein Length: 229 aa
Translation table: 11
GC content: 69%
IMG OID: 641240435
Product: HAD family hydrolase
Protein accession: YP_001505861
Protein GI: 158313353
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01428] 2-haloalkanoic acid dehalogenase, type II
            [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
            [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
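
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption consistent with the listed 690 bp, not something the record states) and that the final codon is a stop codon; the variable names are illustrative only:

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 1818850
end_bp = 1819539

# With inclusive coordinates both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be the stop
# codon, which contributes no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.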


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000770474
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCTGCGCG CAATCCTGCT CGACTTCTAC GGCACCGTCG TGACCGAAGA CGACTACACC 
ATCGAGATCG TCTGCGAACA GGTCCGTGTC ACCGCCACCG GACGACCTGA CCTGACCGCC
GCGGAGGTCG GCGCCTACTG GCGACAGGTG TTCCAGGAGG AGACGGGCAG AAGCATCGCG
GAGGCGTTCC GCACCCAGCG GGACATCACC CTTTCGTCGC TGGCCCGCGC ACTACGACGC
TTCGGCTCCA CCGCCGACCC GTACATGCTG TGCACGCCAC AGTTCGACCT CTGGCGCCAG
CCGAAACTCT GCGCCGACAG CAGGGCCTTC CTGGACGCGC TCGACCTGCC GGTATGCGTC
GTGTCCAACA TCGACCGGGC GGATCTGCGC ACCGCGATCG ACCATCACCA GCTGCCACTG
GACCTGCTGG TCACCAGCGA GGACGCCCGC TGCTACAAAC CGCACCCGGC CATCTTCCAG
ACCGCGACGC GACTACTCGG GCTGCCCCCC GACGCCGTGC TCCACATCGG CGACTCGCTG
ACCTCCGACG TCGCCGGCGC CCACGCGCTG GGCATCCCCA CCATCTGGGT CAACCGGTCA
GGACGGCCCC GCCCCGCCGA CCTGACCTCG ATCGCGGAGG TCGGTGCCCT CACCGAAGCG
CTCCCGCTGC TGCAGCAGGC GCGCCGGTAG
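
The length and GC content reported for this gene can be rechecked directly from the sequence above. The following is a small sketch using only the Python standard library; the variable names are hypothetical, and the way the database itself computes or rounds GC content is not stated in the record:

```python
# Gene sequence copied from the record, in its original 10-base blocks.
gene_sequence = """
GTGCTGCGCG CAATCCTGCT CGACTTCTAC GGCACCGTCG TGACCGAAGA CGACTACACC
ATCGAGATCG TCTGCGAACA GGTCCGTGTC ACCGCCACCG GACGACCTGA CCTGACCGCC
GCGGAGGTCG GCGCCTACTG GCGACAGGTG TTCCAGGAGG AGACGGGCAG AAGCATCGCG
GAGGCGTTCC GCACCCAGCG GGACATCACC CTTTCGTCGC TGGCCCGCGC ACTACGACGC
TTCGGCTCCA CCGCCGACCC GTACATGCTG TGCACGCCAC AGTTCGACCT CTGGCGCCAG
CCGAAACTCT GCGCCGACAG CAGGGCCTTC CTGGACGCGC TCGACCTGCC GGTATGCGTC
GTGTCCAACA TCGACCGGGC GGATCTGCGC ACCGCGATCG ACCATCACCA GCTGCCACTG
GACCTGCTGG TCACCAGCGA GGACGCCCGC TGCTACAAAC CGCACCCGGC CATCTTCCAG
ACCGCGACGC GACTACTCGG GCTGCCCCCC GACGCCGTGC TCCACATCGG CGACTCGCTG
ACCTCCGACG TCGCCGGCGC CCACGCGCTG GGCATCCCCA CCATCTGGGT CAACCGGTCA
GGACGGCCCC GCCCCGCCGA CCTGACCTCG ATCGCGGAGG TCGGTGCCCT CACCGAAGCG
CTCCCGCTGC TGCAGCAGGC GCGCCGGTAG
"""

# Remove the spaces and line breaks used for display.
seq = "".join(gene_sequence.split()).upper()

# GC content is the fraction of bases that are G or C.
gc_count = sum(1 for base in seq if base in "GC")
gc_fraction = gc_count / len(seq)

print(len(seq), "bp")
print(f"GC content: {gc_fraction:.1%}")
```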
 
Protein sequence
MLRAILLDFY GTVVTEDDYT IEIVCEQVRV TATGRPDLTA AEVGAYWRQV FQEETGRSIA 
EAFRTQRDIT LSSLARALRR FGSTADPYML CTPQFDLWRQ PKLCADSRAF LDALDLPVCV
VSNIDRADLR TAIDHHQLPL DLLVTSEDAR CYKPHPAIFQ TATRLLGLPP DAVLHIGDSL
TSDVAGAHAL GIPTIWVNRS GRPRPADLTS IAEVGALTEA LPLLQQARR
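
The protein begins with M although the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons and is read as methionine when it starts a coding sequence. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a simplified illustration, not the database's own pipeline: `translate_cds` and the constant names are hypothetical, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

# Codon assignments in the conventional TCAG order. Table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, reading a valid start codon as M
    and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
first_30 = "GTGCTGCGCG CAATCCTGCT CGACTTCTAC"
print(translate_cds(first_30))
```

The same function can be applied to the full 690 bp sequence to compare the result against the listed protein sequence.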