Gene Franean1_5889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5889 
Symbol 
ID5674211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7149587 
End bp7152178 
Gene Length2592 bp 
Protein Length863 aa 
Translation table11 
GC content72% 
IMG OID641244738 
ProductHAD family hydrolase 
Protein accessionYP_001510140 
Protein GI158317632 
COG category[R] General function prediction only 
COG ID[COG5610] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0387615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.138972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCACCG ACGTCCGACT TCGCGAGGTC TGGTCGCATC TGGAGTCGGG CGGTGCGCAG 
GCCCTGACTC TGGACGTATT CGACACTCTG CTCTGGCGCA TGGTGCCCGA GCCGGTTCAC
GCGTTCATCA CCCTGGGGCA CCGGCTGGCC GACCTCGGCC AGCTGCCGTC CGGTGTCACG
CCGGGCGAGT TCGCCCGGCT GCGGGTCTTC GCCGAGCACA AGGCCAGGCT GCACTCGCAC
GAGGTGCGCG GCACCCACGA GGTGCGCCTG GACGAGATCT GGCAGGTTCT CGTCCCCGCG
CTGCCCGGCG CGGGCAGCAT CGGCGACCTG ATGGACGTCG AGCTCGCCGT CGAGCGGGAC
CTGTGCCGCG CCGACCTGGC CGTGGTGGAG CTCGCCGAGC TCGCCATGAC CAAGCTCGGC
CTGCCGGTCT ACCTGCTCTC CGACACATAC TTCTCGGCCG CCCAGCTCGA GCGGCTGCTC
AGCCGCCCGG AGCTCGCCGG GGTGCCGTTC ACCCAGATCT TCACCTCCTC CGACTCGGGG
ATCAGCAAGA GCGACGGGCT GTTCCGGCAC ATGCTGGCGG CCTCGAACCT GCAGCCCTCG
CGGGTCGTCC ACCTCGGCGA CCACCCGGTC GCGGACGTGG AGAGCGCCCG CGAGCACGGG
CTGGTCGCCA TCCACTACCC GAAGTACTCG GGTTCGCTGA AGGCGACGAT CGAGCTGGAG
GGCCTGCTCA CCGGCCCCGG GGACGACACC CCCCTCGACG CCGCCAACGG CGACTACGGC
ATGACCGCGC TGCGGGCCCG CTCGCTGCAC CGGGCGGACG CCGCCGCCGT CCCGCCGGGG
CTGCGGCGCT ACTGGGAGTC CGGGGCGACG GTGTTCGGCC CGGTCTTCAC CGGGTTCGCC
GACTGGGCGG TGGAACGCAC CGGGGACCAC GGCGCCGACC ACATCTACTG CCTGATGCGG
GAGGGTGACT TCCTCTCCCG GCTGATCGCC GACCCGGGTG CCGACGCCGG GGTGACCACC
TCGACGCTGT GGGCGTCCCG GCAGGTCTGC GCGCTGGCCA ACGTGTTCGA GGGCAGCCCC
GAGGAGCTGC GCGGGTTCCT CGTCCGCCGG CACGCGCCCA GCGTCGGCCA GCTGCTGCGC
CAGCTCGGGG TGGCCATCGA CAACGTCGCC GGCATCTCCT CGCTCACCGA CCGCCGGCTC
GACGTCCCCG GCCTGCTCGA CGACACTCTC GAGGCGCTGT GCTCCGACGA GCGCATCCGC
AGCGAGATCG TGCTGACCGC CGGCCGGCTG CGTGACCGCT ACGTCCAGTA CCTCGACACC
CAGCTGCCCG ACTCGGGCCG CATCGTGCTG GTCGACCTGG GCTGGGGCGG CACCATCCAG
GCGCTGCTCG CCCGGCTGCT CGCCTCCACC GGGCGCGAGT TCGACGTCGT CGGCCTGTAC
ATGGCGACGA ACGCCGCCGC CGGCACGCAC CGCCTCGCCG GCCTGCAGAT CGAGGGTTAC
GCGGCCTCCG GCGGGCAGCC CGAGCTGATG GCCAACCAGC TCATGCGCAG CCCCGAGGTG
CTCGAACAGC TCTGTATGCC CGACATCGGC TCGCTGGTCT CCTTCGACGA CGACGGCCGG
CCCGTCCTGA GCATCGACCG GACGTCGCGG ACCCAGGTCG CCCAGCGGAC CGCCGTCCAG
GACGGCATCC TCGCCTTCCA GCGGGAATGG CTGCGCTACC GGCGCTCGGA GACGGCGATG
CCGTCGCTGT CCGAGCCCGG GGCCCGGCAC GCGTCGCTGC GGACGCTGAC CCGGTTCGTC
TCCCGTCCCA CCGCGGCCGA GGCGTCGGCG TTCGGGGCCT GGGCGCACGA CGACAACTTC
GGCTCCGACT CCACCGAGGG CCTGCTCCCG CCCGAGCTGG TCCGCCGGAT GCCCTACCTG
ACCCCCGCGG ACGTCGAGAA GATCACGATG CGGGAGCTGT ACTGGCCCGC GGGCGTGGCC
GGAGTCGCGA ACCGGTCGCT CGCGGTGATC AGCGGCCTGG CGGCCGCGGC CGGGGTGCCC
GCCGAGGAGG TCTCCCCGGA GGCCGCGGCC GGCCCCGTCG AGGTGTACGT CGACACCGGC
ACCGACTTCG TCCACGGGCA GAAGGAGACC GCGGTCACCC GTTCCGGCCG GGACGGGATG
TCGATCGTGC GCCTGCGCAT CGACGGGGTC GGCGTCCGGC GGGTGCGCAT CGACCCGGCC
GGCCGGCGCG GCCTGCTCCG TCTCGACTGG CTGACGATCT CGTTCCACCT GCACAACGCC
GTCGAGCCGT ACAAGGTGAC CGTCACGTCG CTGGACGACC TCGCCGGCCA GCAGATCGCG
CTGGTCGGGG TCCGCCTGCT GCAGGCCAAC CTGCTGGAGA TCGTCGGCGA CGATCCCCAG
ATCATCTACA CGATTGACTT CACCTCCCAG CCCACGCTGG GCGGAACGTA CGCGATCGAG
GTCGAGATGG CTTTCGGCTG GTTGGGGATC CGGGCCGACA CGCTTCAGGT GCGCACGGCC
GGTGTCGCCC GCACCGGCCT GCCCGTCCGC GCCGCCCGCA AGATCCGTCG CGAGCTGGGC
GGCCTGCGAT GA
 
Protein sequence
MFTDVRLREV WSHLESGGAQ ALTLDVFDTL LWRMVPEPVH AFITLGHRLA DLGQLPSGVT 
PGEFARLRVF AEHKARLHSH EVRGTHEVRL DEIWQVLVPA LPGAGSIGDL MDVELAVERD
LCRADLAVVE LAELAMTKLG LPVYLLSDTY FSAAQLERLL SRPELAGVPF TQIFTSSDSG
ISKSDGLFRH MLAASNLQPS RVVHLGDHPV ADVESAREHG LVAIHYPKYS GSLKATIELE
GLLTGPGDDT PLDAANGDYG MTALRARSLH RADAAAVPPG LRRYWESGAT VFGPVFTGFA
DWAVERTGDH GADHIYCLMR EGDFLSRLIA DPGADAGVTT STLWASRQVC ALANVFEGSP
EELRGFLVRR HAPSVGQLLR QLGVAIDNVA GISSLTDRRL DVPGLLDDTL EALCSDERIR
SEIVLTAGRL RDRYVQYLDT QLPDSGRIVL VDLGWGGTIQ ALLARLLAST GREFDVVGLY
MATNAAAGTH RLAGLQIEGY AASGGQPELM ANQLMRSPEV LEQLCMPDIG SLVSFDDDGR
PVLSIDRTSR TQVAQRTAVQ DGILAFQREW LRYRRSETAM PSLSEPGARH ASLRTLTRFV
SRPTAAEASA FGAWAHDDNF GSDSTEGLLP PELVRRMPYL TPADVEKITM RELYWPAGVA
GVANRSLAVI SGLAAAAGVP AEEVSPEAAA GPVEVYVDTG TDFVHGQKET AVTRSGRDGM
SIVRLRIDGV GVRRVRIDPA GRRGLLRLDW LTISFHLHNA VEPYKVTVTS LDDLAGQQIA
LVGVRLLQAN LLEIVGDDPQ IIYTIDFTSQ PTLGGTYAIE VEMAFGWLGI RADTLQVRTA
GVARTGLPVR AARKIRRELG GLR