Gene Franean1_4170 details


Gene Information

Locus tag: Franean1_4170
Symbol:
ID: 5672525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4955996
End bp: 4958359
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 73%
IMG OID: 641243043
Product: HAD family hydrolase
Protein accession: YP_001508460
Protein GI: 158315952
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0560] Phosphoserine phosphatase
        [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes
TIGRFAM ID: [TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like
            [TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490
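
The coordinate and length fields in this record are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention, so both are assumptions, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4955996, 4958359)
print(length)                  # 2364
print(protein_length(length))  # 787
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.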


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.243579
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.356217
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGCTGC GCGAGCGGCT CGCGGGCAGG CGTGTGTTCG TGACCGGGGT GACCGGGTTC 
GTCGGTGAGG CGCTGTTGGA GCGGCTGCTC TCGGAGTTCC CGGACACCCA GGTCGTGGCG
CTCGTGCGGC CCCGGGGCAG CCATTCGGGG ATGGCCCGGT TGGAGCGGAT GACCCGTAAG
CCGGCGTTCG CCGGGCTGCG GGAACGGTTG GGCCGTGAGG GGCTGGCGGC GCGGCTGGCG
GCGCAGGTGC GGGTCATCGA GGGTGACCTG GCCAGTATGG GTGAGTTGCC CGCCGACCTG
GACGTGGTGA TCCACTGCGC CGGGGAGGTG TCGTTCGACC CGGCGATCGA CGAGGGGTTC
GCGACGAACC TGGGCGGGGT GCAGGAGCTG CTGCGCGCGG TGCGCGCTGC CGGGGCCCGC
CCGCATCTGG TGCACGTGTC GACCGCCTAT GTCGCCGGGT TGCGTTCGGG GCACATCGCC
GAGGGCCGGC TGGCCCACGA GGTGGACTGG GCGGCGGAGC AGGCGTCGGC CGCGCGGGCC
CGCCACGCCG CGGAGGACGC GTCGCGCTCG CCGGAGAACT CGGCGCGGTT CCGCGAGCAG
GCGTCGGCCG AGCATGTGCG GGCCGGCGCG CAGACGGTGT CGGCGCAGGC GGAGGCGGCG
CGGCGGGCGT GGGTGGCGGC GCGGTTGGTC GCCGCGGGCT CCGAACGCGC GCAGGTGCTG
GGCTGGACGG ACGCGTACAC GTTCACCAAG GCGTTGGGTG AGCGGTATCT GGAGGACTCC
CACGGCGATC TGCCGTTGAC GATCGTCCGC CCGTCGATCA TCGAGAGCGC GCTGGCGAAG
CCGTTCCCGG GGTGGATCGA GGGGTTCAAG ATGGCCGAGC CGCTGATCCT GGCGTTCGGC
CGCGGGGAGC TGCCGGACTT CCCTGCGTCC CCGGATGCCG TCGTCGACAT CATCCCGGTG
GATCTGGTCG TCAACGCGCT GCTGGCGGCG GCGGCGAGCC CCCCGCCGCC GGAGCGCCCG
GCCTATTACA CGGTGTGTTC GGGGTTCCGG AACCCGCTGC TGTTCCGTGA CCTCTACGAC
TATGTGCGGG GCTATTTCCT GGCCGATCCG CTGCCGCGGC GCGGCCGCGG GCATATCGGG
GTTCCTGAGT GGCCGTTCGC CGGGGCGGTG GCGGTGGAGG CGAAGCTGCG CCGCGGGGAG
AAGGCGGTGG AGTGGGCGAA CCGGGTGTTG GCGCACGCGC CGCGTTCGGA GCGGGTTCGC
CGGCTGGCCG TCGACTTGGA ACGTACCGAG GGGCGGGTGG CGTTCCTACG CCGCTACTCC
GATGTGTACC GGGCGTACAC GAAGGCCGAG CTGGTCTACG TCGACGACGC CACCGCGGCG
CTGCACGCGG CGATGGACCC GGCCGACCAG GTCGACTTCG GGTTCGACCC GGCGTGTTTC
GACTGGCGGC ACTACCTGCA GGACGTGCAC TGCCCGGCGG TGACGCAGGT GCTGCGCCGG
CCGCGCGACG CCGCCCCGGC GCGGCGGATG GCCCGCAACC TCACCGCCGG CGAGGGGGTG
CTGGCCGTGT TCGACCTGGA CGGGACGCTG GTGTCCTCGA CGGTGGTCGA GTCGTATCTG
TGGCTGCGGC TGGCCGACGG CGACGTCGGG GAACGGGCCC GGGAGCTGGT GTCGCTGGCG
CGGGCACTGC CGGGGTATCT GCGTGCCGAG CGCCGCGACC GGGGGCATCT GATTCGCTCG
GTGTACGGCC GGTACGCCGG CGCCGACCCG GTGGAGCTGG CCCGGGTCGT CGACGAGGTC
GCGGCGGATG TGGTGTTGCG GCGGGTGAAA CCGGCGGCGG TGCGCCGTGT GCGGGAGCAT
CGCGCGGCCG GGCATCGCAC CGTGCTGTTG ACCGGGGCGG TCGATGTGCT GACCCGTCCG
CTGGCGCCGT TGTTCGACGA GATCGTCGCG ACCGGCCTGG AAGTCGGCGC GGATGGCCGG
TACACGGGAC GGTTGTTGTC GTCGCCGCTG GTCGGGGATG CTCGGGCGGC GTTCGTGGAT
CATTACGCGC GGCGCCGGGG GGCGGACCTG TCCGCGTCGT GGGCGTACGC GGACAGCCTG
TCGGATCTGC CGATGCTGCG CACGGTGGGC AATCCGGTGG CGGTGAACCC GGATGTGGCG
TTGCACAAGG TGGCGCGGGG GGCGGGCTGG CCGATCGAGG AGTGGCCGTC GACGCCGGGT
GAGCCGCGGC TGATGGTCGC CAGCCGCGCT GAGCGGGGCC TGTTCGCCGC GGCGGCCCGC
GCCCGGGTTG CCGCGGGCGG CCTGTCAGGG TCGGGGTCGG GTGGGAGTGG TGTTGTGCGG
GTGGAAGTGG GTGAGGGCCG GTGA
 
Protein sequence
MGLRERLAGR RVFVTGVTGF VGEALLERLL SEFPDTQVVA LVRPRGSHSG MARLERMTRK 
PAFAGLRERL GREGLAARLA AQVRVIEGDL ASMGELPADL DVVIHCAGEV SFDPAIDEGF
ATNLGGVQEL LRAVRAAGAR PHLVHVSTAY VAGLRSGHIA EGRLAHEVDW AAEQASAARA
RHAAEDASRS PENSARFREQ ASAEHVRAGA QTVSAQAEAA RRAWVAARLV AAGSERAQVL
GWTDAYTFTK ALGERYLEDS HGDLPLTIVR PSIIESALAK PFPGWIEGFK MAEPLILAFG
RGELPDFPAS PDAVVDIIPV DLVVNALLAA AASPPPPERP AYYTVCSGFR NPLLFRDLYD
YVRGYFLADP LPRRGRGHIG VPEWPFAGAV AVEAKLRRGE KAVEWANRVL AHAPRSERVR
RLAVDLERTE GRVAFLRRYS DVYRAYTKAE LVYVDDATAA LHAAMDPADQ VDFGFDPACF
DWRHYLQDVH CPAVTQVLRR PRDAAPARRM ARNLTAGEGV LAVFDLDGTL VSSTVVESYL
WLRLADGDVG ERARELVSLA RALPGYLRAE RRDRGHLIRS VYGRYAGADP VELARVVDEV
AADVVLRRVK PAAVRRVREH RAAGHRTVLL TGAVDVLTRP LAPLFDEIVA TGLEVGADGR
YTGRLLSSPL VGDARAAFVD HYARRRGADL SASWAYADSL SDLPMLRTVG NPVAVNPDVA
LHKVARGAGW PIEEWPSTPG EPRLMVASRA ERGLFAAAAR ARVAAGGLSG SGSGGSGVVR
VEVGEGR
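
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. A minimal sketch of joining the blocks and computing a GC fraction, which could be compared against the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "GTGGGGCTGC GCGAGCGGCT CGCGGGCAGG CGTGTGTTCG TGACCGGGGT GACCGGGTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of `first_line` would give the length and GC fraction for the whole gene.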