Gene Franean1_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0972 
Symbol 
ID5669386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1135770 
End bp1136729 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content74% 
IMG OID641239900 
Productglycerophosphoryl diester phosphodiesterase 
Protein accessionYP_001505334 
Protein GI158312826 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.701633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGTTC TGGGCCATCG TGGAAGCCGG ACACCCGGTC CGGAGAACAC GCTCGAGGCG 
GTAGACGCCG CGTTACGGGC AGGTGCGGAC GGCGTCGAGC TCGACGTGCG CCGCAGCGCC
GACGGCGACC TGGTGTGTGT GCACGACGCG CGGCTGCCGC GGTTGGGCGG TCGGGCCGTC
ATCCGCCGGT CGACCAGTGA GCTCGCCAGT CGTGGCATCC CACTGCTGAC CGAGATGCTC
GACGTCTGGG ACGGCCGTGG CCGCCTCATC CTGGAGATCA AGAACCAGCC GGGCCAGCCG
GATTTCGACG CTCCACGCGA GCGGACGGCC CGCGCGCTCA TCGAGCTGCT GCGGGCTCGC
GGGCTGCCGG GCTCCTCCGT CGCCGGCGCG CACCTCGATC AGGCGACAGC GCCGGCCTCC
GGTGCGCCAT CCGACGTCGG CCCACGGCTC GCCGTCGGTG CGCCGCCCGG CCCCGGTGCG
CTTCCCGCGA ACGGAGCGCC TCCCGCGAAC GGAGCGTCAC TCGCGAACGG AGCGCCTCCT
GCGCCCGGCT CGGCGGGTGG GCCGGCCGGC GAATCGAGTG AGTCGAGCTC CACCGGGAAC
TCGACGCCGG CAAGCCAATC TGTGTCGGGA ATCACAGTCT CGTCCTTCGA CTGGTTCGCA
ATCGAGGCGA TCCGCGACGC CGGGCTCGGC GTCGCGACCG CGTTCCTCAC GATGCCGCGG
ATGTCGGTGA GCGGCGGGGT CGCCTACGCC CGCTCGGCAG GCCACACGGA GCTGCACGCG
CACGTGTCCG CCGTGCTGGG CGTGGCCGAC GCGGTCCCGC GCGCGCGGCG GGCCGGCCTC
CGCCTCGTCA CCTGGACGGT CACCGACCCG GCGACGGCGA TCGAGCTGCG CGACGCCGGT
GTGGACGGCG TGATCTGTGA CGACCCCGTG GGCGTCGGCC AGGCGCTGCG GCGCCCCTGA
 
Protein sequence
MEVLGHRGSR TPGPENTLEA VDAALRAGAD GVELDVRRSA DGDLVCVHDA RLPRLGGRAV 
IRRSTSELAS RGIPLLTEML DVWDGRGRLI LEIKNQPGQP DFDAPRERTA RALIELLRAR
GLPGSSVAGA HLDQATAPAS GAPSDVGPRL AVGAPPGPGA LPANGAPPAN GASLANGAPP
APGSAGGPAG ESSESSSTGN STPASQSVSG ITVSSFDWFA IEAIRDAGLG VATAFLTMPR
MSVSGGVAYA RSAGHTELHA HVSAVLGVAD AVPRARRAGL RLVTWTVTDP ATAIELRDAG
VDGVICDDPV GVGQALRRP