Gene Franean1_1500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1500 
Symbol 
ID5669904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1802424 
End bp1803725 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content72% 
IMG OID641240420 
Productdiacylglycerol kinase catalytic region 
Protein accessionYP_001505846 
Protein GI158313338 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0397379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGAT CCCGGCGGGC GGCGGCTTCA GTGGCGTTGG TGAGCTTCAC CGCGGCGGTC 
GCGGTCATTG TCGCCCGGCT GATCGACCGG CCGATTGCCC TGCCGGTTTC GGTCGCCGCT
GTGACGATCG CGTTGGTAGC GGGCTGGACG GCGCTGGTCG GACGGGGGCT GCGCCGCCTG
AGCGCGGCTG TCGTGGCGGC CTTCGCGCTC GGCGGCCTGA TCGTCTTCGC GGGGGTGGTC
GGCCTGACCA CCGTGCTGGT CACCGTGTGC CTGCTCGTCA CGTCGGGAAC CGCGGCGCGC
GGAGCCTTCG GGCGGCACCG GCCGAAAGGG ATCATGGCTG GAGCCGCCCG CCGCGGTGTC
CTGCTGGTGA ACCCGCGTTC CGGGGGCGGC GCCGCCGACC GCCACAATCT CGCCCAGGAG
GCCGCCCGGC GCGGGATCGC GGTCGTGACC CTCACGCCAG GCGCTGATCT GCGCTCCCTC
GCCGAGGACG TCGCCGACCG CGGAGCCGAC GTGGTGGGCA TGGCGGGTGG CGATGGTTCG
CAGGCGGTCG TCGCCGACGT GGCGCGGCGG CGCGGGATGG CCTTCGTCTG TGTCCCAGCC
GGTACTCGTA ACCACTTCGC CCTTGACCTC GGCCTCGATC GGAAGGATGT CGCGGGGGCG
CTCGACGCGT TCGACCTGGC TGTCGAGCAG CGCGTTGACC TCGGTCAGCT CGGTGACCGC
GTGTTCGTCA ACAACGTCTC GTTGGGCGTC TACGCCGAGA TCGTGCAGTC CGACAGCTAC
CGTGACGCCA AGATGGGAAC GGCGGCCGCG AGGCTGCCTG ACCTTCTGGG CCTGGACCGC
GCTCCGCCGG ATCTTCGCTT CACCGGCCCG GACGGCCGCG CCGGATCGAC CGCGGACGTT
CTGCTCGTGT CCAACAACGC CTACCAACTG CACAGTCTTG GTGGTTTCGG CACCCGGCCC
CGGCTCGACG GCGGCCGGCT TGGCATGGTG GCATTGCGCG TCGATCGCGC CCGCGACCTA
CCCGTACTCG TCGCGCTCGA GTCCGTGGGT GCCATCAGCC GGTTCCGTGG TTTCCACCAG
TGGACCAGCC CCACGATGCG GGTCGACTCC GCGCGACCGG TCAGCGTCGG CGTGGACGGG
GAGGCGCTGT GCCTGCCGCC ACCGCTGGAG CTGCGCTCGC TTCCGGCGGC TGTGCGCGTT
CGGATTCCGC TGCACGCGCC CGGGGTCCCG ACCGTGCGGC CGGGTGTCTG GGAGATGTTC
CCCGCGCTGA TCCGCATCGC GGGTGGCCGG TCGCCGGCAT GA
 
Protein sequence
MSRSRRAAAS VALVSFTAAV AVIVARLIDR PIALPVSVAA VTIALVAGWT ALVGRGLRRL 
SAAVVAAFAL GGLIVFAGVV GLTTVLVTVC LLVTSGTAAR GAFGRHRPKG IMAGAARRGV
LLVNPRSGGG AADRHNLAQE AARRGIAVVT LTPGADLRSL AEDVADRGAD VVGMAGGDGS
QAVVADVARR RGMAFVCVPA GTRNHFALDL GLDRKDVAGA LDAFDLAVEQ RVDLGQLGDR
VFVNNVSLGV YAEIVQSDSY RDAKMGTAAA RLPDLLGLDR APPDLRFTGP DGRAGSTADV
LLVSNNAYQL HSLGGFGTRP RLDGGRLGMV ALRVDRARDL PVLVALESVG AISRFRGFHQ
WTSPTMRVDS ARPVSVGVDG EALCLPPPLE LRSLPAAVRV RIPLHAPGVP TVRPGVWEMF
PALIRIAGGR SPA