Gene Franean1_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0894 
Symbol 
ID5669308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1043618 
End bp1044568 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content74% 
IMG OID641239821 
Productdiacylglycerol kinase catalytic region 
Protein accessionYP_001505256 
Protein GI158312748 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.495381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.488234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC CGGGCACGGC CAGCAGCAGC ACCGCGGCGC GGGCCGAGGT GCCCGACGCG 
GCGGTGGACC GTTCGAGGCT CACCGTCGTG GTCAACCCGA AGGCTGGCGG CGGCCGGGCG
GCGAAGGTCC TCGACGGCGT CCGCGCGGCG CTCGCCCGCT GGGCCGAGGA CGTCTCCGTC
GAGACCACGA AAAGCCTCGA GCACGCCGAG GAACTAGCCC GCTCGGCCGT CGCGGCGGGG
CGGGTGACCG TCGCGCTGGG TGGTGACGGC CTGGTCGGCA GGGTGGCGGG CGCCGTCGCC
CGGTCGGGCG GCGTGCTCGC GGTGCTGCCC GGCGGCCGGG GCAACGACTT CGCGCGAGGA
CTGGGCATCC CGCGTGATCC GGCACTCGCC GCGACCGCGC TCGTCGCGGC CGTGGAGCGC
AGGGTGGACC TGCCGGAGGC GAACGGGGTG CCGTTCGTCG GGATTGCCAG CCTCGGGTTC
GACTCCGACG TCCAGGTGAT CGCGAACCGG ACGACCTGGC TGTCCGGCCA GAGCGTCTAC
ACCTACGCGG CGTTGCGCGG GGTGGCGGCC TGGAAGCCGG CCCGGTTCAC CGTGACCATC
GACGGCGAGC CGCCACTGGA GCACGTCGGG TGGACGGTCG GCGCGGCGAA CGGTCCGTAC
TACGGCGGCG GGATGAAGTT CGCCCCGGAC GCCGACATCG CCGATGGCCG GTTGGAGATC
GTCCTGGTCG CGCGCACCGG GCGGTTCACC TTCCTTCGGT TGTTCCCGCG CATCTTCTCC
GGCCGGCACG TCGAGGTCCC CTACGTCCAG GTGCGGCGGG GCGAGCGGCT CGTCGTGGAC
GCGGACCGCC CGTTCCAGGT CTACGCGGAC GGCGACCCGA TCGCCGACCT CCCGGCTGAG
ATCGTCGTCC GGCCCGGGGC CCTGCGCCTG CTCACGCCGC CCCAGGCCTA A
 
Protein sequence
MTEPGTASSS TAARAEVPDA AVDRSRLTVV VNPKAGGGRA AKVLDGVRAA LARWAEDVSV 
ETTKSLEHAE ELARSAVAAG RVTVALGGDG LVGRVAGAVA RSGGVLAVLP GGRGNDFARG
LGIPRDPALA ATALVAAVER RVDLPEANGV PFVGIASLGF DSDVQVIANR TTWLSGQSVY
TYAALRGVAA WKPARFTVTI DGEPPLEHVG WTVGAANGPY YGGGMKFAPD ADIADGRLEI
VLVARTGRFT FLRLFPRIFS GRHVEVPYVQ VRRGERLVVD ADRPFQVYAD GDPIADLPAE
IVVRPGALRL LTPPQA