Gene Franean1_0773 details


Gene Information

Locus tag: Franean1_0773
Symbol:
ID: 5669189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 901685
End bp: 902863
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 77%
IMG OID: 641239701
Product: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
Protein accession: YP_001505137
Protein GI: 158312629
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
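
The coordinates and lengths in this record are consistent with each other, which the short sketch below checks. It assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 901685
end_bp = 902863

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1179 392"
```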


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.272276
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTCGAT CGCATCCGGG TCGACGCGCC ACCGAACCCG GCCGCCCCCT GCCGACCGTC 
ACCGTTCGCG CTCCCGCCAA GGTCAACCTC CACCTCGGGG TGGGCCCACG CCGGCCCGAC
GGCTACCACG AGGTGACGAC GATCCTGCAG GCCGTCGCGC TGTACGACGA CATCACCGCG
ACGTCGGTCC CCCCGGAGTC ACTCTCCGGG CCCGAGGGCG CCGGCCCGGT GTTCACCGAC
GAGGACCCGA TCGCGGTCAC GGTCGGCGTC GCCGGCGAGG GCGCGCGGCC GGCAACGTCA
GACGACGCGG ACGGCGCTGG CGACGGCCCG GGTGGTTCCC CGGGTGACAC CGGCGCCGAG
CCGTCCGTCT CGGTGGTGCC GACCGGTAAG GACAACCTGG CTGTCCGCGC CGCCTACCTG
GTCGCCGAGG CCGCCGGGAT CCGCGGCGAG GCCGTCCACC TGACGCTGTC GAAGGGCATC
CCGGTCGCCG CCGGGATGGC CGGCGGCAGC GCCGACGCGG CCGCGGCCCT GCTCGCCTGC
GACACGCTGT GGGGCGCCGG CCTCGACCGC GAGACCCTTG TCGCGCTGGC CGCCAAGCTG
GGCAGCGACG TCCCGTTCCC GCTCACCGGT GGGACGGCGC TGGGCACCGG CCGCGGTGAG
CAGCTCACCG ACGTTCTCGG GCGCGGCGAG TACCACTGGG TGTTCGCGCT CGCCGACGGC
GGGCTGTCGA CGCCCGCCGT CTACGGCGAG TTCGACCGGC TCTCCGAGGG CAGGCTGCGC
ACCGGGCCCA CGCCCGCGGA CGCCGTCCTG AGCGCCCTGC GCAGCGGGGA CCCGGCGGAG
CTCGGAGCCG CCCTGGTCAA CGACCTGCAG CCGGCGGCGC TGCGGCTGCG CCCGTCCCTG
CGGCGGGTAC TGGAGAGCGG GCTGGAGCTG GGCGCGATCG GCGCGATCGT GAGCGGATCC
GGGCCGACCT GCGCTTTCCT CACCCGCGAC GCGGCGGCGA GCGTCTCGCT CGCCGCGAGC
CTCGCCGGCA TGGGCGTCGC GCGCGCCGTC CGACGGGCCC ACGGCCCGGT CGCCGGAGCA
CGGGTGATCG GCCCGGCGGA CCCGGCCGGT CCGGGTGGGG AGCCGGGCAG CTCCACGGCG
CAGTCCCCGC CGCTCTCCCC CTCGTCGTCA CCGGCGTGA
 
Protein sequence
MPRSHPGRRA TEPGRPLPTV TVRAPAKVNL HLGVGPRRPD GYHEVTTILQ AVALYDDITA 
TSVPPESLSG PEGAGPVFTD EDPIAVTVGV AGEGARPATS DDADGAGDGP GGSPGDTGAE
PSVSVVPTGK DNLAVRAAYL VAEAAGIRGE AVHLTLSKGI PVAAGMAGGS ADAAAALLAC
DTLWGAGLDR ETLVALAAKL GSDVPFPLTG GTALGTGRGE QLTDVLGRGE YHWVFALADG
GLSTPAVYGE FDRLSEGRLR TGPTPADAVL SALRSGDPAE LGAALVNDLQ PAALRLRPSL
RRVLESGLEL GAIGAIVSGS GPTCAFLTRD AAASVSLAAS LAGMGVARAV RRAHGPVAGA
RVIGPADPAG PGGEPGSSTA QSPPLSPSSS PA
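
As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content figure, the sketch below computes a GC fraction and translates a CDS with the bacterial genetic code (translation table 11). The helper names `clean`, `gc_fraction`, and `translate_table11` are hypothetical and written for this example. The sketch assumes the input is an uppercase DNA string over A, C, G, T, read in frame from its first base, and it treats any table 11 initiation codon in the first position (here TTG) as methionine.

```python
# Amino acids for codons in TCAG order (NCBI tables 1 and 11 share this string).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_table11(seq: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_table11("TTGCCTCGAT CGCATCCGGG TCGACGCGCC"))  # prints "MPRSHPGRRA"
```

Passing the full gene sequence to `translate_table11` should give the protein sequence listed above, and `gc_fraction` on the same input gives the value behind the rounded GC content field.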