Gene Franean1_2119 details

Gene Information

Locus tag: Franean1_2119
Symbol:
ID: 5670519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2544673
End bp: 2545674
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 72%
IMG OID: 641241040
Product: undecaprenyl diphosphate synthase
Protein accession: YP_001506461
Protein GI: 158313953
COG category: [I] Lipid transport and metabolism
COG ID: [COG0020] Undecaprenyl pyrophosphate synthase
TIGRFAM ID: [TIGR00055] undecaprenyl diphosphate synthase
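
The coordinate and length fields above are mutually consistent, which a short check can illustrate. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence does end in TGA). The function names are hypothetical and chosen only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2544673, 2545674)
print(length)                     # 1002
print(protein_length_aa(length))  # 333
```

Both values match the Gene Length and Protein Length fields of the record.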


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGAGG GCGGGGGCCC TGGCCGGGCA CGGCAGTGCG CGGCGGGCGT GAGGGGACCG 
TGTGAGGAGC GGGGACGGGA TGTGTGCTGC GAGACCGGCG GTCCGGGTGT GCGCGGGCCG
GCCGGTTCGG ACCATGGGGG TGAGCGCCCC ACCGGGGCGG GGGTCGGTCG CCGTGCCGTC
GGCCCGAACG AGAGGATGTC GGCTGTGAGC CTGCGTGAAG AGGTGGGCCG GCGGTTCCGC
GATCCGCGTC CTCATCCGTC CGGAGCGCAG CCACCGCCGT TGCCGCGGGC CCTGGTCCCG
CAGCATGTCG GGATCGTCAT GGACGGAAAC GGGCGCTGGG CGAAGCTGCG CGGCCTGCCC
CGTACCAAGG GGCACGAGGC GGGTGAGGAG GCGCTGTTCG ACTGCGTCGA GGGCGCCATC
GAGATGGGGG TGCGCTGGCT GTCGGTGTAC GCGTTCTCCA CGGAGAACTG GAAGCGCTCG
CCGGACGAGG TCGCGTTCCT GATGCGGTTC ACCGACGGCG TGTTCGGTCG CCGTATCGAC
GACATGGACG AGCTCGGAGT CCGGGTCCGC TGGGCGGGCC GCCGTCCCCG GCTGTGGGGC
AACGTGATCC GGCGGCTGGA GTCGGCGGAG CAGCGCACCC GGGACAACGA CCGGCTCACG
CTCGTGATGT GCGTGAACTA CGGCGGGCGG GCCGAGCTGG CGGACGCCGC GGCGGCGATC
GCCCGGGACG TGCGGGCCGG CCTGCTGCGC CCGGAGCGCG TCAACGAGGA GACCGTCGCC
CGGTATCTCG ACGAGCCCGA CATGCCGGAC GTCGACCTGC TCATCCGGAC GTCCGGCGAG
CAGCGGCTGA GTAACTTCCT GCTCTGGCAG GCCGCGTACG CCGAGTTCTC CTTCGTGCCC
ACTCTGTGGC CTGACTTCGA CCGCCGGGAT CTGTGGCTGG CCTGTGAGGA GTACGCCCGT
CGTGACCGGC GCTACGGCGG GATCATCACC AACCGTGCCT GA
 
Protein sequence
MAEGGGPGRA RQCAAGVRGP CEERGRDVCC ETGGPGVRGP AGSDHGGERP TGAGVGRRAV 
GPNERMSAVS LREEVGRRFR DPRPHPSGAQ PPPLPRALVP QHVGIVMDGN GRWAKLRGLP
RTKGHEAGEE ALFDCVEGAI EMGVRWLSVY AFSTENWKRS PDEVAFLMRF TDGVFGRRID
DMDELGVRVR WAGRRPRLWG NVIRRLESAE QRTRDNDRLT LVMCVNYGGR AELADAAAAI
ARDVRAGLLR PERVNEETVA RYLDEPDMPD VDLLIRTSGE QRLSNFLLWQ AAYAEFSFVP
TLWPDFDRRD LWLACEEYAR RDRRYGGIIT NRA
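
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the standard codon assignments used by translation table 11 and treats an initial ATG, GTG, or TTG as methionine, which is a simplified subset of the start codons that table allows. The names `translate_cds`, `gc_content`, and `START_CODONS` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: simplified set of bacterial start codons, each read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate_cds("GTGGCCGAGG GCGGGGGCCC TGGCCGGGCA"))  # MAEGGGPGRA
```

The output matches the first ten residues of the listed protein sequence. The leading GTG codon is why the protein begins with M even though GTG encodes valine at internal positions. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.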