Gene Information |
Locus tag | Franean1_2119 |
Symbol | |
ID | 5670519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2544673 |
End bp | 2545674 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241040 |
Product | undecaprenyl diphosphate synthase |
Protein accession | YP_001506461 |
Protein GI | 158313953 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0020] Undecaprenyl pyrophosphate synthase |
TIGRFAM ID | [TIGR00055] undecaprenyl diphosphate synthase |
|
|
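The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1002 bp) and that the protein length excludes the terminal stop codon. The function name `cds_lengths` is hypothetical and introduced only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1        # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1      # drop the stop codon
    return gene_length, protein_length


# Values taken from the record: Start bp 2544673, End bp 2545674
gene_len, prot_len = cds_lengths(2544673, 2545674)
print(gene_len, prot_len)  # 1002 333
```

The result matches the record's Gene Length (1002 bp) and Protein Length (333 aa).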
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGAGG GCGGGGGCCC TGGCCGGGCA CGGCAGTGCG CGGCGGGCGT GAGGGGACCG TGTGAGGAGC GGGGACGGGA TGTGTGCTGC GAGACCGGCG GTCCGGGTGT GCGCGGGCCG GCCGGTTCGG ACCATGGGGG TGAGCGCCCC ACCGGGGCGG GGGTCGGTCG CCGTGCCGTC GGCCCGAACG AGAGGATGTC GGCTGTGAGC CTGCGTGAAG AGGTGGGCCG GCGGTTCCGC GATCCGCGTC CTCATCCGTC CGGAGCGCAG CCACCGCCGT TGCCGCGGGC CCTGGTCCCG CAGCATGTCG GGATCGTCAT GGACGGAAAC GGGCGCTGGG CGAAGCTGCG CGGCCTGCCC CGTACCAAGG GGCACGAGGC GGGTGAGGAG GCGCTGTTCG ACTGCGTCGA GGGCGCCATC GAGATGGGGG TGCGCTGGCT GTCGGTGTAC GCGTTCTCCA CGGAGAACTG GAAGCGCTCG CCGGACGAGG TCGCGTTCCT GATGCGGTTC ACCGACGGCG TGTTCGGTCG CCGTATCGAC GACATGGACG AGCTCGGAGT CCGGGTCCGC TGGGCGGGCC GCCGTCCCCG GCTGTGGGGC AACGTGATCC GGCGGCTGGA GTCGGCGGAG CAGCGCACCC GGGACAACGA CCGGCTCACG CTCGTGATGT GCGTGAACTA CGGCGGGCGG GCCGAGCTGG CGGACGCCGC GGCGGCGATC GCCCGGGACG TGCGGGCCGG CCTGCTGCGC CCGGAGCGCG TCAACGAGGA GACCGTCGCC CGGTATCTCG ACGAGCCCGA CATGCCGGAC GTCGACCTGC TCATCCGGAC GTCCGGCGAG CAGCGGCTGA GTAACTTCCT GCTCTGGCAG GCCGCGTACG CCGAGTTCTC CTTCGTGCCC ACTCTGTGGC CTGACTTCGA CCGCCGGGAT CTGTGGCTGG CCTGTGAGGA GTACGCCCGT CGTGACCGGC GCTACGGCGG GATCATCACC AACCGTGCCT GA
|
Protein sequence | MAEGGGPGRA RQCAAGVRGP CEERGRDVCC ETGGPGVRGP AGSDHGGERP TGAGVGRRAV GPNERMSAVS LREEVGRRFR DPRPHPSGAQ PPPLPRALVP QHVGIVMDGN GRWAKLRGLP RTKGHEAGEE ALFDCVEGAI EMGVRWLSVY AFSTENWKRS PDEVAFLMRF TDGVFGRRID DMDELGVRVR WAGRRPRLWG NVIRRLESAE QRTRDNDRLT LVMCVNYGGR AELADAAAAI ARDVRAGLLR PERVNEETVA RYLDEPDMPD VDLLIRTSGE QRLSNFLLWQ AAYAEFSFVP TLWPDFDRRD LWLACEEYAR RDRRYGGIIT NRA
|
| |
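The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an alternative start codon and is read as methionine when it initiates translation; the codon-to-amino-acid assignments are otherwise those of the standard code. The sketch below illustrates this and also shows how a GC percentage could be computed from a sequence. The function names are hypothetical, and `START_CODONS` lists only a common subset of the table 11 initiators, which is an assumption made for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a common subset of table 11 start codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS, reading an initiating start codon as M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"                 # initiator codon is read as Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":                # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("GTGGCCGAGG GCGGGGGCCC TGGCCGGGCA"))  # MAEGGGPGRA
```

Applied to the first ten codons of the gene, the output agrees with the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.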