Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3958 |
Symbol | |
ID | 3906917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4735611 |
End bp | 4736642 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637881285 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_483037 |
Protein GI | 86742637 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.604239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAGCCC CCGCCAAGGT CAACCTGCAT CTCGGCGTCG GACCGCTCCG ACCGGACGGC TACCACGACG TCATCACAGT GCTGCAGGCC GTCTCGCTGT TCGACGACGT CTCGGCGACG TCCGTCGATC CCCCGCGGTT CCACGACCCG GGGCCATCGA CCACCGACGG GAACGGCGGG TCCGACGGGA ACGGCCGGTC CAGCGGGGAC ATCGTCGTGA CCGTCGAGGT GTCCGGGGAG GGCGCCGACC CGGCGTCGCT GGGTCCGGCG ACGTCCACGC CGGAGGTCTC CATCGTCCCC ACCGGCCGGG ACAACATCGC CGTCCGGGCC GCTCACCTGG TTGCCGAGGC CGCGGGCATC ACCTCGGAAC GGGTTCATCT CACCCTGACG AAGGGCATCC CCGTCGCCGC GGGGATGGCC GGGGGCAGCG CCGACGCGGC GGCGGCGCTC GTCGCCTGCG ACGCGCTCTG GCAGACCGGC CTGGACCGGG CGACCCTGAC CCGACTCGCC GCCCAGCTCG GCAGCGACGT CCCGTTCCCC CTGGCCGGCG GCACCGCACT CGGCACCGGG CGCGGCGAGC AGCTCACCGA CGTCCTGGCG ACGGGCGAGT ACTACTGGGT GTTCGCGCTC GCCGACGGCG GCCTGTCCAC CCCCGCGGTC TACAAGGAGT TCGACCGGCT GACCGAGGGC AAACTGCGGA CCGGCCCGAC CCCCGCCGAC GACGTGCTCG CCGCGCTGCG CACCGGCGAC CCCGGCCAGC TCGGAGCCGC CCTGGTCAAC GACCTGCAGC CGGCAGCACT GCGGCTTCGG CCGTCCCTGC GCCGCGTCCT GGAGGCGGGC CGGGAGCTGG GAGCCGTCGG GGCGATCGTG AGCGGCTCCG GCCCGACCTG CGCCTTCCTC ACGGCCGGAG CGCAGGAGAG CATCGCGCTC GCGGCGAGCC TCGCCGGGAT GGGGGTCGCC CGCGCGGTAC GCCGGGCCTC CGGGCCGGCG AGCGGCGCCA GGATGGTGGA GGGAGCAGGC GAAGCGCCGT GA
|
Protein sequence | MRAPAKVNLH LGVGPLRPDG YHDVITVLQA VSLFDDVSAT SVDPPRFHDP GPSTTDGNGG SDGNGRSSGD IVVTVEVSGE GADPASLGPA TSTPEVSIVP TGRDNIAVRA AHLVAEAAGI TSERVHLTLT KGIPVAAGMA GGSADAAAAL VACDALWQTG LDRATLTRLA AQLGSDVPFP LAGGTALGTG RGEQLTDVLA TGEYYWVFAL ADGGLSTPAV YKEFDRLTEG KLRTGPTPAD DVLAALRTGD PGQLGAALVN DLQPAALRLR PSLRRVLEAG RELGAVGAIV SGSGPTCAFL TAGAQESIAL AASLAGMGVA RAVRRASGPA SGARMVEGAG EAP
|
| |