Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4541 |
Symbol | |
ID | 4070220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5385169 |
End bp | 5386107 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986581 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_593615 |
Protein GI | 94971567 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000250612 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.557772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCT TTGTCCGTTC ATACGCGAAG ATCAATCTTG GGCTGCGCAT TGGGCCGCGG CGCGCGGATG GGTTTCACGA TCTGCGCACG ATGTACACGA CGATCGCGTT GCACGAGCTT GTTTCGGTTG AAGTCGAAGA CGGCGACGGG ATCGAGATCC GGTGTGACGA TCCGCGGGTG CCTTGCGATT CGACGAATAC GTGCCACAAG GCGGCGGCGC TGGTGTTGGA GGCGCTCGGT CTGCGCTCTA AGGTATTGAT TTCCATCGAG AAGCGGCTGC CGGTACAGGG TGGACTCGGG GCAGCTTCTG GGAATGCGAT TGCGACGATT TTTGGGCTGG AGCGGATGCT GGGGAAGGAG CTTTCGGCCA AAGAAAGGGC AGAGATTGCC GAGAAGATCG GATCCGACCT TAACTTGTTC TTATACGGTG GGTTAACGTT GGGTACCGGG CGCGGCGAGG AGGTCTGGCC GCTGCCGGAT TTGCCATCGC TGCCGCTGGT GATCGTGACG CCGGAGGTCG GAGTGTCGAC GCCGGTGGCA TTTAAGGCGT GGGATTCATT GACCCATCCG GAGGGCTCCG TTACAATAAA TGAGTTCAAC CATCTCGTTT ACGAGTGGTT GTCTGCATCC GGTGTTCCCG CCTTGAGCGG GGACCGGGCC GAGACGCTGC TTCTCGACCT TGTCCGAACC GGGATTTCGA ACGACTTCGA ACGCGTTGTC TTTCCAGAAA TTCCCGTATT ACGAGAGGTC AAGTGTGCGC TCGAGCGCGA AGGCGCTTTG TATGCGTCGC TTTCCGGCTC AGGTTCAACC TTGTATGGGT TGTTTCGTTC GTCTGCGGAA GCATCGACAG CGGCGGAGCG GTTGAACTCG AGCGGGCTGA AAGCGACGGC GACACAGACC TTGCCGCGTG AGCAGTACTG GCGGGAAATG TTTCAGTAA
|
Protein sequence | MSTFVRSYAK INLGLRIGPR RADGFHDLRT MYTTIALHEL VSVEVEDGDG IEIRCDDPRV PCDSTNTCHK AAALVLEALG LRSKVLISIE KRLPVQGGLG AASGNAIATI FGLERMLGKE LSAKERAEIA EKIGSDLNLF LYGGLTLGTG RGEEVWPLPD LPSLPLVIVT PEVGVSTPVA FKAWDSLTHP EGSVTINEFN HLVYEWLSAS GVPALSGDRA ETLLLDLVRT GISNDFERVV FPEIPVLREV KCALEREGAL YASLSGSGST LYGLFRSSAE ASTAAERLNS SGLKATATQT LPREQYWREM FQ
|
| |