Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0089 |
Symbol | |
ID | 8414371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 118023 |
End bp | 119024 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645023067 |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritolkinase |
Protein accession | YP_003180472 |
Protein GI | 257789866 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAC CGAGCAAGAA CGGCGAAAAC GCCGTAGACA TGCTGGCCGT GGCTCGCATC GCGCAGGGCG TGGACATCGC GGGCTTCGCC GGCCCCGATG CCGTCAAGCT GGTGGCACCG GCGAAAGTGA ATCTGTTCCT CGACATCGGC GCGAAGCGCG CGGACGGCTA TCACGAGGCC GTCAGCATCA TGCATGCGCT CATGCTGCAC GACGTTTTAC GCATGAAGCT GGCGCCGGGA CGCGGAGAAG GTCTATCCGT CGATCTCTCG TGCGTTGCGC GCGAAGGGCT CGCGATGCTC GATGTGCCCG TGGAGAGCAA CATCGTGGCG AAGGCCGTGC GCTTGCTGGC TGAAAAATTG GGACGCACGG CTGACGAGAC GGTGGTCGCC TGCCTCGAGA AGCACATCCC CGCCGAGGCC GGTTTGGGCG GCGGCTCCTC CGACGCCGCC GCGGCGCTGC TGGGCGCTGC GCACTTGTGG GGCGTGCCTG CCGACGATCC GCGTATCGAG GAAGCAGCAC GAAGCCTGGG TGCCGACGTG GCGTTCTTCC TGCATGGCGG CTGCGCTTGC TTCACGGGCG TGGGCGATGC GTTCGACCAT GCCCTTGCGC CCATGAACGG CAACGTCGTG CTCGTGAAGC CCGAGGGCGG CGTGTCCACG GCTGCCGCGT ACCGCGCGTT CGACGAGCAT CCCACGGCCA TTTCCGAGGC CGATCGCGAA GCAGCCCTCG AAGCCCAGCG CGCCGCCGAC GTGCCGCTGC GCAACAACCT CGTGCCCGTT TCCGAGCACC TGCTTCCGGC GCTCGTGGAC ATCCGCTTGT GGGCTAGCGA GCGCGCCGAC GTGCAGCGCG TGCTCATGTC GGGCAGCGGG TCGGCCGTGT TCATGCAGTG CGCCACGTTC GCCGATGCAG GTCGCGTGGC CGCCGAGGCG CGCATGCGCG GCTGGTGGGC GCGTGCTACC ATGTTTGGCT CGGCGCGTGC GGCCGTGGTG CCGAACCGCT GA
|
Protein sequence | MTQPSKNGEN AVDMLAVARI AQGVDIAGFA GPDAVKLVAP AKVNLFLDIG AKRADGYHEA VSIMHALMLH DVLRMKLAPG RGEGLSVDLS CVAREGLAML DVPVESNIVA KAVRLLAEKL GRTADETVVA CLEKHIPAEA GLGGGSSDAA AALLGAAHLW GVPADDPRIE EAARSLGADV AFFLHGGCAC FTGVGDAFDH ALAPMNGNVV LVKPEGGVST AAAYRAFDEH PTAISEADRE AALEAQRAAD VPLRNNLVPV SEHLLPALVD IRLWASERAD VQRVLMSGSG SAVFMQCATF ADAGRVAAEA RMRGWWARAT MFGSARAAVV PNR
|
| |