Gene Information |
Locus tag | PCC8801_1003 |
Symbol | |
ID | 7104231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 1055808 |
End bp | 1056758 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643474095 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_002371235 |
Protein GI | 218245864 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
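The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon

length = gene_length_bp(1055808, 1056758)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```

Both values agree with the Gene Length and Protein Length fields listed in the record.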
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGCTA TGCAAACCTA TACTTTAATG GCTTCGGCTA AGATTAATCT TTATCTGGAA ATCATTGGCG ATCGCCCTGA TGGCTACCAT GAATTGGTGA TGATCCTGCA AAGCATTGAC TTGGGCGATC GCTTAGAACT CCGTCCCAAT GGGACCCAAA CCTTTCGCTT AAACTGTTCC CATCCCCAAG TACCCACCGA TAATACCAAT TTAGCCTATC GAGCAGCGCA ATTGATGAAA CAGGAGTTTA GTCAAGCGTT TGCTAACTAT GGGGGAGTAG ATATTACCTT AGATAAGCAT ATTCCTGTGG CTGCAGGGTT AGCCGGGGGG TCAACCGATG CCGCCGCCGT GTTAGTCGGC TTAGACTTAA TGTGGGAATT GGGTTTAACC TTGCCAGAAT TACAGGAATT AGGCGGAAAA CTAGGCTCAG ATGTCCCCTT TTGTATCGCA GGAGGAACCG CGATCGCCAC AGGACGAGGA GAAAAACTAG ACCCCATTGA AGATATAGAC CATCTTTGGG TAGTTTTAGG CAAATACCAG AGCTTAGAAG TCTCTACTCC TTGGGCTTAT CAAACCTATC GACAAAAATT TGGAGACGCT TATATTAGCG ATCGCCCTGC TGTAGAGTCC CGAACCTTCA AGGTTCATTC AGGACCTTTA GTTAAGGCTA TCAGCCATAA AGATAACACG AAAATCGGTC AACTATTACA CAACGATTTA GAAAAGGTAG TTTTGCCCGA ATATCCCCAA GTGAACCATC TTAGAGAGGT CATGCAACAA GCCGGGGGAC TCGGAACCAT GATGTCAGGA TCAGGTCCAA CCGTGTTTAC TCTCTGTGAG TCTCAAGAAG CAGCAGAAAC GATTAAAAAT CAAGCAAGAC TCGCTATTGA GGATGCAGAC CTAAAATTTT GGGTGACAAA GCTATCTAGC AATGGAATTA AAGTACTTTA A
|
Protein sequence | MCAMQTYTLM ASAKINLYLE IIGDRPDGYH ELVMILQSID LGDRLELRPN GTQTFRLNCS HPQVPTDNTN LAYRAAQLMK QEFSQAFANY GGVDITLDKH IPVAAGLAGG STDAAAVLVG LDLMWELGLT LPELQELGGK LGSDVPFCIA GGTAIATGRG EKLDPIEDID HLWVVLGKYQ SLEVSTPWAY QTYRQKFGDA YISDRPAVES RTFKVHSGPL VKAISHKDNT KIGQLLHNDL EKVVLPEYPQ VNHLREVMQQ AGGLGTMMSG SGPTVFTLCE SQEAAETIKN QARLAIEDAD LKFWVTKLSS NGIKVL
|
| |
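The gene sequence is shown on the coding strand (it begins with ATG and ends with TAA), so it can be translated directly even though the gene lies on the minus strand of the replicon. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; since this gene starts with ATG, alternative starts are not handled. The `translate` helper and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGTGTGCTA TGCAAACCTA TACTTTAATG"))  # MCAMQTYTLM
print(translate("AATGGAATTA AAGTACTTTA A"))           # NGIKVL
```

The two outputs match the first ten and last six residues of the protein sequence listed in the record. Passing the full 951 bp gene sequence to the same function should reproduce the whole 316-residue protein.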