Gene PCC8801_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1003 
Symbol 
ID7104231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1055808 
End bp1056758 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content45% 
IMG OID643474095 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_002371235 
Protein GI218245864 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTGCTA TGCAAACCTA TACTTTAATG GCTTCGGCTA AGATTAATCT TTATCTGGAA 
ATCATTGGCG ATCGCCCTGA TGGCTACCAT GAATTGGTGA TGATCCTGCA AAGCATTGAC
TTGGGCGATC GCTTAGAACT CCGTCCCAAT GGGACCCAAA CCTTTCGCTT AAACTGTTCC
CATCCCCAAG TACCCACCGA TAATACCAAT TTAGCCTATC GAGCAGCGCA ATTGATGAAA
CAGGAGTTTA GTCAAGCGTT TGCTAACTAT GGGGGAGTAG ATATTACCTT AGATAAGCAT
ATTCCTGTGG CTGCAGGGTT AGCCGGGGGG TCAACCGATG CCGCCGCCGT GTTAGTCGGC
TTAGACTTAA TGTGGGAATT GGGTTTAACC TTGCCAGAAT TACAGGAATT AGGCGGAAAA
CTAGGCTCAG ATGTCCCCTT TTGTATCGCA GGAGGAACCG CGATCGCCAC AGGACGAGGA
GAAAAACTAG ACCCCATTGA AGATATAGAC CATCTTTGGG TAGTTTTAGG CAAATACCAG
AGCTTAGAAG TCTCTACTCC TTGGGCTTAT CAAACCTATC GACAAAAATT TGGAGACGCT
TATATTAGCG ATCGCCCTGC TGTAGAGTCC CGAACCTTCA AGGTTCATTC AGGACCTTTA
GTTAAGGCTA TCAGCCATAA AGATAACACG AAAATCGGTC AACTATTACA CAACGATTTA
GAAAAGGTAG TTTTGCCCGA ATATCCCCAA GTGAACCATC TTAGAGAGGT CATGCAACAA
GCCGGGGGAC TCGGAACCAT GATGTCAGGA TCAGGTCCAA CCGTGTTTAC TCTCTGTGAG
TCTCAAGAAG CAGCAGAAAC GATTAAAAAT CAAGCAAGAC TCGCTATTGA GGATGCAGAC
CTAAAATTTT GGGTGACAAA GCTATCTAGC AATGGAATTA AAGTACTTTA A
 
Protein sequence
MCAMQTYTLM ASAKINLYLE IIGDRPDGYH ELVMILQSID LGDRLELRPN GTQTFRLNCS 
HPQVPTDNTN LAYRAAQLMK QEFSQAFANY GGVDITLDKH IPVAAGLAGG STDAAAVLVG
LDLMWELGLT LPELQELGGK LGSDVPFCIA GGTAIATGRG EKLDPIEDID HLWVVLGKYQ
SLEVSTPWAY QTYRQKFGDA YISDRPAVES RTFKVHSGPL VKAISHKDNT KIGQLLHNDL
EKVVLPEYPQ VNHLREVMQQ AGGLGTMMSG SGPTVFTLCE SQEAAETIKN QARLAIEDAD
LKFWVTKLSS NGIKVL