Gene Cyan8802_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1032 
Symbol 
ID8390341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1056029 
End bp1056979 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content45% 
IMG OID644979047 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_003136800 
Protein GI257058912 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000405209 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGTGCTA TGCAAACCTA TACTTTAATG GCTTCGGCTA AGATTAATCT TTATCTGGAA 
ATCATTGGCG ATCGCCCTGA TGGCTACCAT GAATTGGTGA TGATCCTGCA AAGCATTGAC
TTGGGCGATC GCTTAGAACT CCGTCCCAAT GGGACCCAAA CCTTTCGCTT AAACTGTTCC
CATCCCCAAG TACCCACCGA TAATACCAAT TTAGCCTATC GAGCAGCGCA ATTGATGAAA
CAGGAGTTTA GTCAAGCGTT TGCTAACTAT GGGGGAGTAG ATATTACCTT AGATAAGCAT
ATTCCTGTGG CTGCAGGGTT AGCCGGGGGG TCAACCGATG CCGCCGCCGT GTTAGTCGGC
TTAGACTTAA TGTGGGAATT GGGTTTAACC TTGCCAGAAT TACAGGAATT AGGCGGAAAA
CTAGGCTCAG ATGTCCCCTT TTGTATCGCA GGAGGAACCG CGATCGCCAC AGGACGAGGA
GAAAAACTAG ACCCCATTGA AGATATAGAC CATCTTTGGG TAGTTTTAGG CAAATACCAG
AGCTTAGAAG TCTCTACTCC TTGGGCTTAT CAAACCTATC GACAAAAATT TGGAGACGCT
TATATTAGCG ATCGCCCTGC TGTAGAGTCC CGAACCTTCA AGGTTCATTC AGGACCTTTA
GTTAAGGCTA TCAGCCATAA AGATAACACG AAAATCGGTC AACTATTACA CAACGATTTA
GAAAAGGTAG TTTTGCCCGA ATATCCCCAA GTGAACCATC TTAGAGAGGT CATGCAACAA
GCCGGGGGAC TCGGAACCAT GATGTCAGGA TCAGGTCCAA CCGTGTTTAC TCTCTGTGAG
TCTCAAGAAG CAGCAGAAAC GATTAAAAAT CAAGCAAGAC TCGCTATTGA GGATGCAGAC
CTAAAATTTT GGGTGACAAA GCTATCTAGC AATGGAATTA AAGTACTTTA A
 
Protein sequence
MCAMQTYTLM ASAKINLYLE IIGDRPDGYH ELVMILQSID LGDRLELRPN GTQTFRLNCS 
HPQVPTDNTN LAYRAAQLMK QEFSQAFANY GGVDITLDKH IPVAAGLAGG STDAAAVLVG
LDLMWELGLT LPELQELGGK LGSDVPFCIA GGTAIATGRG EKLDPIEDID HLWVVLGKYQ
SLEVSTPWAY QTYRQKFGDA YISDRPAVES RTFKVHSGPL VKAISHKDNT KIGQLLHNDL
EKVVLPEYPQ VNHLREVMQQ AGGLGTMMSG SGPTVFTLCE SQEAAETIKN QARLAIEDAD
LKFWVTKLSS NGIKVL