Gene PCC8801_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0471 
Symbol 
ID7101944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp480947 
End bp482857 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content48% 
IMG OID643473580 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_002370723 
Protein GI218245352 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATAA GTGATATTAC TCATCCGAAT CAATTGCATG GACTATCCAT CCGGCAACTG 
GAAGACGTAG CCCGTCAAAT TCGAGAAAAG CACCTACAAA CCATCGCAGC AACAGGGGGA
CACCTTGGCC CCGGTTTAGG GGTAGTCGAA TTAACCATCG CCCTCTATCA AACCCTAGAC
TTAGACCGCG ATAAAGTGAC CTGGGACGTG GGACACCAAG CCTATCCCCA CAAAATGCTA
ACCGGCCGTT ATAAAAATTT CCACACCCTA CGACAAAAAG ACGGTATTGC AGGATATCTT
AAACGTTGTG AAAGCAAATT TGACCATTTT GGAGCCGGAC ACGCTTCTAC AAGTATTTCT
GCGGCGTTAG GCATGGCTTT AGCCAGAGAT GCCCAAGGGG AAGACTACAA AGTCGTAGCC
GTCATCGGAG ATGGAGCCCT TACCGGGGGT ATGGCCTTAG AAGCCATCAA CCATGCAGGA
CATTTACCCC ATACTAATCT GATGGTGGTC TTAAACGACA ATGAAATGTC CATTTCTCCC
AATGTTGGGG CAATTTCTCG CTATCTCAAC AAAGTTCGCC TCAGTGACCC CGTTCAGTTC
CTCGCCGACA ACCTCGAAGA ACAATTCAAA CACTTGCCCT TTTTTGGCGA CTCCCTGAGT
CCGGAGATGG AACGGGTTAA AGAAGGGATG AAACGCCTGG CAGTGTCTAA AGTGGGGGCT
GTGATTGAAG AATTAGGGTT TAAATACTTC GGTCCGATCG ACGGTCATAA CCTACAGGAA
CTCATTAGTA CCTTTAAACA GGCTCATAAA GTCACTGGAC CCGTTCTCGT TCACGTTGCC
ACCGTTAAAG GAAAAGGCTA CGAATTGGCG GAAAAAGACC AAGTGGGCTA CCATGCTCAA
AGTCCCTTTA ATTTAGCCAC GGGTAAGGCT ATCCCGTCGA GTAAACCGAA ACCCCCCAGT
TACGCCAAAG TCTTTGCCCA TACCTTGACC ACCCTAGCGG AAAATAACCC CAAAATTATC
GGTATTACCG CAGCCATGGC TACGGGAACC GGACTCGATA AACTTCAGGC AAAACTGCCA
AAACAATACA TTGATGTGGG TATCGCCGAA CAACACGCCG TTACCTTAGC TGGTGGCTTA
GCTTGTGAAG GAATGCGTCC CGTAGTAGCG ATTTACTCGA CTTTTCTGCA ACGAGCCTAT
GACCAAGTGC TCCATGATGT CTGCATTCAA AATCTTCCCG TTTTCTTCTG TATGGATCGC
GCAGGAATTG TCGGGGCAGA CGGTCCAACC CACCAAGGAA TGTACGATAT CGCCTATCTG
CGCTGTATTC CCAATATGAC CATTATGGCT CCTAAAGATG AGGCGGAATT ACAGCGCATG
ATCGTGACGG GGGTTAATTA CACTGATGGA CCGATCGCCA TGCGCTATCC CCGTGGTAAC
GGTATCGGTG TGCCTTTGAT GGAAGAAGGA TGGGAACCTC TCCCCATTGG AAAAGGCGAA
ATTCTCCGCA ATGGAGATGA TCTGTTGATT CTGGGGTACG GAACTATGGT TAATACGGCT
TTACAAGCGG CGGAAACCCT CCGAGAACAC GGCATCGAAG CCACGGTGGT TAATGCCCGT
TTTGTTAAAC CCTTGGATAC CGAGTTAATT CTACCCTTAG CTCAACGCAT CGGCAAAGTT
GTCACCCTTG AAGAGGGTTG TCTGATGGGC GGTTTTGGGT CTGCTGTGGC TGAAGCGTTC
TCTGACCATA ATGTGTTAGT TCCTCTCAAA CGCTTTGGTG TTCCCGATCG CTTAGTGGAT
CATGCTACCC CTGATCAATC AAAAGCCGAT TTAGGCTTAA CTAGCCCCCA AATTGCGGAA
CAAATTCTCC AAGTTTTCTT TAGTAATCGA CAACCTTCGA TGGTGAGTTA A
 
Protein sequence
MHISDITHPN QLHGLSIRQL EDVARQIREK HLQTIAATGG HLGPGLGVVE LTIALYQTLD 
LDRDKVTWDV GHQAYPHKML TGRYKNFHTL RQKDGIAGYL KRCESKFDHF GAGHASTSIS
AALGMALARD AQGEDYKVVA VIGDGALTGG MALEAINHAG HLPHTNLMVV LNDNEMSISP
NVGAISRYLN KVRLSDPVQF LADNLEEQFK HLPFFGDSLS PEMERVKEGM KRLAVSKVGA
VIEELGFKYF GPIDGHNLQE LISTFKQAHK VTGPVLVHVA TVKGKGYELA EKDQVGYHAQ
SPFNLATGKA IPSSKPKPPS YAKVFAHTLT TLAENNPKII GITAAMATGT GLDKLQAKLP
KQYIDVGIAE QHAVTLAGGL ACEGMRPVVA IYSTFLQRAY DQVLHDVCIQ NLPVFFCMDR
AGIVGADGPT HQGMYDIAYL RCIPNMTIMA PKDEAELQRM IVTGVNYTDG PIAMRYPRGN
GIGVPLMEEG WEPLPIGKGE ILRNGDDLLI LGYGTMVNTA LQAAETLREH GIEATVVNAR
FVKPLDTELI LPLAQRIGKV VTLEEGCLMG GFGSAVAEAF SDHNVLVPLK RFGVPDRLVD
HATPDQSKAD LGLTSPQIAE QILQVFFSNR QPSMVS