Gene PCC8801_1664 details


Gene Information

Locus tag: PCC8801_1664
Symbol: lpxB
ID: 7101635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1748671
End bp: 1749831
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 45%
IMG OID: 643474735
Product: lipid-A-disaccharide synthase
Protein accession: YP_002371871
Protein GI: 218246500
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0763] Lipid A disaccharide synthetase
TIGRFAM ID: [TIGR00215] lipid-A-disaccharide synthase
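
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1748671, 1749831)
print(length, protein_length(length))  # prints: 1161 386
```

Both values match the Gene Length and Protein Length fields of the record.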


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATCT TTATTAGTAC GGGGGAAGTT TCTGGTGACT TACAGGGGTC GCTGTTAGTC 
GAGTCGCTTT ATCAACGGGC AGAAGCTAGG GGGATACCGC TAGAAATCCT GGCGTTAGGG
GGCGATCGCA TGGCAGCGGC CGGGGCTAAA CTGTTGGGAA ATACGGCGGC GATCGGTTCT
ATTGGCATTG TGGAGTCCCT CCCGTTTATT ATTCCCACTT GGTTGATGCA GCGTCGGGTT
AAGCAGTATT TACGGGAGAA TCCTCCTGAT ATTTTGATTC TCATCGATTA TATGGGTCCA
AATGCAGCTT TTGGCCAATA TGCGCGGAAA CATCTCCCCC AAGTGCCGAT TATTTATTAT
ATTGCCCCTC AATCTTGGGT ATGGGCTCCC AATAGTAAAA CGATTCAACA ATTTGCTCAT
ATTACTGACC TTCTGTTGGC GATTTTCCCT GAAGAAGCGA GATTTTTTGA AGAAAAAGGG
GTTTCGGTTA AATGGGTGGG TCATCCCTTG CTCGATCGCA TGGCAAAGGC TCCCAGTCGA
GAGGTGGCGC GTCAACGGTT AAATTTACAT TCGGATCAGT TGATTGTCGC GCTTTTTCCG
GCTTCGCGCT ATCAGGAGTT AAAGTTTCAT CTGCCGTTGA TGTGTCAAGC AGCAGCCAAA
TTACAGGAAA AAATCCCTAA TTTACACTTT TTGCTGCCTG TTTCCTTGAG TGAGTATCGC
AGTACCATTG AAGAGACGGT GAAAGCCTAT CCGTTTTCGG TAACGTTGTT GGATGGTCAA
GCGTTGGATG TGATGGCGGC GGCAGATTTT GCGATCGCTA AATCGGGAAC GGTGAATTTA
GAGTTAGCTT TGCTAAAAAT TCCCCAATTA GTGTTATGTT TGGTCAATCC TTTAACGATG
TGGATTGCTC GCAATATTCT TAAGTTTTCT ATTCCCTATA TGTCACCGCC AAATTTAGTG
GTGATGGAGG CAATTATTCC CGAATTGTTG CAGGAAGAAG CAACTATAGA GCGCATTGTT
CAAGAGTCTT TGGATTTATT ATTGAATACA GAACGCCGTC AAAAAACCTT GGCAGATTAT
GAACAAATGT CTACTCTGTT AGGGGAGGTA GGAGTCTGTG ATCGTGTGGC TAATGAAATT
TTAGATTATT CTAAAAGTTA G
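
The gene sequence is displayed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. A minimal sketch of how the GC content field could be recomputed from such a listing follows. The helper names are hypothetical, and only the first two blocks of the sequence are used as a demonstration, so the printed value is for that fragment and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("ATGAGAATCT TTATTAGTAC")
print(len(fragment), gc_fraction(fragment))  # prints: 20 0.25
```

Pasting the full listing into `clean_sequence` and passing the result to `gc_fraction` gives the value to compare against the reported GC content.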
 
Protein sequence
MRIFISTGEV SGDLQGSLLV ESLYQRAEAR GIPLEILALG GDRMAAAGAK LLGNTAAIGS 
IGIVESLPFI IPTWLMQRRV KQYLRENPPD ILILIDYMGP NAAFGQYARK HLPQVPIIYY
IAPQSWVWAP NSKTIQQFAH ITDLLLAIFP EEARFFEEKG VSVKWVGHPL LDRMAKAPSR
EVARQRLNLH SDQLIVALFP ASRYQELKFH LPLMCQAAAK LQEKIPNLHF LLPVSLSEYR
STIEETVKAY PFSVTLLDGQ ALDVMAAADF AIAKSGTVNL ELALLKIPQL VLCLVNPLTM
WIARNILKFS IPYMSPPNLV VMEAIIPELL QEEATIERIV QESLDLLLNT ERRQKTLADY
EQMSTLLGEV GVCDRVANEI LDYSKS
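
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits; since this gene starts with ATG, the simple lookup is assumed to be sufficient here. The function name is illustrative, and input containing characters other than A, C, G, T would raise a KeyError.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid ("*" marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):  # ignores a trailing partial codon
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence (20 bases, 6 complete codons).
print(translate("ATGAGAATCT TTATTAGTAC"))  # prints: MRIFIS
```

The output matches the first six residues of the protein sequence listed above.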