Gene PCC8801_4227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4227 
Symbol 
ID7103782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4435581 
End bp4436954 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content44% 
IMG OID643477209 
ProductO-antigen polymerase 
Protein accessionYP_002374308 
Protein GI218248937 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR00947] probable bicarbonate transporter, IctB family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCTG TTTGGCATCA ATTCACCCTT TCGGACTTCT CTCCCTACCG ATGGCTTGCT 
GCAAGCTATG TATACCGCAT CATCGGGTTA TTAGGACAAT GGAAACAAGG CAGTTTTCTC
CTACAATGGG GAGAACCCCT AGGGGCACTA CTAATCAGCA TTGTCTTTAT TTTTGGACCA
TTTATCTCCA CAGGGTTGAT CGGACTCTGG TTATTCGCCT TAGCAGCCTA TTGGGGACTG
TTAACCCTTG CTGATAAAGG AAAACCGGGT ATTACCCCCA TTCATCTATT AGTCATGGTG
TATTGGGGAA TGGCCGCGAT CGCCGTTGCC TTGTCTCCCG TCAAAACAGC AGCCTTGACC
GGGTTTGTGA AATTAAGCCT ATATCTGCTA TTTTTTCTAC TATCAGCGCG AATTTTGCAA
TCTCCTCGCC TCACCAATGG CTTAATTACA GTTGTCTTAC TGATCGGGTT GGTGGTGAGT
TCTTATGGAG TCAGACAAAA CTTTTTTGGA GTAGAACAAT TAGCCACTTG GAACGATCCC
ACCTCTGAAT TAGCCCAAGC AACCCGCGTT TATAGCTATT TAGGCAATCC TAACCTACTC
TGTTCCTATT TATTCGCTGC GATCGCCCTT AGTATCGGCG CGGTTTTTGT TTGGCAAGGA
CGACTCCCCA AAGCGTTAGC GGTAACGATG GTTCTGGTTA ATTCATCGTG TCTCTACTTT
ACGGGAAGTC GAGGCGGTTG GATCGGCATG ATGGCTTTAT TGGTTAGCTT TGCTTTGTTG
CTGTTTGTCT GGTTTCGGGA TAGTTTACCC CCCTTTTGGC GTAAATGGCT ATTACCTTTA
GTTTTAGGGG GTTTTGCCGG GGTTGTTCTC GTTGCTATCG TCGCTTTAGA ACCGATAAGA
TTACGAGTCA TGAGTATTTT TGCGGGACGA GAAGACAGTA GTAATAATTT TCGGATGAAT
GTTTGGATGG CAGCGATCGA GATGATTAAA GATTATCCCC TAACGGGTAT TGGACCGGGG
AATGCTGCTT TTAATAGTAT TTATCCCCGT TATATGAGTC CTAAATATAG TGCCCTAAGT
TCCTATTCTA TCTTTTTAGA AAACGCTGTA GAAATGGGAT TAATTGGACT AAGTATTTTC
CTTTGGTTGA TTATTGTGAC GGTTAATCAA GGAATCGCAC AAATGCAACG GTTACGGTTA
GAAAATAATC GCCAAGGGAT TTGGTTAATT GCCGCGATCG CCGGAATGGC TGGTTTATTA
GGACAAGGTT TAGTAGATAC GGTTTGGTAC CGTCCCCAAG TGAATATTTT CTGGTGGTTT
TTAGTCGCTT TAATTGCCAG TCAATATCAG TTTAAAGGCA ATGGGGAACA GTAA
 
Protein sequence
MNPVWHQFTL SDFSPYRWLA ASYVYRIIGL LGQWKQGSFL LQWGEPLGAL LISIVFIFGP 
FISTGLIGLW LFALAAYWGL LTLADKGKPG ITPIHLLVMV YWGMAAIAVA LSPVKTAALT
GFVKLSLYLL FFLLSARILQ SPRLTNGLIT VVLLIGLVVS SYGVRQNFFG VEQLATWNDP
TSELAQATRV YSYLGNPNLL CSYLFAAIAL SIGAVFVWQG RLPKALAVTM VLVNSSCLYF
TGSRGGWIGM MALLVSFALL LFVWFRDSLP PFWRKWLLPL VLGGFAGVVL VAIVALEPIR
LRVMSIFAGR EDSSNNFRMN VWMAAIEMIK DYPLTGIGPG NAAFNSIYPR YMSPKYSALS
SYSIFLENAV EMGLIGLSIF LWLIIVTVNQ GIAQMQRLRL ENNRQGIWLI AAIAGMAGLL
GQGLVDTVWY RPQVNIFWWF LVALIASQYQ FKGNGEQ