Gene PCC8801_3169 details


Gene Information

Locus tag: PCC8801_3169
Symbol:
ID: 7105890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3314210
End bp: 3316531
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 43%
IMG OID: 643476193
Product: Cellulose synthase (UDP-forming)
Protein accession: YP_002373304
Protein GI: 218247933
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
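
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3314210, 3316531)
print(length_bp)                  # 2322
print(protein_length(length_bp))  # 773
```

Both values match the Gene Length and Protein Length fields of this record.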


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAAAA AGTCAAATTT GGGTTCAAGC ACCCTTACTA AACCGAGAAA ACGGCGATCG 
CTGAAAGCAC GCCTATGGCG ATCGCGTTTT TTGCCCCTTC ATCTAGCTAC CCTAGTGATG
TTAGTGGGTG TTGGCCTCTT TATTGCCTTG TCTTTGAGTT GGTTACTGGG CAATCCTACC
ATCACTAATT TAGCCCTGAG TATCCATCAG CAGCAACTCG ATCCCCCCTG GTTTGTCCGT
GTTCCTGAAA CTCCCTATCG ACAGTTTTTA ATTGGCATTT TTGTGGCATT AGTGGGGATA
ATCTTGGCGA TTACTCAGAC CGTCCCGAAA CCGACGCGAT GGACTAAAGC GATTATTGCT
GGTATTTTAA TCGCTTTGGC GGTTCGCTAC CTCTTTTGGC GGATTTTGGC TACGTTGAAT
TTAAGTAATG CACTGACAGG CATTTTAGCC CTCACCTTAT TATTCTTAGA AATTCCCTTT
GTTTTAGCGG GTTTACCCCA ACTTTTTTGG GTATCGAACA CGAGGAATTA CAGTAAGCAA
GCGGATGGTT ACGAAATTGC CGTTAAACAG GGAAAATATC GCCCTACAGT GGATATTTTT
ATCCCGACTT ATAATGAACC CGAATCGATT GTGCGACGAA CGATTATCGG CTGTCAAGCC
ATTGATTATG AACCAAAAAC GATTTATCTT TTGGATGATG GACAGCGATC GCCCATTGAA
TCCCTGGCTC AAGAATTGGG CTGTAACTAT ATTACCCGTA GCGATCGCCG TCACTATAAA
GCCGGAAATC TAAACAACGC CCTCCAATAC ACCCAAGGGG ACTTAATTGC CGTATTTGAC
GCGGATTTTG TCCCCACTCG CAACTTTTTG CTGCGAACGG TGGGCTTTTT TCAACAACCT
GATATCGGGA TTGTTCAATC TCACCAAAAT TACTACAATC CTGACGCAAT TGTTCGGAAT
TTGGGGTTAG CTCAGTATTT AACCAGTAAT CGAGAAGGTT TCTCTCGCTA CGTTCAACCT
ACCCTCGATA GTGTGGGAGC AACAGTTTGT GATGGGTCTG CTTTTGTCGT TCGTCGTCGA
GATCTCGATA AAATTGGTGG TTTTGTGACA GAATCTTTGT GTGAAGATTA TTTTACGGGA
ATTTTGTTAG ATTCCCACCA TCAGAAGGTG ATTTACCTCG ATGAAAATCT CAGTGCAGGT
CTGGCGGCCG AGAGTTTGAA TGATTACGTC GGCCAGTATC AACGCTGGTT GATGGGCAGT
TTACAGGCGT TTTTTATTAA AACAAACCCT CTGACCCTTT CTGGGTTAAC GCTACGGCAA
CGGATGGCTC ATCTGATGAG TTTAGTTTAC TGGTTGACGG GTTTTCCCCG TTTGCTGATT
TTGTTAGTTC CGATTATCTG CGGTTTAGCG TCGATTTTTC CGATTATTAT TACCCCTGAT
GATTGGCTGT ATTTCTTATT TTTGCCCCAT TTATTGCTAT TATTCTCGAT GCACTGGTTA
AGCGATCGCT CTACATCGAT GTTACTATCG GAAATTTACA CGATTATCCA TGCAATTCCT
TTTAGTTTAA CGGCGGTTCA AGTCTTTTTG CGACCCTTTT CTCGCGCGTT TCAGGTCACT
CCGAAGGGGT TTTTATCGGA CGGGTTTCGG GTCAATGCTT GGTTAACGAT TCCTTTGGGG
TTACTATGGC TAGGGAATGG AGGAATGCTG GTTAATTTTC TCTGGCAACG CATCTATAAC
CCCCAGGGTC TTCCGGCCGG GTTTGCTGAT ATTTGGGGAG GAACGGGCGG AATTTTGGTC
TTTTGGTGGG TTTATAATCT GATTTTCTTA GGGTTAGCTA TTTTAGCTTG TATTGATCCT
CCTAAACCAG AAACTTGCGA GTGGTTTAAG TTAGAACGAC CAATGGTTTT AAGTTGGGAA
AATAGTTTAA TCAAGGGAAT AACGCATCTG GTTTCTGAAA AAGGTGCGCG TGTTCGAGTG
AACACAAAAT CTCAGGAGAA ACTCAATATT TCCTGCGGAG ATGTTATCAG TATGGAAATT
AAATTAAATG AATGGCCAGG AAATCTTAGA GTAGAGGGAC GGGTTACGAA AATTATTAAC
ACGAAGGGTC ATTGTATTAA AGAAATTGAT ATTAAGTTTG AAGGGATGAC TTCTCAACAG
TATCGTCATT TAGTAGAACT GCTTTTTTGT CGGCCTGGTC AATGGGTGAG ACGAGAACAT
CCTAATGAAC TCATCACGCT TATCGCTTTA GGTAAACGAC TGTTACGACC TCGCTTTTTA
TTGAACAATG ATGAAGCTAT TGATGCTATT TCTATTCATT AA
 
Protein sequence
MPKKSNLGSS TLTKPRKRRS LKARLWRSRF LPLHLATLVM LVGVGLFIAL SLSWLLGNPT 
ITNLALSIHQ QQLDPPWFVR VPETPYRQFL IGIFVALVGI ILAITQTVPK PTRWTKAIIA
GILIALAVRY LFWRILATLN LSNALTGILA LTLLFLEIPF VLAGLPQLFW VSNTRNYSKQ
ADGYEIAVKQ GKYRPTVDIF IPTYNEPESI VRRTIIGCQA IDYEPKTIYL LDDGQRSPIE
SLAQELGCNY ITRSDRRHYK AGNLNNALQY TQGDLIAVFD ADFVPTRNFL LRTVGFFQQP
DIGIVQSHQN YYNPDAIVRN LGLAQYLTSN REGFSRYVQP TLDSVGATVC DGSAFVVRRR
DLDKIGGFVT ESLCEDYFTG ILLDSHHQKV IYLDENLSAG LAAESLNDYV GQYQRWLMGS
LQAFFIKTNP LTLSGLTLRQ RMAHLMSLVY WLTGFPRLLI LLVPIICGLA SIFPIIITPD
DWLYFLFLPH LLLLFSMHWL SDRSTSMLLS EIYTIIHAIP FSLTAVQVFL RPFSRAFQVT
PKGFLSDGFR VNAWLTIPLG LLWLGNGGML VNFLWQRIYN PQGLPAGFAD IWGGTGGILV
FWWVYNLIFL GLAILACIDP PKPETCEWFK LERPMVLSWE NSLIKGITHL VSEKGARVRV
NTKSQEKLNI SCGDVISMEI KLNEWPGNLR VEGRVTKIIN TKGHCIKEID IKFEGMTSQQ
YRHLVELLFC RPGQWVRREH PNELITLIAL GKRLLRPRFL LNNDEAIDAI SIH
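
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline used to generate this record. It builds the codon table from the amino-acid string in NCBI's T, C, A, G base order; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. The function `translate` is a hypothetical helper, and it assumes the input starts in frame at the initiator codon.

```python
from itertools import product

# Codon -> amino acid mapping in NCBI base order (T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace (spaces and line breaks, as in the listing above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCCAAAAA AGTCAAATTT GGGTTCAAGC"))  # MPKKSNLGSS
print(translate("ATTCATTAA"))                          # IH
```

The two fragments translate to the first ten and last two residues of the listed protein. An initiator codon other than ATG would need separate handling, since table 11 reads alternative start codons as methionine.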