Gene PCC8801_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1034 
Symbol 
ID7104252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1088376 
End bp1089545 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content37% 
IMG OID643474125 
Productglycosyl transferase group 1 
Protein accessionYP_002371265 
Protein GI218245894 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAT TACAAATTGT GCCTTCAATT TCTTTGGTTT ATGGTGGACC TAGTCAAATG 
GTTTTAGGAC TATCGGAAGC ATTAGCTAAC CAAGGTATTG ATGTTACTAT CTTAACTACC
AATTCCAATG GAGATGCCGG ACAACTCCCC CTAGATGTTC CTTTAGGCAT TCCTATCCAA
CAAAAAGGCT ATCAAATTAT TTATTTTCCC TGTTCTCCTT TCCGTCGCTA TAAGTTTTCT
TTGGATTTAT TAAAATGGTT AATCAACCAC GCTTCAAATT ACGATATTGC TCATATTCAT
GCCTTGTTTT CTCCTATTAG TACGGCTGCT GCTACGGTTG CTCGCTATTG CCAATTACCT
TATATTCTGC GACCTTTAGG AACCTTAGAT CCGGCTGATT TGCAGAAAAA AAAGCTTTTA
AAAAAAATTT ATGGAAACTG TCTAGAAAGA GCCAATTTAT TAGGTTCTGT AGCAGTCCAT
TTTACCACCG AGCAAGAGGC AAAAATTTCC CACCGTTACG GGGTTAAAAC CAATGATTTA
GTTATTCCTT TGGGGGTTAA TTTACCTGAT TATTTTCCTC CTGTGGGACA CACTAGACAA
CAATTAGGAA TTGCTAACGA TGTTCCTTTA GTCCTATTTA TGTCTCGTAT TGATCCCAAA
AAAGGCTTGG AGTTACTGTT AGAATCAGCC GAAAAGTTAG CAAAAAAAGG CGTTGAATTT
AAGTTAGTTA TAGCGGGGTC TAATCCTCAA GACCCGATTT ATGAGAAAAA AATTCAAGAA
AAAATTACTA ATTCTTGTTT AGCAAAACAA ACAGCTATTA CAGGGTTTGT TCAAGGAGAA
TTAAAGTTAG GTTTGCTACA AGATGCCGAT TTATTTGTGT TACCTTCCTA TTACGAAAAT
TTCGGTATTG CTGTTGCTGA AGCGATGGCA GTAGGGACTC CCGTAGTCAT CTCTCAAGGG
GTTTATATTT GGCCAGATGT TCAAAAAGCT GCTGCGGGTT GGGTGACATC AATGGATATA
GAAGACTTAA CCAATACCTT AGATGAGGCA ATTTTTAATC AAAATGAAAG GCAAAAACGC
GGACAAAATG CGCGTGAATT GGTTGTGAAA AACTATCTTT GGCCGACTAT TGCTCAACAA
ATGATTAACG CTTATAGCCA CTTTCAATAA
 
Protein sequence
MKVLQIVPSI SLVYGGPSQM VLGLSEALAN QGIDVTILTT NSNGDAGQLP LDVPLGIPIQ 
QKGYQIIYFP CSPFRRYKFS LDLLKWLINH ASNYDIAHIH ALFSPISTAA ATVARYCQLP
YILRPLGTLD PADLQKKKLL KKIYGNCLER ANLLGSVAVH FTTEQEAKIS HRYGVKTNDL
VIPLGVNLPD YFPPVGHTRQ QLGIANDVPL VLFMSRIDPK KGLELLLESA EKLAKKGVEF
KLVIAGSNPQ DPIYEKKIQE KITNSCLAKQ TAITGFVQGE LKLGLLQDAD LFVLPSYYEN
FGIAVAEAMA VGTPVVISQG VYIWPDVQKA AAGWVTSMDI EDLTNTLDEA IFNQNERQKR
GQNARELVVK NYLWPTIAQQ MINAYSHFQ