Gene PCC8801_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0906 
Symbol 
ID7102002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp952723 
End bp956181 
Gene Length3459 bp 
Protein Length1152 aa 
Translation table11 
GC content35% 
IMG OID643473999 
Productglycosyl transferase family 2 
Protein accessionYP_002371139 
Protein GI218245768 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG TCTATGTTTT ACCTTCAAAT TTAATCTGTG GCGGTCATAT AGTTGTCTTG 
CAACACATTA CCCGATTGCA AGCACGAGGA TATGATATTT CTATCTGTTT GTTAGATCGA
AATACTCCCG CGAGTCCTGA GTCTTTTGAA TGGTTTCCCT ATAGTCATAT ACCGGTTTTT
CATCCTGAAG CATTTCCCGA AGATGTTGAT ATTTGTGTAG CTACTTTTTG GGCAACCTTA
ATTCCTGTTC TGAATTTTCC AGCACAGCAT AAGGTTTATT TTGTCCAATC CGATGAAACT
CGTTTTTATG ATAATCCTAT CTATCAAAGT CAAGTCGCTT TAACCTATTT TTCTAATATC
CATTTTATGA CCGAAGCTCG CTGGATTGTT GAGTGGTTAA ATCTGAATTT TGGAAAATCT
GCTTGTTATA TTCCTAACGG ATTTGATCCG ACGATTATTC ATCGGGTTGA TCCCATCATT
CCCAAACCAA AAGATAAGTA TAGAATTTTA GTCGAAGGCT CAATTAATAC TCCTTTTAAA
CGAATTAAAG AAGCTTTAGA AGTTTGTTCT TATTTTGATT GTGAAGTCTG GTGTGTTTCT
AATCAAGGTA AACCTTGTTC GGATTGGAAA ATTGATCGAT TTTTTTATCA AGTTCCCTTT
AATGAAATGA AATCGATTTA CTCTAGTTGT GATATTTTGT TGAAATTGAG TAGTGTAGAA
GGGTTCTTTG GTCCGCCACT AGAAATGATG GCTTGTGGTG GTGTTTGTGT GGTCAGTGAT
GTCACGGGTT ATGATGAATA TATTGTTAAT GAATATAATG CTTTAGTTGT CCCCAATGGA
GAAATTGAAT TAGCCAGAAA AGCTATACAA AGGTTGATTG ATGATCGGGA TTTATATCGT
CAATTGCAAA AAAATGGACA ACAGACATCA CAACAAATGA AATGGGATAC TTCTATTAAT
AAATTAGAAT CTTTTTTCTT AGAAATTGCT CAAGGATCTC GTCTGAAATC CCATACCTCA
GCTTGGATTA AGCAAATTAA TCAAACCTTA AATGCTTATC TTAGTCTCTC TCAAGAAATC
GAAATGCTAC GGTTGAAAGC GCAAGAAGTA GAAACGTTGC GATCGCAGGT TCAAGCCTTA
GAAAATAGTT CTGGTTTTCA ACACAGTGGT AATTTTAACT TCAATCAATC AGTTTTTTCT
GTTTTCAAAA AAACCTCGAA TTTTTTCTAT CAATTTAGCG TTTTAGTTTT GACAAAACTC
TTTAAATTAG TCTATTGGAT CATCACTTTT CAATTTCCTC AAAAGCTAAG AAGATGGAGA
ATAGCCCAGA TAATTAGCAA AAGTGGTTTA TTTGATACAG GGTATTATTG GCAACAAAAT
CCTGACGTTA AATTATCTAA TGTTGATCCT CTTGGTCACT ATCTAGATAT CGGTGCATCC
CAAGGAAAAG ATCCTAACCC ACTTTTTGAT ACGTCTTTCT ACCTAGAGCA ATATCCTGAA
GTAACTCAAT TAGGAATGAA TCCCCTGGCT CATTATATTA TGATAGGGGC AAAAAAAGGA
AATAAACCCC ATCCTTTATT TGATACGACT TATTATCTAG AACAGTATCC CGATGTTGTT
GAATTAGGGA TCAATCCTTT ACTACACTAC TTAACCATTG GATTAGAAGA AAATCGCCAT
CCTTTTCCTT GTTCTACCAT TGAATTACCG TCTTTTAAAT TACGAGCTGA ACCGATTGTA
TCAGAGGAAA TTCCCAAAAG TTTAGATCCT AACTATCAAA CTTGGTTGAG TTTTAATTAT
CCTACGAGTG AGGATCTCAA CACAATGGCT GAACAGATTC AAAAATTAGA ATATCAGCCT
TTAATTAGTA TCATTATGCC TGTTTTTAAT CCTAAGATAG CTTATCTAAA AACGGCTATT
AATTCAGTCC TCAATCAAGT TTATCAAAAT TGGCAACTGT GTATTGCTGA TGATGCGTCA
ACTAACCCCC AAGTTTACGA GATTTTAGCA GATTATGCTG CTGAAGATAC ACGCATTAAA
GTAGTTTTTC GTCAAGAAAA TGGTCATATT GCTGAAGCGT CTAATTCAGC GTTTGAAATA
GCAATGGGAG AATTTATTGC ACTGCTTGAT CACGATGATG TTTTGACCCC CCATGCGTTA
TATCACGTCG TTTCTATGCT CAATGATCAT GCCGATGCTG ATATGATTTA TTCAGATGAA
GATAAAATTG ATGAACAAGG TTATTTAAGT GATCCCTTTT TTAAACCTGA TTGGTGTCCT
GATTCTTTTT TATCGAAAAT GTATACCTGT CATTTAGGGG TTTATCGTCG TTCTTTGGTT
GAACAAATAG GTGCATTTCG CGTCGGATAC GAAGGAAGTC AGGACTATGA TTTAGTGTTG
AGATTAACCG AAAAAACCGA TAAAATATTT CATATTCCTA ATGTTCTCTA CCATTGGCGG
ATTCATGCTC AGTCAACGTC TACTAATATT GATAGTAAAA ATTATGCAGT GATGACGGCT
AAAAAAGCGC TTTCTGAAGC GATTGAAAGA CGAGGAGAAC CAGGAACTGT TACGGATGTA
CCCTATTGTT TAGGAAACTA TCATATTCGT TATGAACTAA AAACTGATGA TTTAGTCAGT
ATTATCATCC CGACAAAGGA CTTAGGAGAT ACCCTAAATC AATGTTTAAA ATCAATTTTT
GAACAATCCA CCTATCCCAA CTTTGAGATC ATTCTCATTG ATAATGGTAG TACGGAAGAA
CGATCTCTAG AAGTCATGAA ACAATGGCAA GAAAAAGAAC CTGAAAAGTT AAAAGTTTTT
CCCTTAAAAA TTCCTTTTAA TTATTCACAA ATTAATAATT TTGCGGTTCA GCATTCTCAA
GGAAAATACC TACTTTTTCT CAATAATGAC ATCGAAGTAA TTACCCCTGA TTGGATTGAA
GCGTTAGTTG AACAAGCACA ACGTCCTTCC ATCGGTGCTG TCGGTGCATT ATTATTATTC
CCTGATGATA CTATCCAACA CGCGGGTGTC ATAGGAGGTA TTTTTTACTC GTGTGGACAC
AGTCATAAAC GCTTTCCGTT CCGTTCTCCA GGTTATTTTA ATCAACTCAA TACTATTACG
AATTATTCGG CTGTTACGGG TGCTTGTTTA ATGTGTCGGC GAGATGTTTT TGAAGAAATT
GGAGGGTTTG ATGAAACACT TGCGGTCAAT TATAATGATA TTGATCTCTG TTTTAAGATG
ATTGATAAGG GGTATCGAAA TATCTATTTA CCCCATGTTG TTCTCTATCA TTATGAATCG
AAGAGTCGAG GTTATGACAG CTTGAATAAT TTCAAAAAAG CACGACTATT TTGTGAAGGT
AAATATTTTC AAACTCAGTG GAAAGTGTTA ATAGAACATG ACCCTTGTTA TAATCCTAAC
CTGACTTTTG ATCAAGAGGA TTATAGTATT AGAACTTAG
 
Protein sequence
MKIVYVLPSN LICGGHIVVL QHITRLQARG YDISICLLDR NTPASPESFE WFPYSHIPVF 
HPEAFPEDVD ICVATFWATL IPVLNFPAQH KVYFVQSDET RFYDNPIYQS QVALTYFSNI
HFMTEARWIV EWLNLNFGKS ACYIPNGFDP TIIHRVDPII PKPKDKYRIL VEGSINTPFK
RIKEALEVCS YFDCEVWCVS NQGKPCSDWK IDRFFYQVPF NEMKSIYSSC DILLKLSSVE
GFFGPPLEMM ACGGVCVVSD VTGYDEYIVN EYNALVVPNG EIELARKAIQ RLIDDRDLYR
QLQKNGQQTS QQMKWDTSIN KLESFFLEIA QGSRLKSHTS AWIKQINQTL NAYLSLSQEI
EMLRLKAQEV ETLRSQVQAL ENSSGFQHSG NFNFNQSVFS VFKKTSNFFY QFSVLVLTKL
FKLVYWIITF QFPQKLRRWR IAQIISKSGL FDTGYYWQQN PDVKLSNVDP LGHYLDIGAS
QGKDPNPLFD TSFYLEQYPE VTQLGMNPLA HYIMIGAKKG NKPHPLFDTT YYLEQYPDVV
ELGINPLLHY LTIGLEENRH PFPCSTIELP SFKLRAEPIV SEEIPKSLDP NYQTWLSFNY
PTSEDLNTMA EQIQKLEYQP LISIIMPVFN PKIAYLKTAI NSVLNQVYQN WQLCIADDAS
TNPQVYEILA DYAAEDTRIK VVFRQENGHI AEASNSAFEI AMGEFIALLD HDDVLTPHAL
YHVVSMLNDH ADADMIYSDE DKIDEQGYLS DPFFKPDWCP DSFLSKMYTC HLGVYRRSLV
EQIGAFRVGY EGSQDYDLVL RLTEKTDKIF HIPNVLYHWR IHAQSTSTNI DSKNYAVMTA
KKALSEAIER RGEPGTVTDV PYCLGNYHIR YELKTDDLVS IIIPTKDLGD TLNQCLKSIF
EQSTYPNFEI ILIDNGSTEE RSLEVMKQWQ EKEPEKLKVF PLKIPFNYSQ INNFAVQHSQ
GKYLLFLNND IEVITPDWIE ALVEQAQRPS IGAVGALLLF PDDTIQHAGV IGGIFYSCGH
SHKRFPFRSP GYFNQLNTIT NYSAVTGACL MCRRDVFEEI GGFDETLAVN YNDIDLCFKM
IDKGYRNIYL PHVVLYHYES KSRGYDSLNN FKKARLFCEG KYFQTQWKVL IEHDPCYNPN
LTFDQEDYSI RT