Gene Cyan8802_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0933 
Symbol 
ID8390242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp950956 
End bp954414 
Gene Length3459 bp 
Protein Length1152 aa 
Translation table11 
GC content35% 
IMG OID644978949 
Productglycosyl transferase family 2 
Protein accessionYP_003136702 
Protein GI257058814 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG TCTATGTTTT ACCTTCAAAT TTAATCTGTG GCGGTCATAT AGTTGTCTTG 
CAACACATTA CCCGATTGCA AGCACGAGGA TATGATATTT CTATCTGTTT GTTAGATCGA
AATACTCCCG CGAGTCCTGA GTCTTTTGAA TGGTTTCCCT ATAGTCATAT ACCGGTTTTT
CATCCTGAAG CATTTCCCGA AGATGTTGAT ATTTGTGTAG CTACTTTTTG GGCAACCTTA
ATTCCTGTTC TGAATTTTCC AGCACAGCAT AAGGTTTATT TTGTCCAATC CGATGAAACT
CGTTTTTATG ATAATCCTAT CTATCAAAGT CAAGTCGCTT TAACCTATTT TTCTAATATC
CATTTTATGA CCGAAGCTCG CTGGATTGTT GAGTGGTTAA ATCTGAATTT TGGAAAATCT
GCTTGTTATA TTCCTAATGG ATTTGATCCG ACGATTATTC ATCGGGTTGA TCCCATCATT
CCCAAACCAA AAGATAAGTA TAGAATTTTA GTCGAAGGCT CAATTAATAC TCCTTTTAAA
CGAATTAAAG AAGCTTTAGA AGTTTGTTCT TATTTTGATT GTGAAGTCTG GTGTGTTTCT
AATCAAGGTA AACCTTGTTC GGATTGGAAA ATTGATCGAT TTTTTTATCA AGTTCCCTTT
AATGAAATGA AATCGATTTA CTCTAGTTGT GATATTTTGT TGAAATTGAG TAGTGTAGAA
GGGTTCTTTG GTCCGCCACT AGAAATGATG GCTTGTGGTG GTGTTTGTGT GGTCAGTGAT
GTCACGGGTT ATGATGAATA TATTGTTAAT GAATATAATG CTTTAGTTGT CCCCAATGGA
GAAATTGAAT TAGCCAGAAA AGCTATACAA AGGTTGATTG ATGATCGGGA TTTATATCGT
CAATTGCAAA AAAATGGACA ACAGACATCA CAACAAATGA AATGGGATAC TTCTATTAAT
AAATTAGAAT CTTTTTTCTT AGAAATTGCT CAAGGATCTC GTCTGAAATC CCATACCTCA
GCTTGGATTA AGCAAATTAA TCAAACCTTA AATGCTTATC TTAGTCTCTC TCAAGAAATC
GAAATGCTAC GGTTGAAAGC GCAAGAAGTA GAAACGTTGC GATCGCAGGT TCAAACCTTA
GAAAATAATC CTGGTTTTCA ACACAGTGGT AATTTTAACT TCAATCAATC AGTTTTTTCT
GTTTTCAAAA AAACCTCGAA TTTTTTCTAT CAATTTAGCG TTTTAGTTTT GACAAAACTC
TTTAAATTAG TCTATTGGAT CATCACTTTT CAATTTCCTC AAAAGCTAAG AAGATGGAGA
ATAGCCCAGA TAATTAGCAA AAGTGGTTTA TTTGATACAG GGTATTATTG GCAACAAAAT
CCTGACGTTA AATTATCTAA TGTTGATCCT CTTGGTCACT ATCTAGATAT CGGTGCATCC
CAAGGAAAAG ATCCTAACCC ACTTTTTGAT ACGTCTTTCT ACCTAGAGCA ATATCCTGAA
GTAACTCAAT TAGGAATGAA TCCCCTGGCT CATTATATTA TGATAGGGGC AAAAAAAGGA
AATAAACCCC ATCCTTTATT TGATACGACT TATTATCTAG AACAGTATCC CGATGTTGTT
GAATTAGGGA TCAATCCTTT ACTACACTAC TTAACCATTG GATTAGAAGA AAATCGCCAT
CCTTTTCCTT GTTCTACCAT TGAATTACCG TCTTTTAAAT TACGAACTGA ACCGATTATA
TCAGAGGAAA TTTCCAAAAG TTTAGATTCT AACTATCAAA CTTGGTTGAG TTTTCATTGT
CCTACAAATG AGGATCTCAA TACGATGGCT GAACAGATTC AGAAATTAGA ATATCAGCCT
TTAATTAGTA TCATCATGCC TGTTTTTAAT CCTAAGATAG CTTATCTAAA AACGGCTATT
AATTCAGTCC TCAATCAAGT TTATCAAAAT TGGCAACTGT GTATTGCTGA TGATGCGTCA
ACTAACCCCC AAGTTTACGA GATTTTAGCA GATTATGCTG CTGAAGATAC ACGCATTAAA
GTAGTTTTTC GTCAAGAAAA TGGTCATATT GCTGAAGCGT CTAATTCAGC GTTTGAAATA
GCAATGGGAG AATTTATTGC ACTGCTTGAT CACGATGATG TTTTGACCCC CCATGCGTTA
TATCACGTCG TTTCTATGCT CAATGATCAT GCCGATGCTG ATATGATTTA TTCAGATGAA
GATAAAATTG ATGAACAAGG TTATTTAAGT GATCCCTTTT TTAAACCTGA TTGGTGTCCT
GATTCTTTTT TATCGAAAAT GTATACCTGT CATTTAGGGG TTTATCGTCG TTCTTTGGTT
GAACAAATAG GTGCATTTCG CGTCGGATAC GAAGGAAGTC AGGACTATGA TTTAGTGTTG
AGATTAACCG AAAAAACCGA TAAAATATTT CATATTCCTA ATGTTCTCTA CCATTGGCGG
ATTCATGCTC AGTCAACGTC TACTAATATT GATAGTAAAA ATTATGCAGT GATAACGGCT
AAAAAAGCGC TTTCTGAAGC GATTGAAAGA CGAGGAGAAC CAGGAACTGT TACGGATGTA
CCCTATTGTT TAGGAAACTA TCATATTCGC TATGAACTAA AAACTGATGA TTTAGTCAGT
ATTATCATCC CGACAAAGGA CTTAGGAGAT ACCCTAAATC AATGTTTAAA ATCAATTTTT
GAACAATCCA CCTATCCCAA CTTTGAGATC ATTCTCATTG ATAATGGCAG TACGGAAGAA
CGATCTCTAG AAGTCATGAA ACAATGGCAA GAAAAAGAAC CTGAAAAGTT AAAAGTTTTT
CCTTTAAAAA TTCCTTTTAA TTATTCACAA ATTAATAATT TTGCGGTTCA GCATTCTCAA
GGAAAATACC TACTTTTTCT CAATAATGAC ATCGAAGTAA TTACCCCTGA TTGGATTGAA
GCGTTAGTTG AACAAGCACA ACGTCCTTCC ATCGGTGCTG TCGGTGCATT ATTATTATTC
CCTGATGATA CTATCCAACA CGCGGGTGTC ATAGGAGGTA TTTTTTACTC GTGTGGACAC
AGTCATAAAC GCTTTCCGTT CCGTTCTCCA GGTTATTTTA ATCAACTCAA TACTATTACG
AATTATTCGG CTGTTACGGG TGCTTGTTTA ATGTGTCGGC GAGATGTTTT TGAAGAAATT
GGAGGGTTTG ATGAAACACT CGCGGTCAAT TATAATGACA TTGATCTCTG TTTTAAGATG
ATTAATAAGG GGTATCGAAA TATCTATTTA CCCCATGTTG TTCTCTATCA TTATGAATCG
AAGAGTCGAG GTTATGACAG CTTGAATAAT TTCAAAAAAG CACGACTATT TTGTGAAGGT
AAATATTTTC AAACTCAGTG GAAAGTGTTA ATAGAACATG ACCCTTGTTA TAATCCTAAC
CTGACTTTTG ATCAAGAGGA TTATAGTATT AGAACTTAG
 
Protein sequence
MKIVYVLPSN LICGGHIVVL QHITRLQARG YDISICLLDR NTPASPESFE WFPYSHIPVF 
HPEAFPEDVD ICVATFWATL IPVLNFPAQH KVYFVQSDET RFYDNPIYQS QVALTYFSNI
HFMTEARWIV EWLNLNFGKS ACYIPNGFDP TIIHRVDPII PKPKDKYRIL VEGSINTPFK
RIKEALEVCS YFDCEVWCVS NQGKPCSDWK IDRFFYQVPF NEMKSIYSSC DILLKLSSVE
GFFGPPLEMM ACGGVCVVSD VTGYDEYIVN EYNALVVPNG EIELARKAIQ RLIDDRDLYR
QLQKNGQQTS QQMKWDTSIN KLESFFLEIA QGSRLKSHTS AWIKQINQTL NAYLSLSQEI
EMLRLKAQEV ETLRSQVQTL ENNPGFQHSG NFNFNQSVFS VFKKTSNFFY QFSVLVLTKL
FKLVYWIITF QFPQKLRRWR IAQIISKSGL FDTGYYWQQN PDVKLSNVDP LGHYLDIGAS
QGKDPNPLFD TSFYLEQYPE VTQLGMNPLA HYIMIGAKKG NKPHPLFDTT YYLEQYPDVV
ELGINPLLHY LTIGLEENRH PFPCSTIELP SFKLRTEPII SEEISKSLDS NYQTWLSFHC
PTNEDLNTMA EQIQKLEYQP LISIIMPVFN PKIAYLKTAI NSVLNQVYQN WQLCIADDAS
TNPQVYEILA DYAAEDTRIK VVFRQENGHI AEASNSAFEI AMGEFIALLD HDDVLTPHAL
YHVVSMLNDH ADADMIYSDE DKIDEQGYLS DPFFKPDWCP DSFLSKMYTC HLGVYRRSLV
EQIGAFRVGY EGSQDYDLVL RLTEKTDKIF HIPNVLYHWR IHAQSTSTNI DSKNYAVITA
KKALSEAIER RGEPGTVTDV PYCLGNYHIR YELKTDDLVS IIIPTKDLGD TLNQCLKSIF
EQSTYPNFEI ILIDNGSTEE RSLEVMKQWQ EKEPEKLKVF PLKIPFNYSQ INNFAVQHSQ
GKYLLFLNND IEVITPDWIE ALVEQAQRPS IGAVGALLLF PDDTIQHAGV IGGIFYSCGH
SHKRFPFRSP GYFNQLNTIT NYSAVTGACL MCRRDVFEEI GGFDETLAVN YNDIDLCFKM
INKGYRNIYL PHVVLYHYES KSRGYDSLNN FKKARLFCEG KYFQTQWKVL IEHDPCYNPN
LTFDQEDYSI RT