Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2933 |
Symbol | |
ID | 7104474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3027949 |
End bp | 3031482 |
Gene Length | 3534 bp |
Protein Length | 1177 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643475969 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002373085 |
Protein GI | 218247714 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCCAAG AAACTGATTA CATAATTCAT GATAAATGTC CGTTTTGCCA ATCGAGAGAT CTTGATAAAT ATAAGCAAAG AGCCGATCAG TTATGGGTTT TACTTTGCCA TAATTGCCGC TTAGGATTTG TGGAAAAGTA TCCAAAAAAC TTGGCACAAT GGTATGACTT AGAATACTAT GAAAAGTCTA CTGGGCAGGC AAATTCAGGC ATTGGCTATG ATAATTATCA AGACGTTGCT TATGATTATT TTCTTTGGGC GATCGCATTA GTGGCTTTAA CGAAAACCAA AGGATCTTTG TTTGATTTGG GCTGTAGTAA TGGCTTATTT CTTGATTTAG CTAAAAGTTA TGGGTGTACC GATCTTGGGG GTGTTGAGTT AACCCCTGAA TATGCAGAAA TTGCTCAGCA AAAAGGTTAC TCTGTTTATA ACCAAAGTTT CCTTGATATT CAATTCGATG CTAATCAAAA ATATGATATA GTAACAGCTT GGGCTGTCTT AGAACACATC CCTGAACTCA ACGAAACCTT AGCAAAAATC AAATCGATTC TCAAACCCGA AGGACTGTTA TTTTTTGAGG TTCCTTGTAT TGTATTCGAT GAAAGGGATG ATTATTGGCT CAATTCATCT TTAGAACATA TTTATTACTT TACTGAAGAA AGTTTTAAAA CCATTCTGAA ACGCCATTTC AATAACTGTT ATATAGGGGG CGTTTTTTCC CTAGATGGGT ATGGTGCAAG TCTAGCGGGG TTTGTCAGTA ATTCTCCTGA GAAAACCATC GACTATCAAC CGATTAATGA TTACTTGAAG TCGATGACAA GAGTTGATTT ATCGACTTTA AGCGAGGTCG AAATTATTTG TTATTTTATC ATTCACTTCA GATATACGCA GAATATTGAA GCTTGCAAAA CCATTGTTGA TTACTTGAGT CAAAAAGAAA CTAATAGTGA TCAAGCAGTT AGTTTAAACC TATATTATTG GTCTTATCTT GCTTCTAATT TTCTGAAAGC TCATTTAGAC AATCAGAACT ATATCGAAGC AAAAGACTAC TTTTTAGAAC AGATTAGTGG ACTTGAATCT AACCTAGAAA ACCATCAGTT ATCCATTAAA ACGTTACAAC AACAATTAGC AGAAGAAGTA GCAGAAGAAG CCGCTAATTA TCGTAAGCTT TATCAAGAAC TGATTCGCAG TAATAACCAA TTAGAACAAG TCAATCATGA CTTAGCACAA ACCCAAGAAC AACTTCATCA AACTCAAGGA GAACTTCATC ACACCCAAGA ACAACTCCAT CACAAGCAAG GAGAACTGGT TCAAACCCAA GAACAACTCT ATGAGAAGTA TCATCAAATT GATCAAATTA CCCATGAAAG AAACTATTGG AAATCTCGTG TAGAAGCCAT AGAAACAAGC AAGTTCTGGA AACTCAGAGA TCAATGGTTT AAAGTCAGAA GTCTCGTAGG AGCTAGGAAT GAAAATCTAT CTTTCTTGCA GTCTTTAGTA ACTCCATTAC CCCCTGAACA AAGCAACAAA CAAGAGTTAT TGCTTGATAA TAATTTAGCT TTAGAAGAGA GTGAAAGTAG TCCCATTGAA GTGATGACTC AAGAAGAATT AGAACCCATT CAAGAAACTC CAGTTCAAAG GATGACTCAA GAAAAATGGG ATCAAAATCT TCCCTTAGTG AGTGTAATTA TTCCTTGTTA TAATTATGGC CAATATCTAG AGGAAGCCAT CGACTCAGTT TTGCAGCAGA CTTTCCAAAA TTTTGAGATT ATTGTGGTTG ATGATGGGTC AACCGATTCC AAAACCCAAG AGGTATTAGA TAATCTCAAT AAACCAAAAA CTACCCTGAT TCGACAAAAG AATCAAGGCG TTGCGATCGC GCGAAATGAA GGCATTTTTC AGGCTAAAGG GAAATACATC TGTTGTCTAG ACGCTGATGA TAAACTCAAA CCAGCCTATC TAGAGAAATG TTTGATTAAA CTCGAAACCG AAAATTTAGA TATTTGCTAC ACTTGGATTC AAGAATTTGA GGAAAGTGAT CTTGTCTGGA AAACGGCTTC TTTTGAGTTA AGCAAATTAC TCGAAGAAAA CTGTCTTGAA GTATCGGCTG TTTTTCGCCG TGATATCTGG GAAAAAGTTG GCGGATATGA CCCCCAAATG GCTTACGAAG ATTGGGATCT TTGGATTACA ATGGCTAAGA TGGGAGCCAT TGGTGATGTT ATCCCAGAAC CCCTATTTTT GTATCGGAAA CATGGGATAT CCAAGCACGA TCTTGACTTC AGTAACCATG AAGAAATCAA GCAAAAAATT GAGACTAAAC ACCAAAAATT ATACCAAGAA CCCGACCGTG TTCGTGCCAT TGAACAAGCT AAACCTAAAT ATGGCGTTCA GGATGGCTAT AAAAACCTCC TAATTCAGTC AGAAAATTCC CCTAATGAAA AACGCAAAAC CTTACTTTAT GCCCTTCCCT TTACCGTTAT GGGGGGTGTA GATACAGTAT TGTTAACGTT GATGAAAAAT TTCAAACAAC AGGGGTTTGA TATTTATGTT TTAACGACCC TAAGACCCCT AACTCCCAAA GAAGATACGA CGGAAAGATA CGAAGAAATT GTCGATGGAA TTTATCACTT TCCCAACTTA TTAACGGAGG ATAAGTGGCC AGAACTGGTT AATTACTTGA TTGAATCTAA GCAAATTGAT CTGGTTTTAA TGGCCGGTTC GAGTTACTTT TATTCCCTAA TTCCAGACCT GAAGGATCGC TATCCTAATC TGAAAATTGT CGATCAACTT TACAACGAAT ATGGTCACAT TGCCAATAAT CGGAAATACG CGGATTATAT TGATCTCAAT ATTGTCGAAA ATGAACGGGT TAAAACTTGT TTATTAGATG AGTATGAGGA GAAACCTGAA AAAATTTCTC TGATTACTAA CGGGGTCGAT ATTAACCATT TCAATCCTGA TTTCATTGAA GCTTCAAACT TACCCTCCTT AGTCATTCCT CCCGAAAAGT TTGTCATTTC CTATATTGGT CGTTTCTCAG AGGAGAAATG TCCAGAGGTC TTTGTAGAAA TCGTTAATCA CTTTAAAAAT GACCACAGAC TGTGCTTTAT TATGGCAGGG TACGGACCCA TGGAAGACCA AATTAAAGAC CAGATTAAAA CCTATGGGTT AGAGTTCCGG ATTCACTTTC CGGGGATTGT GGAAACCAAA CCCTATTTAG CTATCACTGA CTTGATGATT CTGCCCTCAA AAATTGATGG CCGTCCTAAT ATTGTGCTAG AAAGTTTAGC GATGGGCATC CCTGTAATTG CCTCAGCCAT TGGGGGACTA CCCCAGATTA TTCAAGACGG TGACAATGGC TTTCTGTGTG ATCCCGATAA TACAGAGGAA TTTATCGAGA AAATTGAAAA AATTACTTCA GATACCAACT TGTATCAACA AATGAAGCAA AACGCGAGAA AATATGCTGT TAAGTCCTTG GATATGGCAG TTATGAAAAC CCAGTATCTG GAGTTGATTA ATCGTCTTAT TTAA
|
Protein sequence | MFQETDYIIH DKCPFCQSRD LDKYKQRADQ LWVLLCHNCR LGFVEKYPKN LAQWYDLEYY EKSTGQANSG IGYDNYQDVA YDYFLWAIAL VALTKTKGSL FDLGCSNGLF LDLAKSYGCT DLGGVELTPE YAEIAQQKGY SVYNQSFLDI QFDANQKYDI VTAWAVLEHI PELNETLAKI KSILKPEGLL FFEVPCIVFD ERDDYWLNSS LEHIYYFTEE SFKTILKRHF NNCYIGGVFS LDGYGASLAG FVSNSPEKTI DYQPINDYLK SMTRVDLSTL SEVEIICYFI IHFRYTQNIE ACKTIVDYLS QKETNSDQAV SLNLYYWSYL ASNFLKAHLD NQNYIEAKDY FLEQISGLES NLENHQLSIK TLQQQLAEEV AEEAANYRKL YQELIRSNNQ LEQVNHDLAQ TQEQLHQTQG ELHHTQEQLH HKQGELVQTQ EQLYEKYHQI DQITHERNYW KSRVEAIETS KFWKLRDQWF KVRSLVGARN ENLSFLQSLV TPLPPEQSNK QELLLDNNLA LEESESSPIE VMTQEELEPI QETPVQRMTQ EKWDQNLPLV SVIIPCYNYG QYLEEAIDSV LQQTFQNFEI IVVDDGSTDS KTQEVLDNLN KPKTTLIRQK NQGVAIARNE GIFQAKGKYI CCLDADDKLK PAYLEKCLIK LETENLDICY TWIQEFEESD LVWKTASFEL SKLLEENCLE VSAVFRRDIW EKVGGYDPQM AYEDWDLWIT MAKMGAIGDV IPEPLFLYRK HGISKHDLDF SNHEEIKQKI ETKHQKLYQE PDRVRAIEQA KPKYGVQDGY KNLLIQSENS PNEKRKTLLY ALPFTVMGGV DTVLLTLMKN FKQQGFDIYV LTTLRPLTPK EDTTERYEEI VDGIYHFPNL LTEDKWPELV NYLIESKQID LVLMAGSSYF YSLIPDLKDR YPNLKIVDQL YNEYGHIANN RKYADYIDLN IVENERVKTC LLDEYEEKPE KISLITNGVD INHFNPDFIE ASNLPSLVIP PEKFVISYIG RFSEEKCPEV FVEIVNHFKN DHRLCFIMAG YGPMEDQIKD QIKTYGLEFR IHFPGIVETK PYLAITDLMI LPSKIDGRPN IVLESLAMGI PVIASAIGGL PQIIQDGDNG FLCDPDNTEE FIEKIEKITS DTNLYQQMKQ NARKYAVKSL DMAVMKTQYL ELINRLI
|
| |