Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3255 |
Symbol | |
ID | 7104097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 3394421 |
End bp | 3397450 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643476274 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002373384 |
Protein GI | 218248013 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTC TGATTGCTGA TTTTGATTTA TTTAGTAAAG TCGGAGGAGG TCAAACCTTC TATCGCAGTA TTATTAAGAA AAATCCTCAA ATTGATTTTT ACTATATTGC CGAGAAAGAA CCATTACAAA CAACCCGTCC ACCTAATGCT CACGCTATCC CTTACCAAGA ACAATTATTA CTAAGTGACT TCAAAAATTT CTTCGAGGTT ACGCCGCCGA AATGGGTTGA AAGAGCCTTC GTTATGGCCA GCAATATTGC GGCATCGGTA GCGAATCAAC AATTTGATGT CATTGATGTC CCTGACTATG AACAATGGGG AATGTTTCTA AGGCCTGCGC TGCAACATTA TCGGGTTAAT TTTAGCAAAA TTGCCCTATC AATGCACGGA AAAATCTCTA AAACCTTGCG TCTTGATTGG TTTGATTTTG GTAAGGATAA TATCCCCCTT GATCTACAGG AAAAAATGCA GTACAAAACT GTAGATATGC GCTATGGTAT TAGCAAGAGT TATCTAGATG AATGGCGAGA AATTTCGGAC TTAGAGTCCC ATTATTATCA TCCACTTCAT TTTCTTGACC TTCCTAAACC GACGAGAAAC TTACCTTCAG AAGCACCCCC AAGTTTAAAT TTTATCGGTC GAACAGAAAA GCGTAAAGGA CCCGATATTT TTATTGATTT AGTTTGGTGG CTTCCCCGTT CTAGTTACAG CAAAGCTCAG CTTATCGGTC CCCACAGCTA CAACTACAAC AACACCCAAT CTTCTGAATC AATTTTACAA AGCATGGTGA AAAACCGACA AAAAAATATT GCCATGCGTC CTCCCATGAA ACGGGAAGAA TTAGCCGAAT TATTTGCTAG TAAATCCGTT ACTTTCCTTC CTTCGAGATA CGATACTTTA AATCTTGTGG CTTTGGAATC GCTATTTTCA GGTTGTCCAA CAGCTATCGG AAATGGTGCA GGGATCTGTC GTTTCTTAAC GGAAAGTTTT CCAGAAGTTC CCTTCATTAA TATCGATATT AATGACATTT ACTCCTCTCT TCCTGCTATT GGTCAAGTGC TAGATAATTA TGAAGATTAT CGTCGAAAAT TAGTTGATAC TTTGTTATCA ATTAATTTAG AAGTCACTGA TCCCATTTTA GAAGAAATTT ATAATCAATC TGTCGTATCT GATGGAGAAA CTCAACAGGA ATTAGAGCAA TGGTATAGTC AATTAATGAC CTATTGGGAA GATGGTCAGC AAGACTTTGG TATCTATAAA ATTCCTGGGG TAAAATTAGT TAAATCTCAA CTAAAATCCC AACTTAAACC AACCTATAAG CAATTCAAAG CAACCCTTAA TGATGTTAAA GCACAACTCA TTAAGCCATT GGAAGATAGC CGTAATGCTC AAACGGTAAA AGCTAGTAAA TTAGTGAGTC GGTATAAAGC AACGTTTAAT GCGTCAGAAT TGAATCAAAA AGATTTAACC TACAAAGTCA AAGAATGTTG GCGGTTAGGT TCAGCTTTTG GAGAATATGT TTCTGCTTTT CAAGAGTCTG CTAATCTGAG TAGCTTAAAA GATAAGTTAA GCAATGGTTA CCAAATTGAT CGGATTCGCC TGTGGCGAGA AATTGCTCGA ATTGAAGAAT TACGAGGTAA TGACTTAGTA TCAGCTACCT ACAAATTGCG CGGAATACGC TTACTAAATA ACGATATTTT TGGCGATCTT TCCTTTGTTT TACGAACCCT AGAAACCAAA GGATTTACCC GTGAAGCGCA AGCAGTAGAG GCAATATATG GCAAACCAGC CGAACGAAAT GAGCGTTGTC AAGAACTGAT TGAACAATCT TTGCGGGATA ACGCCGTTAA CCCAGAAAGA GAATACGAAT TTATTGATGA TCGCCGTCAG AAATCAACCT ACAGGGTTAG TATTATTGTC TCGCTTTATA ACGCAGCCGA TAAGCTTTCT TTATTCCTAA AAGCTTTACA GCATCAAACT TTAATCAACA AGGGAGAAGC TGAAATTATC CTAGTTGATA GTGGATCTCC AGGGGATGAA TACGCTGTCT TTAAACAGTT AGAACCAAAA CTAAATATTC CCATTATTTA CGCGCGATCG CAGCAAAGAG AAACTATTCA AACCGCTTGG AACCGAGGAA TTTCTCTCTC CAAAGCACCC TATTTATGTT TCTTAGGAGT TGATGAAAAC ATCTTACCCG ACTGTTTAGA AGTCTTATCC AAAGAATTAG ATAAAGATCC CCAACTTGAT TGGGTCATTG GTCATAGTTT AGTCACCAAT GTTGATAAAC AAGGTAGTTG GGTTAGTGAT ATTATGCCCT ACTATCGCAA GGAATATAAG CAAGATTTAG TCTACTTAGA AACCTGCTAT TTATCATGGG TAGGAGCTCT CTATCGTCGC TCAATTCATG AGCGATTTGG TTACTATGAT GGTAGCTTCC GAGGTGCAGG AGATACCGAG TTTAAAAACC GTGTTTTACC CTTTATTAAA AGTAAAGTAG TTGACCGAAC GTTAGGGGTT TTTTGGAACT ATCCTGATGA ACGAACCACC CAAAGTCCTC TCGCAGAATT AGAAGATATG AGAGCCTGGT ATCTTCATCG CACTTTACCC GGTATTCGAT ATGCCTTTTC TAACCGAAAA GTAGAGGAAT TAGAGCAACT GATTTACTTA TCTCTGTGCT ACCGCAAATC CTATTGTACC CACACCAGTA GTGATTTAGA ATATGCCTAT AATTTAAGTC TTTATCTCAA AGAAATTGCT CCTAATTCTC AAGCTCTCAA GTATTTTAAG GGGATAGAAA CCTTGCTAAA TGCCTATCGT GGTTTAGACT GGATGCCAAA ACTATCTCGC TTTTCTCCTA TTGCTAAAGT TTTAGAAACT CGCAAAATTG CTCACACAAT CCAACAGGAA CATCAAACTT CTTGGAATAA AGAAATGAGT TTAGGTTTAG TACCGAATTA TAAAATTTTT AATGACAACC GTCATGAACA ACATTCTGTC CTTTGGTTAA CTGAGGTACC TAAAAATTAA
|
Protein sequence | MKVLIADFDL FSKVGGGQTF YRSIIKKNPQ IDFYYIAEKE PLQTTRPPNA HAIPYQEQLL LSDFKNFFEV TPPKWVERAF VMASNIAASV ANQQFDVIDV PDYEQWGMFL RPALQHYRVN FSKIALSMHG KISKTLRLDW FDFGKDNIPL DLQEKMQYKT VDMRYGISKS YLDEWREISD LESHYYHPLH FLDLPKPTRN LPSEAPPSLN FIGRTEKRKG PDIFIDLVWW LPRSSYSKAQ LIGPHSYNYN NTQSSESILQ SMVKNRQKNI AMRPPMKREE LAELFASKSV TFLPSRYDTL NLVALESLFS GCPTAIGNGA GICRFLTESF PEVPFINIDI NDIYSSLPAI GQVLDNYEDY RRKLVDTLLS INLEVTDPIL EEIYNQSVVS DGETQQELEQ WYSQLMTYWE DGQQDFGIYK IPGVKLVKSQ LKSQLKPTYK QFKATLNDVK AQLIKPLEDS RNAQTVKASK LVSRYKATFN ASELNQKDLT YKVKECWRLG SAFGEYVSAF QESANLSSLK DKLSNGYQID RIRLWREIAR IEELRGNDLV SATYKLRGIR LLNNDIFGDL SFVLRTLETK GFTREAQAVE AIYGKPAERN ERCQELIEQS LRDNAVNPER EYEFIDDRRQ KSTYRVSIIV SLYNAADKLS LFLKALQHQT LINKGEAEII LVDSGSPGDE YAVFKQLEPK LNIPIIYARS QQRETIQTAW NRGISLSKAP YLCFLGVDEN ILPDCLEVLS KELDKDPQLD WVIGHSLVTN VDKQGSWVSD IMPYYRKEYK QDLVYLETCY LSWVGALYRR SIHERFGYYD GSFRGAGDTE FKNRVLPFIK SKVVDRTLGV FWNYPDERTT QSPLAELEDM RAWYLHRTLP GIRYAFSNRK VEELEQLIYL SLCYRKSYCT HTSSDLEYAY NLSLYLKEIA PNSQALKYFK GIETLLNAYR GLDWMPKLSR FSPIAKVLET RKIAHTIQQE HQTSWNKEMS LGLVPNYKIF NDNRHEQHSV LWLTEVPKN
|
| |