Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1507 |
Symbol | |
ID | 7105378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 1580784 |
End bp | 1582220 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643474581 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002371718 |
Protein GI | 218246347 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AGGGTAAAAG TTCAATTATT GCTTTAAATA ATCCAGATCA TGGCTTAAGT TCCTATATTG CGCCAGCAAA AGACCTATTT GTTCCCAACA AAAAGGGGGA TTATCTTGAA GATATTCGCC GTAAAATTAC CCCTACTTGG TTATTAATTC CTCTGGTAAC ATTGGGGATG ACAGTAGTAG GCGCAATCTG GACAGTTAAA CAACCAGCAA TTTATCAAGG AAGGTTTCAA TTATTACTTG ATAATCCTCT AACACCTGTA AATAAAGAAC GAGAAGACAC GAAAATTGAC TATGCTACTC AAGTCCAAGT GTTGCAAAGT TCTAGTGTTT TAAAGCCCAT TTTAAAGCAA GTAGAAGCTC AATATTCCGA CCTAGACTAC TCAACTTTAA TAACAGGAAA AGAATCCCCT TTGACGATTG AACAACTCTC AGGAACGAAA ATTATTGAAG TGACTTTTAC TAATACTAAC CCCCGAAAAA TAGAAGGACT TTTAGATCAT TTAGCACAAT CTTATCTGAA TTATAACCAA AAACAAACCA CGGTTAAAAA ACCCGCAGAA ATGACATTTG TTAACCAACA GTTAGCCCAA TTACAACAAC AGATTAGTCA ACGTCAGCAA CAGTTGGAAA AGTTACGTCA GGAACATAAT TTCCTCAACC CGCAACAAAA ATCCCAGGAA CTGTCCCAAT TATTGCAACA GTTACAAGCT TTAGACTTTG AAACGCAGGT TAAGCGCAAA GAAACTGAAG CTATTTATGA TCTATTGCAA CAAAAGCTAG AATTATCCCC CCAAGAAGCA TTAGCTGCTA GTATTTTAAG TGAATCTCCC CGTTATCAAG CGATTCTCAA TGAACTTCAG AACGTAGAAG TTGAACTAGC CAAGGAATCT GCTAGATTTT TAGAAGATAG TCCAGTGATT CAAGGAATTA AAGATAAAAA AGCCAATTTA TTATTACTAT TAGAGCAAGA AGCCCAAAAA AACCTAGGTA ATCAGGCAAA TACTGACATT TCCTTACAAT CTTCCGGGGT TTCTCCGAGT AGTTTACGGT TATCTCTTCA GCAGCAACTC GTTGAAACCG AGAGTCAAAT GGCGGTTTTA AGGGTAAGAC AGACGGCTAT TAACGAAGAA ATTCAGGCAG TTAAAGCTAA AATAGCAGAA ATGCCCCTTT TAGAGCGTCA ATATACTAAT ATACAGCGAG AATTAACGAT TGCTACCGAA AATTTCAACC GTTTAATGGC AACGTCTCAA CAAATGCAAC TAGAAGCAGC GAGTCAAAAA ACAGTTTCTT GGCAATTAAT TAGTCCTCCT GAAGTCAAAC AAATGCCAAT TTATTCTCAA CTGATTCAAA ATATGAGTGT AGGGGCAATT TTCGGATTAT TATTAGGAAT AGTCATGGCA AATATTCCAA TAAAAAATGA ACAGTGA
|
Protein sequence | MKNKGKSSII ALNNPDHGLS SYIAPAKDLF VPNKKGDYLE DIRRKITPTW LLIPLVTLGM TVVGAIWTVK QPAIYQGRFQ LLLDNPLTPV NKEREDTKID YATQVQVLQS SSVLKPILKQ VEAQYSDLDY STLITGKESP LTIEQLSGTK IIEVTFTNTN PRKIEGLLDH LAQSYLNYNQ KQTTVKKPAE MTFVNQQLAQ LQQQISQRQQ QLEKLRQEHN FLNPQQKSQE LSQLLQQLQA LDFETQVKRK ETEAIYDLLQ QKLELSPQEA LAASILSESP RYQAILNELQ NVEVELAKES ARFLEDSPVI QGIKDKKANL LLLLEQEAQK NLGNQANTDI SLQSSGVSPS SLRLSLQQQL VETESQMAVL RVRQTAINEE IQAVKAKIAE MPLLERQYTN IQRELTIATE NFNRLMATSQ QMQLEAASQK TVSWQLISPP EVKQMPIYSQ LIQNMSVGAI FGLLLGIVMA NIPIKNEQ
|
| |