Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1534 |
Symbol | |
ID | 8390846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 1573834 |
End bp | 1575270 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644979531 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_003137281 |
Protein GI | 257059393 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.16836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATA AGGGTAAAAG TTCAATTATT GCTTTAAATA ATCCAGATCA TGGCTTAAGT TCCTATATTG CACCAGCAAA AGACCTATTT GTTCCCAACA AAAAGGGGGA TTATCTTGAA GATATTCGCC GTAAAATTAC CCCTACTTGG TTATTAATTC CTCTGGTAAC ATTGGGGATG ACAGTAGTAG GCGCAATCTG GACAGTTAAA CAACCAGCAA TTTATCAAGG AAGGTTTCAA TTATTACTTG ATAATCCTCT AACACCTGTA AATAAAGAAC GAGAAGACAC GAAAATTGAC TATGCTACTC AAGTCCAAGT GTTGCAAAGT TCTAGTGTTT TAAAGCCCAT TTTAAAGCAA GTAGAAGCTC AATATTCCGA CCTAGACTAC TCAACTTTAA TAACAGGAAA AGAATCCCCT TTGACGATTG AACAACTCTC AGGAACGAAA ATTATTGAAG TGACTTTTAC TAATACTAAC CCCCGAAAAA TAGAAGGACT TTTAGATCAT TTAGCACAAT CTTATCTGAA TTATAACCAA AAACAAAACA CGGTTAAAAA ACCCGCAGAA ATGACATTTG TTAACCAACA GTTAGCCCAA TTACAACAAC AGATTAGTCA ACGTCAGCAA CAGTTGGAAA AGTTACGTCA GGAACATAAT TTCCTCAACC CGCAACAAAA ATCCCAGGAA CTGTCCCAAT TATTGCAACA GTTACAAGCT TTAGACTTTG AAACGCAGGT TAAGCGCAAA GAAACTGAAG CTATTTATGA TCTATTGCAA CAAAAGCTAG AATTATCCCC CCAAGAAGCA TTAGCGGCTA GTATTTTAAG TGAATCTCCC CGTTATCAAG CGATTCTCAA TGAACTTCAG AACGTAGAAG TTGAACTAGC CAAGGAATCT GCTAGATTTT TAGAGGATAG TCCAGTGATT CAAGGAATTA AAGATAAAAA AGACAATTTA TTATTACTGT TAGAGCAAGA AGCCCAAAAA AACCTAGGTA ATCAGGCAAA TACTGACATT TCCTTACCCT CTTCCGGGGT TTCTCCCAGT AGTTTACGGT TATCTCTTCA GCAGCAACTC GTTGAAACCG AGAGTCAAAT GGCGGTTTTA AGGGTAAGAC AGACGGCTAT TAACGAAGAA ATTCAGGCAG TTAAAGCTAA AATAGCAGAA ATGCCCCTTT TAGAGCGTCA ATATACTAAT ATACAGCGAG AATTAACGAT TGCTACCGAA AATTTCAACC GTTTAATGGC AACGTCTCAA CAAATGCAAC TAGAAGCAGC GAGTCAAAAA ACAGTTTCTT GGCAATTAAT TAGTCCTCCT GAAGTCAAAC AAATGCCAAT TTATTCTCAA CTGATTCAAA ATATGAGTGT AGGGGCAATT TTCGGATTAT TATTAGGAAT AGTCATGGCA AATATTCCAA TAAAAAATGA ACAGTGA
|
Protein sequence | MKNKGKSSII ALNNPDHGLS SYIAPAKDLF VPNKKGDYLE DIRRKITPTW LLIPLVTLGM TVVGAIWTVK QPAIYQGRFQ LLLDNPLTPV NKEREDTKID YATQVQVLQS SSVLKPILKQ VEAQYSDLDY STLITGKESP LTIEQLSGTK IIEVTFTNTN PRKIEGLLDH LAQSYLNYNQ KQNTVKKPAE MTFVNQQLAQ LQQQISQRQQ QLEKLRQEHN FLNPQQKSQE LSQLLQQLQA LDFETQVKRK ETEAIYDLLQ QKLELSPQEA LAASILSESP RYQAILNELQ NVEVELAKES ARFLEDSPVI QGIKDKKDNL LLLLEQEAQK NLGNQANTDI SLPSSGVSPS SLRLSLQQQL VETESQMAVL RVRQTAINEE IQAVKAKIAE MPLLERQYTN IQRELTIATE NFNRLMATSQ QMQLEAASQK TVSWQLISPP EVKQMPIYSQ LIQNMSVGAI FGLLLGIVMA NIPIKNEQ
|
| |