Gene PCC8801_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1507 
Symbol 
ID7105378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1580784 
End bp1582220 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content36% 
IMG OID643474581 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_002371718 
Protein GI218246347 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AGGGTAAAAG TTCAATTATT GCTTTAAATA ATCCAGATCA TGGCTTAAGT 
TCCTATATTG CGCCAGCAAA AGACCTATTT GTTCCCAACA AAAAGGGGGA TTATCTTGAA
GATATTCGCC GTAAAATTAC CCCTACTTGG TTATTAATTC CTCTGGTAAC ATTGGGGATG
ACAGTAGTAG GCGCAATCTG GACAGTTAAA CAACCAGCAA TTTATCAAGG AAGGTTTCAA
TTATTACTTG ATAATCCTCT AACACCTGTA AATAAAGAAC GAGAAGACAC GAAAATTGAC
TATGCTACTC AAGTCCAAGT GTTGCAAAGT TCTAGTGTTT TAAAGCCCAT TTTAAAGCAA
GTAGAAGCTC AATATTCCGA CCTAGACTAC TCAACTTTAA TAACAGGAAA AGAATCCCCT
TTGACGATTG AACAACTCTC AGGAACGAAA ATTATTGAAG TGACTTTTAC TAATACTAAC
CCCCGAAAAA TAGAAGGACT TTTAGATCAT TTAGCACAAT CTTATCTGAA TTATAACCAA
AAACAAACCA CGGTTAAAAA ACCCGCAGAA ATGACATTTG TTAACCAACA GTTAGCCCAA
TTACAACAAC AGATTAGTCA ACGTCAGCAA CAGTTGGAAA AGTTACGTCA GGAACATAAT
TTCCTCAACC CGCAACAAAA ATCCCAGGAA CTGTCCCAAT TATTGCAACA GTTACAAGCT
TTAGACTTTG AAACGCAGGT TAAGCGCAAA GAAACTGAAG CTATTTATGA TCTATTGCAA
CAAAAGCTAG AATTATCCCC CCAAGAAGCA TTAGCTGCTA GTATTTTAAG TGAATCTCCC
CGTTATCAAG CGATTCTCAA TGAACTTCAG AACGTAGAAG TTGAACTAGC CAAGGAATCT
GCTAGATTTT TAGAAGATAG TCCAGTGATT CAAGGAATTA AAGATAAAAA AGCCAATTTA
TTATTACTAT TAGAGCAAGA AGCCCAAAAA AACCTAGGTA ATCAGGCAAA TACTGACATT
TCCTTACAAT CTTCCGGGGT TTCTCCGAGT AGTTTACGGT TATCTCTTCA GCAGCAACTC
GTTGAAACCG AGAGTCAAAT GGCGGTTTTA AGGGTAAGAC AGACGGCTAT TAACGAAGAA
ATTCAGGCAG TTAAAGCTAA AATAGCAGAA ATGCCCCTTT TAGAGCGTCA ATATACTAAT
ATACAGCGAG AATTAACGAT TGCTACCGAA AATTTCAACC GTTTAATGGC AACGTCTCAA
CAAATGCAAC TAGAAGCAGC GAGTCAAAAA ACAGTTTCTT GGCAATTAAT TAGTCCTCCT
GAAGTCAAAC AAATGCCAAT TTATTCTCAA CTGATTCAAA ATATGAGTGT AGGGGCAATT
TTCGGATTAT TATTAGGAAT AGTCATGGCA AATATTCCAA TAAAAAATGA ACAGTGA
 
Protein sequence
MKNKGKSSII ALNNPDHGLS SYIAPAKDLF VPNKKGDYLE DIRRKITPTW LLIPLVTLGM 
TVVGAIWTVK QPAIYQGRFQ LLLDNPLTPV NKEREDTKID YATQVQVLQS SSVLKPILKQ
VEAQYSDLDY STLITGKESP LTIEQLSGTK IIEVTFTNTN PRKIEGLLDH LAQSYLNYNQ
KQTTVKKPAE MTFVNQQLAQ LQQQISQRQQ QLEKLRQEHN FLNPQQKSQE LSQLLQQLQA
LDFETQVKRK ETEAIYDLLQ QKLELSPQEA LAASILSESP RYQAILNELQ NVEVELAKES
ARFLEDSPVI QGIKDKKANL LLLLEQEAQK NLGNQANTDI SLQSSGVSPS SLRLSLQQQL
VETESQMAVL RVRQTAINEE IQAVKAKIAE MPLLERQYTN IQRELTIATE NFNRLMATSQ
QMQLEAASQK TVSWQLISPP EVKQMPIYSQ LIQNMSVGAI FGLLLGIVMA NIPIKNEQ