Gene PCC8801_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1643 
Symbol 
ID7102386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1718125 
End bp1719687 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content48% 
IMG OID643474714 
ProductWD-40 repeat protein 
Protein accessionYP_002371850 
Protein GI218246479 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTCA AACAATTGCA CTGGCTTGAA GTGGCTGAAT GTTTAGCCTT GGGACTGGCG 
GTGGTTTTCC TCTTGGTGGC TATCGCGTCC CAAGATTGGT TATTACCCTT GGTTTTTTTG
ACCTTTGGGT TGATTTTTAA TAGCCTTAAC CGCATTCGTT GGCAATATCT GATGCGCCAG
CGACTTTCAG CGACAACTAA GCAACTCAAG CAACAAATTG CCCAAGAACT CGAAAAGATT
CAGGTTTCTG CTCCCGTCAC CCCCGCAGAA GAAACAGAAA GCCGCGCGAT CGCCCGTCTA
CAAGACAATT GCGTCAGTCT GGAGCAATCT CTCAATAGTG TGGTGCAATA CCTCAATACC
CAAGTGTTAC CCGAAAGAAT CGAACATTTG GAAAAAGCCT ACCTACAATT GAGTCAAGAT
CTGCGGACAC TGACTCGCCA AGTTGAAGAT CCTTCCGTCG AACCTCTTGC CCCTTCACCG
CCTGAATTGG AACCCTTTGA CATTTCTTCA GCCACAGGGC TCAATGTTTC CCAAGTAACA
ACCCAACTTC CCATCGTTGC GCCGACTTCC TTTGTTTCTC AGTCTCCAAG TATCCCGACT
TGGCAAGAAC TTGACCCCCT GATGGCTCAT GATGATGCGG TTAGTTGTTT GGCCATCAGT
CCTGACGGAC AATGGCTAGT CAGTGGCAGT TGGGATCAGA CGTTACGGGT TTGGGACTTA
GCTACCCGAA CCCTAAAAGC TCAAGTGAGT GCCCATTATC AGGGGTTACT CGCGGTGGTG
GTTGTTCCCA TACAAGCCTC TGGAACGGGT TATCGGATTG TCACAGGGAG TTTTGACCAT
ACGATTAAGG TCTGGTTAGC GGATACGGAA GATCCTGAGC ATTTGACCCT AACGATTGAG
GAGACATTAA CCCAACATAC GGGCTCGGTG CAATCTTTGG CTTTATCCTA CAACCCCTTA
TTATTGGTCA GTGGTAGTTA CGATCAGACG GTGAAACAAT GGCAATTATT AACAGGGGAG
ATGGTGTGTA GTTCCTATGA TCCTTTGGGG GCAATTTATG CGATCGCAGT TGATACGTCT
CAGGAGTTAA TTGCTAGTGC CGGAGGAGAT GGCCGAGTCA CGCTTTGGAA ATTGGGGACA
GGCGAACAAA TTGGCTTTTT AGCGGGGAAT GTCTCTTCAG TGGAGTCTTT GGCTTTTAGC
CCCGATGGAG AAACCTTAGC GGCCGGTTGT GTCGATGGGA CAATTAAATT GTGGCAACTC
GATGCTAGTC GTTTTGGGGC TGGTCGTCCG TTGCAACCTG TTCGCATCTT GGAAGCTCAT
AATGGTCAAG TTAAGGCGCT CCTGTTCAAT GGTGAGGAGC AAATTCTCTT TAGTGGGGGA
GCCGATGGTT ATGTGAAAAT TTGGCATCCG AGTCGCCGGG AGGCGATCGC AGTTCTGGGG
GTTAATGAGG GTGCTGAATC CGGCCGTAGT TCGATTTTAT CCTTGGCTTT AAGCGATGAT
AGTTACTTAT TAATCGCTGG AACGGCTGAT GGCATAATTC AAATTTGGAG AAAAACCGAT
TGA
 
Protein sequence
MNFKQLHWLE VAECLALGLA VVFLLVAIAS QDWLLPLVFL TFGLIFNSLN RIRWQYLMRQ 
RLSATTKQLK QQIAQELEKI QVSAPVTPAE ETESRAIARL QDNCVSLEQS LNSVVQYLNT
QVLPERIEHL EKAYLQLSQD LRTLTRQVED PSVEPLAPSP PELEPFDISS ATGLNVSQVT
TQLPIVAPTS FVSQSPSIPT WQELDPLMAH DDAVSCLAIS PDGQWLVSGS WDQTLRVWDL
ATRTLKAQVS AHYQGLLAVV VVPIQASGTG YRIVTGSFDH TIKVWLADTE DPEHLTLTIE
ETLTQHTGSV QSLALSYNPL LLVSGSYDQT VKQWQLLTGE MVCSSYDPLG AIYAIAVDTS
QELIASAGGD GRVTLWKLGT GEQIGFLAGN VSSVESLAFS PDGETLAAGC VDGTIKLWQL
DASRFGAGRP LQPVRILEAH NGQVKALLFN GEEQILFSGG ADGYVKIWHP SRREAIAVLG
VNEGAESGRS SILSLALSDD SYLLIAGTAD GIIQIWRKTD