Gene PCC8801_2470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2470 
Symbol 
ID7105518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2555000 
End bp2556187 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content43% 
IMG OID643475512 
ProductPBS lyase HEAT domain protein repeat-containing protein 
Protein accessionYP_002372635 
Protein GI218247264 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCCG CTAGAATTGA CGCGGCGAAA GAAACACGGG TCAACCAAGA CTTAAAGAAA 
CTGGTTTTAA AGACACAAGA TCTTCCAGGG AAAGCTTTGG GGACAACTGA AACAGACCAC
ATTCTCAACC TTGCTTTAGC AGTTCTTCAC GAAGGGGATT TTCAACAACG GTGGGAAGTG
GCTAAAATTT GGCCGAAACT AGGAAAAAAA TCGATCGCTC CCTTATTAAC GATTCTCGCT
GATGAAGAGG CTGATGTAGA AATTCGTTGG TTTGTCGGGC GGATTTTGGG GGAATTTGAT
GATCCCCAAG TCATTATGGC TTTAACCAAC CTCCTTCAAG TCACGGAAGA AGAAGAATTG
TCTATGATGG CAGCTTCAAC CCTAGCTAAA ATTGGAGAGG GTGCCATTGA AAGCTTAAGC
AAGTTATTGC TCGAGCCGTC GTTAAGAATG ACAGCAGTGC ATTCTTTAGC CCAAATTCGT
CATTCTCAAA CGATTTTACC CCTATTAACA GTGATCGATG ATCCGAATCC TCAAGTCCGT
GCTACGGTAA TAGAAGCTTT AGGCAGCTTT CATGATGAGC ATCTCGTCAC TTTTTTGCTA
AAAGGGTTGA AAGATCCCAC TGCTAGGGTT CGTAAAGAAG CGGTCATTGC CTTGGGAATG
CAACATCAAT TCAAGAAAAA ATTTGGGTTG GTGGAGTACC TTAAACCATT ACTTTACGAT
CACGATCCTC AAGTTTGTCA ACAGGCAATT ATTGCTTTAG GACGTATGGC AGATAATTCA
GCAGCAGAGG CATTATTTAT CTTGCTTAAG TCCCCTGCTA CCCCTAATAT GATGAAAAAA
GAGGTGATAC GAGCTTTAAG TTGGATAGAA ACCCCTCAAG CGTTGGTGTA TTTGCAAGAA
GGACTGCGTT GGGGAAATCT GAAAGTTTGC GAAGAAATTA TCAGTGCCTT GGGACGAGAG
CAAAGACCTC AATTAAAAAC TCAGGCTACC CAAATTTTGC TCAACTTTAT TGATTCTGAG
CAAGAAGCAA CAAGAGAACC CCAGATTAAA CAAAGTGTTG CTATGGCTCT CGGAGAATTA
CGCGATCCTA TCTCTGTTGA GGTTTTAGAG AATCTAACAA CTGATCCCGA TCAAGGAGTC
CGTCTTCATG CGATCGCAGC GTTACGCAAA TTTTCGGACG TGAATTAG
 
Protein sequence
MEAARIDAAK ETRVNQDLKK LVLKTQDLPG KALGTTETDH ILNLALAVLH EGDFQQRWEV 
AKIWPKLGKK SIAPLLTILA DEEADVEIRW FVGRILGEFD DPQVIMALTN LLQVTEEEEL
SMMAASTLAK IGEGAIESLS KLLLEPSLRM TAVHSLAQIR HSQTILPLLT VIDDPNPQVR
ATVIEALGSF HDEHLVTFLL KGLKDPTARV RKEAVIALGM QHQFKKKFGL VEYLKPLLYD
HDPQVCQQAI IALGRMADNS AAEALFILLK SPATPNMMKK EVIRALSWIE TPQALVYLQE
GLRWGNLKVC EEIISALGRE QRPQLKTQAT QILLNFIDSE QEATREPQIK QSVAMALGEL
RDPISVEVLE NLTTDPDQGV RLHAIAALRK FSDVN