Gene PCC8801_0352 details


Gene Information

Locus tag: PCC8801_0352
Symbol:
ID: 7104068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 344304
End bp: 345767
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 34%
IMG OID: 643473461
Product: Carotenoid oxygenase
Protein accession: YP_002370606
Protein GI: 218245235
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
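
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. The function names below are illustrative and not part of any database API; the sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(344304, 345767)
print(length)                      # 1464
print(protein_length_aa(length))   # 487
```

Both results agree with the Gene Length and Protein Length fields listed in the record.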


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAA GAAAGTTTTT AATTCAATCT TCTCTGGCTG CCGTTGCTGC GATTACTTGG 
GATCATTTAC CCAGTCAATC CACTCCTGTT TATAAGCATT ATAAGCATTA TAAAAGCTGG
AAAAGTGATA ATAGTTTTCT CAAAGGAATT AATGAACCTG TTTTTGCTGA AATCGAGCTT
GATAACTTAA AAATATCAGG AACAATTCCT CAGCAATTAC AAGGAATGTA CGTCCGTAAC
GGACCCAATC CTATGTTTAA ACCTACCTCC TATAACTATC CCCTAGAAGG GGATGGAATG
GTTCATGGAA TGTATTTTGA TCAAGGAAAA ATTGGCTACA AAAATCGCTG GATACAAACC
CGTGGACTAG CTTATGAAAG GTTTGAAGGC AAGGAACTAG CTGAATTAAA ATTTAAAAAC
TATGCAAATA CTAATATTAT TGGCTACGGG GATAAACTTT TAGCTTTATA TGAAGTGGGT
TTACCCTATC AAATGGATAA AAATTTAGAA ACCATTGGAG AATGGAATTT CCAAGGAAAA
TTAGAACAGT CAATGACAGC GCATCCGAAA TTTGATCCTG AAACGGGAGA ATTACACTTT
TATCAATATT CATTTTTTAA TACGCCCTAT TTGCATTATT ATATTGCCAA TCAAAAAGGA
GAGATAGTTA GAAAATCTCC CATTGAAATT GCGAATCCTG TGTTAATTCA TGATATGGTT
CTTACCAAGA ATTACGCCAT ATTTTTTGAC TGTCCTTTAG TCTTTAATAT GTCAAAAGCT
AAAGCGAATA AAACCCCTTT TATGTGGCAA CCCGAAGCAG GAACAAAGAT TATTTTAGTT
GATCGTCATA ATCCCAATAA AAAGCCGATT TATCTGAAAA CAGACGCTTT TTGGGTGTGG
CATTTCATGA ATGGATTTGA AGAAAACAAT AAAATTATTA TTGATTTTGT TTACTATCCT
AGTATTAACA TGGAAAGTCA TTGGCAAGCC ATGTTATCCA ATAAATCTAA CTTACAAAGA
ATAATTATTG ATCAAAAAAC ACATCAAATC GTCTCAGAGA AATTAGGAGA TCACTATGTT
GATTTTCCTA GCATCAATAC TCAAAAGCTA GGACAAACCT ATCGTTTTGG CTATACTCCT
CTGATTGATA CTGAATTATT ATCTCAGAAA AAAAGTCCGA ACTATTTCCC TTCCTTAATT
CAATATGATG TCATGAATAA AACCCATAAA ATTCATCAAT TTAAACCAGG ATGCTACGGA
GGAGAACCCG TTTTTATTCC TAACCCTAAC TCCCAATCGG AGTTAGATGG TTATGTAGCC
ACTTTTGTTT ATAATGAAAA TACCAACACC AGTGATTTTG TAATGCTTGA TCCCGCTAAT
TTTGAAAGTG AACCCATTGC AACAGTTCAT TTGCCTGTTC GGGTTCCTAG TGGTTTTCAT
GGCAATTGGA TCACTGATAG CTAA
 
Protein sequence
MNRRKFLIQS SLAAVAAITW DHLPSQSTPV YKHYKHYKSW KSDNSFLKGI NEPVFAEIEL 
DNLKISGTIP QQLQGMYVRN GPNPMFKPTS YNYPLEGDGM VHGMYFDQGK IGYKNRWIQT
RGLAYERFEG KELAELKFKN YANTNIIGYG DKLLALYEVG LPYQMDKNLE TIGEWNFQGK
LEQSMTAHPK FDPETGELHF YQYSFFNTPY LHYYIANQKG EIVRKSPIEI ANPVLIHDMV
LTKNYAIFFD CPLVFNMSKA KANKTPFMWQ PEAGTKIILV DRHNPNKKPI YLKTDAFWVW
HFMNGFEENN KIIIDFVYYP SINMESHWQA MLSNKSNLQR IIIDQKTHQI VSEKLGDHYV
DFPSINTQKL GQTYRFGYTP LIDTELLSQK KSPNYFPSLI QYDVMNKTHK IHQFKPGCYG
GEPVFIPNPN SQSELDGYVA TFVYNENTNT SDFVMLDPAN FESEPIATVH LPVRVPSGFH
GNWITDS
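
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that NCBI translation table 11 assigns sense codons the same way as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical, written here for illustration only; it strips the whitespace used to group the sequence into blocks of ten and stops at the first stop codon.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop spaces and line breaks
    usable = len(seq) - len(seq) % 3     # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":               # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 24 nucleotides of the gene sequence above.
print(translate("ATGAATAGAA GAAAGTTTTT AATTCAATCT"))  # MNRRKFLIQS
print(translate("GGCAATTGGA TCACTGATAG CTAA"))        # GNWITDS
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.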