Gene PCC8801_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3957 
Symbol 
ID7105822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4145143 
End bp4146273 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content47% 
IMG OID643476954 
Productmonooxygenase FAD-binding 
Protein accessionYP_002374055 
Protein GI218248684 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGA TCATCATTAT CGGGGGTGGA ATTGGGGGCA CTGCAACTGC GCTTGCTCTG 
AATCAAGCAG GTTTTGAGCC TGTCGTTTAT GAGCGCACCC AGGTCTTGCG GGAAGTCGGT
GCTGGAATTG CACTGTGGGC AAACGCGACT CACATCTTGA AGAAGTTAGG ATTATTGGAA
ACAGCGATTC AGGTTGGCTG TCTCACCACC AATTATCAAT TCAACTCCCA ACGAGGCAAA
GAGCTAGTTA ACATCGATCT CGATGGTTTT GAGTTACCTG TTGTGGCCAT TCATCGCGCT
GAATTGCATC AACTTCTGTG GCGTAATGTA CCTGGAGAAA AATTTCACTT GGGAGAAACG
TTTGAACGAT TTGAGCACCA GCATGATCGG GTTCATGCCT ATTTTGTCTC TGGATTAGAA
GTCGAAGGGG ATGGATTAAT CGGTGCAGAT GGATTGCGTT CACGAGTCAG AGCTACTCTT
TTAGGCGATA CTCCTCCCAC ATACCGGAAT TTCAAAACTT GGCGAGGGTT GACTGATTAC
GTCCCGAGTA ATTATCGGCC GGGTTACATT CAGGAGTTTT TAGGTGGTGG TAAAGGTTTT
GGCTTCATGA TGCTGGGCAA AGGAAAAATG TATTGGTATG CCGCAGCTAC CGCACCTGAA
GCACAACCGG ATGCAGTGTT CGGGCGCAAA CAGGAACTTG AGACAATGTA TCAAGACTGG
TTTTCAGCGA TTCCTGAATT GATTGCAGCA ACGGATGAGG CAAATATCTT GACCACGGAT
CTTTACGATC GCCCTCCGAC TCAACCTTGG AGCAAAGGCA ATATTACCCT TTTAGGCGAC
GCTGCTCACC CAATGTTACC CACAATGGGA CAAGGAGCTT GTACCGCTTT AGAAGATGCG
TATGTTGTTG CAAAATGCTT AGAAGAAAAT TCTGATCCGA TCGCTGCATT TCAACGCTAT
GAAGATCTAC GATTTCCTCG CACCAAAGCA ATCGTTGAAC AGTCTTTACG ATCTCGGAAG
ATGGGTGAAT TGAAGAATCC CTTCGCTGTT AGTCTCCGTA ATACTTCGAT GAAAATCATG
GGTTCAGCAA TCAGCAGCAG CTTTAAATCT CTTCATGCTT ACCGAGCCTA G
 
Protein sequence
MRKIIIIGGG IGGTATALAL NQAGFEPVVY ERTQVLREVG AGIALWANAT HILKKLGLLE 
TAIQVGCLTT NYQFNSQRGK ELVNIDLDGF ELPVVAIHRA ELHQLLWRNV PGEKFHLGET
FERFEHQHDR VHAYFVSGLE VEGDGLIGAD GLRSRVRATL LGDTPPTYRN FKTWRGLTDY
VPSNYRPGYI QEFLGGGKGF GFMMLGKGKM YWYAAATAPE AQPDAVFGRK QELETMYQDW
FSAIPELIAA TDEANILTTD LYDRPPTQPW SKGNITLLGD AAHPMLPTMG QGACTALEDA
YVVAKCLEEN SDPIAAFQRY EDLRFPRTKA IVEQSLRSRK MGELKNPFAV SLRNTSMKIM
GSAISSSFKS LHAYRA