Gene Cyan8802_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4002 
Symbol 
ID8393352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4119120 
End bp4120250 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content47% 
IMG OID644981923 
Productmonooxygenase FAD-binding 
Protein accessionYP_003139637 
Protein GI257061749 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGA TCATCATTAT CGGGGGTGGA ATTGGGGGCA CTGCAACTGC GCTTGCTCTG 
AATCAAGCAG GTTTTGAGCC TGTCGTTTAT GAGCGCACCC AGGTCTTGCG GGAAGTCGGT
GCTGGAATTG CACTGTGGGC AAACGCGACT CACATCTTGA AGAAGTTAGG ATTATTGGAA
ACAGCGATTC AGGTTGGCTG TCTCACCACC AATTATCAAT TCAACTCCCA ACGTGGCAAA
GAGCTAGTTA ACATCGAGAT CGATGGTTTT GAGTTACCTG TTGTGGCCAT TCATCGCGCT
GAATTGCATC AACTTCTGTG GCGTAATGTA CCTGGAGAAA AATTTCACTT GGGAGAAACG
TTTGAACGAT TTGAGCACCA GCATGATCGG GTTCATGCCT ATTTTGTCTC TGGATTAGAA
GTCGAAGGGG ATGGATTAAT CGGTGCAGAT GGATTGCGTT CACGAGTCAG AGCTACTCTT
TTAGGCGATA CTCCTCCCAC ATACCGGAAT TTCAAAACTT GGCGAGGGTT GACTGATTAC
GTCCCGAGTA ATTATCGGCC GGGTTACATT CAGGAGTTTT TAGGTGGTGG TAAAGGTTTT
GGCTTCATGA TGCTGGGCAA AGGAAAAATG TATTGGTATG CCGCAGCTAC CGCACCTGAA
GCACAACCGG ATGCAGTGTT CGGGCGCAAA CAGGAACTTG AGACAATGTA TCAAGACTGG
TTTTCAGCGA TTCCTGAATT GATTGCAGCA ACGGATGAGG CAAATATCTT GACCACGGAT
CTTTACGATC GCCCTCCGAC TCAACCTTGG AGCAAAGGCA ATATTACCCT TTTAGGCGAC
GCTGCTCACC CAATGTTACC CACAATGGGA CAAGGAGCTT GTACCGCTTT AGAAGATGCG
TATGTTGTTG CAAAATGCTT AGAAGAAAAT TCTGATCCGA TCGCTGCATT TCAACGCTAT
GAAGATCTAC GATTTCCTCG CACCAAAGCA ATCGTTGAAC AGTCTTTACG ATCTCGGAAG
ATGGGTGAAT TGAAGAATCC CTTCGCTGTT AGTCTCCGTA ATACTTCGAT GAAAATCATG
GGTTCAGCAA TCAGCAGCAG CTTTAAATCT CTTCATGCTT ACCGAGCCTA G
 
Protein sequence
MRKIIIIGGG IGGTATALAL NQAGFEPVVY ERTQVLREVG AGIALWANAT HILKKLGLLE 
TAIQVGCLTT NYQFNSQRGK ELVNIEIDGF ELPVVAIHRA ELHQLLWRNV PGEKFHLGET
FERFEHQHDR VHAYFVSGLE VEGDGLIGAD GLRSRVRATL LGDTPPTYRN FKTWRGLTDY
VPSNYRPGYI QEFLGGGKGF GFMMLGKGKM YWYAAATAPE AQPDAVFGRK QELETMYQDW
FSAIPELIAA TDEANILTTD LYDRPPTQPW SKGNITLLGD AAHPMLPTMG QGACTALEDA
YVVAKCLEEN SDPIAAFQRY EDLRFPRTKA IVEQSLRSRK MGELKNPFAV SLRNTSMKIM
GSAISSSFKS LHAYRA