Gene PCC8801_2626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2626 
Symbol 
ID7105847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2715869 
End bp2716939 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content47% 
IMG OID643475667 
Productmonooxygenase FAD-binding 
Protein accessionYP_002372786 
Protein GI218247415 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCATTA TCGGTGGGGG TCCGGCCGGG TTAGCAACGG CGATCGCTCT CACCGATCTT 
GACATCAATT CTATCGTGAT CGAGAGCAGC CATTATCTTG ACCCTCGTTT GGGAGAACAT
CTGACCCCGG TCGGGGTGGG AATCTTAAAA CAACTAGGGA TCTGGGATAG CCAATTTCTA
GAAAAACACC GTTTATGTTA TGGGGTGCGT TCTGCTTGGG GAGAGACTCA AGTGACCTAC
AGTGACTACC TCTTTCATCC CGATGGTACG GGAGTTAATT TGAGTCGTCC CACCTTTGAC
CGCAATTTAG CAACGTTAGC GGATGGTAAG GGGGTTCGTT TGTTGCTCTC AAGTCAACTC
AAACAGGCTC AACAGGAACA GAACGGATGG ATACTTTCTC TCGACACTCC AAAGGGTCTT
CAGGAGGTAA GAGCTAGAGT GGTTGTGGAT GCGAGTGGAC GCAAGGCTTT ATTTGCTAGG
AGTCAGGGTC GAACTTCTGT CTATTGCGAT CGCTTGGTGG GTATTGCTGC TTTTTTAGAG
CCTTTGGCAG AAAATCATGA TCAGGAGGAA ACCTTGTTGC TCGAATCGGG AGAGTTTGGC
TGGTGGTACT TTGCCCGTCT TCAGGATAAT AGGGGGGTTT TTTTGCATAT AACGGATGCT
GATCAACTTG AGTCCAGAAA AGATGCTCCT CTGCAAACGT GGTCAAAACG GCTAAAATCA
ACTAACTTTT TCTCGGAACT GGCTGGTTAT TATCATCCTG TTGAAAAGGT TCTGGTGCGA
TCGGCTCGTA GTCATTGTCT TGATCAAGCA ACAGGTCATC ATTGGCTGGC TGTGGGGGAT
GCTGCCATGA GTTTTGATCC CTTATCGTCT ATGGGGATTA CTAAAGCTTT AAAGGCTGGT
ATTTTTTCGA GTCAAGTCAT TTTAAGGGTT TTGAATGGGG AAACAACGGT TCTGAAAGAC
TATGAGGCAG AAATTCAGCA ACAATTTAAC GAATATCTCC AGATTCGCAC TCAATATTAT
CAGATCGAGC AGCGTTGGCC AAGCTCACTT TTTTGGCAGC GGCGACATTA G
 
Protein sequence
MVIIGGGPAG LATAIALTDL DINSIVIESS HYLDPRLGEH LTPVGVGILK QLGIWDSQFL 
EKHRLCYGVR SAWGETQVTY SDYLFHPDGT GVNLSRPTFD RNLATLADGK GVRLLLSSQL
KQAQQEQNGW ILSLDTPKGL QEVRARVVVD ASGRKALFAR SQGRTSVYCD RLVGIAAFLE
PLAENHDQEE TLLLESGEFG WWYFARLQDN RGVFLHITDA DQLESRKDAP LQTWSKRLKS
TNFFSELAGY YHPVEKVLVR SARSHCLDQA TGHHWLAVGD AAMSFDPLSS MGITKALKAG
IFSSQVILRV LNGETTVLKD YEAEIQQQFN EYLQIRTQYY QIEQRWPSSL FWQRRH