Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3411 |
Symbol | |
ID | 7105213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3556543 |
End bp | 3557634 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643476426 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002373535 |
Protein GI | 218248164 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCG ATCATATTCA TTTCTACGTT GAAGATGCAG CACATCAACG AGATTGGTTT ATTGATAAAA TGGGGTTTCA ATCCATCAGC AACAGTATCC ATGATGACAC TTATAGCGAA GTAGTAGGGA ATCAGTCTGT TTACTTTATC TTATCTTCTC CCCTCAACGA TGCTAGTCCA GTTTCTTATT ACTTGAAATC TCATCCTCCG GGGGTTGCTG ATGTTGCTTT TCGTGTTGAC AATCTTAATT TTTTATTAGA CAAAGTATCC CGTTTTAAGG TCGAAATTAT TAATCAATCT AGTCTAACAG CTTTTCCTCT AAATAAACCA GTGAAATTCG CGAAACTTAA AGGATGGGGT TCTGTCAATC ATACCTTAAT TGATCAGGCA AGTCCTAGGA CTTTTATTAG CTCAAAAATG ATTGCTAAAA GCGATATTAT TGGGATTGAT CATGTTGTTT TAAATGTTCC TCAAGGTGAA CTCCCCTTAG CCATAAATTG GTACAAAAAT GTATTTGATT TTATAAGTCA TCAACAGTTC AACATCCAAA CAGAACATTC GGGGTTATCT AGTGAAGCCT TAGTTGATAG TTCAGGAAAA GTACAATTTA ATATTAATCA ACCAAGTTCT ACTAATTCTC AGATTCAGGA ATTTTTAGAC CATAATAACG GTTCAGGCAT TCAACATATT GGTTTAAAAT CAAGTAATAT TTTACAAAGT GTTGCACAAA TGCGTCAAAG GGGATTACCC TTTTTATCCG TTCCTAATTC CTATTACCAA AACCTAAAAG AATTGATTAG AAAATCGACA ATTTCTTGTT TAAGCCAACA GGAACTAGAA CAAATTGAAA CTGAACAAAT TCTAGTTTGT TGGCCAGAAG ATAACCCGAC TTCAATCCTG ATGCAAATTT TCACTCAACC CATTTTTAAG CAGCCGACTT TCTTTTTTGA ATTAATTCAA AGACGCAACC AAGCACAGGG ATTTGGCCAA GGTAATTTTC AAGCGTTATT TGAAGCCATA GAATCAGAAC AAATCAAGAG AAATAGGGTA TCCTCACGAG TCACTTTACA GGCTGTAACA CCCCAATCTT GA
|
Protein sequence | MEIDHIHFYV EDAAHQRDWF IDKMGFQSIS NSIHDDTYSE VVGNQSVYFI LSSPLNDASP VSYYLKSHPP GVADVAFRVD NLNFLLDKVS RFKVEIINQS SLTAFPLNKP VKFAKLKGWG SVNHTLIDQA SPRTFISSKM IAKSDIIGID HVVLNVPQGE LPLAINWYKN VFDFISHQQF NIQTEHSGLS SEALVDSSGK VQFNINQPSS TNSQIQEFLD HNNGSGIQHI GLKSSNILQS VAQMRQRGLP FLSVPNSYYQ NLKELIRKST ISCLSQQELE QIETEQILVC WPEDNPTSIL MQIFTQPIFK QPTFFFELIQ RRNQAQGFGQ GNFQALFEAI ESEQIKRNRV SSRVTLQAVT PQS
|
| |