Gene PCC8801_3411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3411 
Symbol 
ID7105213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3556543 
End bp3557634 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content35% 
IMG OID643476426 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002373535 
Protein GI218248164 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATCG ATCATATTCA TTTCTACGTT GAAGATGCAG CACATCAACG AGATTGGTTT 
ATTGATAAAA TGGGGTTTCA ATCCATCAGC AACAGTATCC ATGATGACAC TTATAGCGAA
GTAGTAGGGA ATCAGTCTGT TTACTTTATC TTATCTTCTC CCCTCAACGA TGCTAGTCCA
GTTTCTTATT ACTTGAAATC TCATCCTCCG GGGGTTGCTG ATGTTGCTTT TCGTGTTGAC
AATCTTAATT TTTTATTAGA CAAAGTATCC CGTTTTAAGG TCGAAATTAT TAATCAATCT
AGTCTAACAG CTTTTCCTCT AAATAAACCA GTGAAATTCG CGAAACTTAA AGGATGGGGT
TCTGTCAATC ATACCTTAAT TGATCAGGCA AGTCCTAGGA CTTTTATTAG CTCAAAAATG
ATTGCTAAAA GCGATATTAT TGGGATTGAT CATGTTGTTT TAAATGTTCC TCAAGGTGAA
CTCCCCTTAG CCATAAATTG GTACAAAAAT GTATTTGATT TTATAAGTCA TCAACAGTTC
AACATCCAAA CAGAACATTC GGGGTTATCT AGTGAAGCCT TAGTTGATAG TTCAGGAAAA
GTACAATTTA ATATTAATCA ACCAAGTTCT ACTAATTCTC AGATTCAGGA ATTTTTAGAC
CATAATAACG GTTCAGGCAT TCAACATATT GGTTTAAAAT CAAGTAATAT TTTACAAAGT
GTTGCACAAA TGCGTCAAAG GGGATTACCC TTTTTATCCG TTCCTAATTC CTATTACCAA
AACCTAAAAG AATTGATTAG AAAATCGACA ATTTCTTGTT TAAGCCAACA GGAACTAGAA
CAAATTGAAA CTGAACAAAT TCTAGTTTGT TGGCCAGAAG ATAACCCGAC TTCAATCCTG
ATGCAAATTT TCACTCAACC CATTTTTAAG CAGCCGACTT TCTTTTTTGA ATTAATTCAA
AGACGCAACC AAGCACAGGG ATTTGGCCAA GGTAATTTTC AAGCGTTATT TGAAGCCATA
GAATCAGAAC AAATCAAGAG AAATAGGGTA TCCTCACGAG TCACTTTACA GGCTGTAACA
CCCCAATCTT GA
 
Protein sequence
MEIDHIHFYV EDAAHQRDWF IDKMGFQSIS NSIHDDTYSE VVGNQSVYFI LSSPLNDASP 
VSYYLKSHPP GVADVAFRVD NLNFLLDKVS RFKVEIINQS SLTAFPLNKP VKFAKLKGWG
SVNHTLIDQA SPRTFISSKM IAKSDIIGID HVVLNVPQGE LPLAINWYKN VFDFISHQQF
NIQTEHSGLS SEALVDSSGK VQFNINQPSS TNSQIQEFLD HNNGSGIQHI GLKSSNILQS
VAQMRQRGLP FLSVPNSYYQ NLKELIRKST ISCLSQQELE QIETEQILVC WPEDNPTSIL
MQIFTQPIFK QPTFFFELIQ RRNQAQGFGQ GNFQALFEAI ESEQIKRNRV SSRVTLQAVT
PQS