Gene PCC8801_3897 details


Gene Information

Locus tag: PCC8801_3897
Symbol:
ID: 7103851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4087756
End bp: 4088721
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 32%
IMG OID: 643476901
Product: 2-keto-4-pentenoate hydratase-like protein
Protein accession: YP_002374002
Protein GI: 218248631
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3971] 2-keto-4-pentenoate hydratase
TIGRFAM ID:
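
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4087756
END_BP = 4088721


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "966 321"
```

Under these assumptions the computed values agree with the listed gene length (966 bp) and protein length (321 aa).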


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAAGA TAAACTATTT TTTGTTCCCC TTCTTCATTT TACTAAGTCC CCTTCCTGAA 
CTGGCACAAG TTAAAATAAA AGATCCATTA TTTAAGTCAA ATTATCATCA AATACACAAT
ACAAATATTG CTGACTTTAA AGATGACTTT ATAACTTTAT CGAATCAAGA CTTAGATAAA
TTAGCAGAAA AGTTAGCTAA TTATTATTTG ACTCAACAAA AAATTGATGA TTTTCCTGAC
AATATAACTT CTAATCAGTC TCTTCTTATC CAATCTAAAT TTGTCAACAA TTTAATTAAT
AATCAAGGCA ATATCATTGG TTATAAAGCA GGTTTGACTA ACCAAAAAAT CCAAGAAAGA
TTTAACACAA ATCAACCTGT ATTAGGAACT TTACTCGAAA AAATGTTATT GCCATCAGGA
ACAATCGTTT CCTCTAAATT TGGTGCTATT CCTATGATGG AAGGAGATTT AATGGTCAGA
GTGAAAAGTG AGAAAATTAA TCAAGCAAAA ACGACCGAAG AAGTCTTAAA CTATTTAGAT
GCTGTTATTC CATTTTTAGA ATTACCTGAT TTAATGTATA GCCAAGATCT AAAATTAAAT
AAGGAAATGT TAGTCGCTAT TAATGTTGGT GCAAGATTAG GAATTATGGG AGAACCTATT
CCGTTAGAAG CAACGAAAGA ATGGCACACT AAGTTAAGTA ATATTCAGGT TACTATTAAA
GATGAATTGG GTCAAGAATT AGCCCAAGGA AACGGTAAAG CATTATTAGG AGATCCCTTA
ACAGTTGTAC TCTGGATTAA AGATGAGCTA CGATCTCAAG GAAAAAGCCT AAAAAAAGGT
GATTTGTTAT CTTTAGGAAG TATTACCCCT TTAATACCCG TTAAACCAGG AAAAACAATT
TCAGCGCAGT ATTTAGGATT AAATGAAGCG AGCCCAGTTC AACTATCCGT CCACTTTGAA
GAATAA
 
Protein sequence
MTKINYFLFP FFILLSPLPE LAQVKIKDPL FKSNYHQIHN TNIADFKDDF ITLSNQDLDK 
LAEKLANYYL TQQKIDDFPD NITSNQSLLI QSKFVNNLIN NQGNIIGYKA GLTNQKIQER
FNTNQPVLGT LLEKMLLPSG TIVSSKFGAI PMMEGDLMVR VKSEKINQAK TTEEVLNYLD
AVIPFLELPD LMYSQDLKLN KEMLVAINVG ARLGIMGEPI PLEATKEWHT KLSNIQVTIK
DELGQELAQG NGKALLGDPL TVVLWIKDEL RSQGKSLKKG DLLSLGSITP LIPVKPGKTI
SAQYLGLNEA SPVQLSVHFE E
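
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration rather than the database's own method: it assumes the sequence is given on the coding strand, and it relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first row of the gene sequence is used here; pasting in the full sequence works the same way.

```python
# Codon table in the conventional NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_row = "ATGACGAAGA TAAACTATTT TTTGTTCCCC TTCTTCATTT TACTAAGTCC CCTTCCTGAA"
print(translate(first_row))  # prints "MTKINYFLFPFFILLSPLPE"
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Calling `gc_percent` on the full gene sequence gives a value that can be compared with the GC content field of the record.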