Gene PCC8801_1649 details


Gene Information

Locus tag: PCC8801_1649
Symbol:
ID: 7101624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1723768
End bp: 1724880
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 45%
IMG OID: 643474720
Product: Rieske (2Fe-2S) domain protein
Protein accession: YP_002371856
Protein GI: 218246485
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
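
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1723768
end_bp = 1724880
gene_length_bp = 1113
protein_length_aa = 370

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons = span // 3

# Assumption: the final codon is a stop codon and is not counted in the protein length.
print(span == gene_length_bp, span % 3 == 0, codons - 1 == protein_length_aa)
# prints: True True True
```

Under these assumptions the three fields agree: 1113 bases form 371 codons, of which 370 encode amino acids.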


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTGG CACAAAAGCC GGAGTTAGTG CGTCAACATC CCCATGAGGA GGAAGACTAC 
CCTATGAGTT CCCTTTTGCG TAACGCTTGG TACGTTGCTT TACCTGGAAA GCAGCTAAAA
CCAGGGAAAA TGACCCATAA AAAGATGTTA GGAGAACCCG TCTTAGTGGG ACGACGGGAA
GATGGGGAAG TCTTTGCTAT GCGCGATATT TGTCCCCATC GCGGTATCCC CCTACAGTAC
GGATGGCTCG AAGGGGATGG GGTTTGTTGT TGCTATCATG GCTGGAAATT TAACACCAGC
GATGGCCGGT GTAGTGAAAT TCCCTCCTTA ACCGAGTACG ATGACTTAGA TATTAGCCGT
ATTCGTGTCC CTACCTACCC TTGTCGAGAA GTTCAAGGCA ATATTTGGGT CTATTTTGCT
GAAGACTCCA AAAAAGAAAT TAACCCTTCA GAGCTTCCTC CCGTGCCAAC AATCCCCGAT
TTTGGTAAAG TTGAGCCTGG AATCTCGGAA ACCATCCATT TTGCTTGCCA TATTGACCAT
GCGATTATTG GCTTAATGGA CCCAGCCCAT GGCCCCTACG TTCATAGTTC CTGGTGGTGG
CGCAGTGGTC CACGGAAGTT TCGAGTTAAA GAAAAGCAAT ATGAACCCGT AGCCCAAGGA
TTTCGCCTCG TTCCTTATGA TATGCCAGTT AGTGCGCGAC CTTACAAGAT TTTAGGCAAT
CAAGTGTCTA TTGAAATCGT TTTTGAGTTG CCCAGTGTAC GGACAGAGAT TTTAAGAGGC
GATCGCTATT CGGCTTGCTT ATTGACTACC ATTACCCCCA TTGATGAAAA CGAATGCGAA
GCCTTTCAAA GCATTTATTG GACAATTCCT TGGATGGGAC TATTTAAACC CCTATTAAGT
TTGTTAACCC GTCAATTTCT GGCGCAAGAT CGAGATGTGG TTATTCAACA ACAAGAAGGG
TTAAAATACA ATCCAGCCTT AATGCTAATT GATGACGCGG ATACTCAAGC TAAATGGTAT
TTTCGCCTCA AACAGGAATA TCAAAAGTCC CAAGCAGAAA ATCGTCCCTT TAAAAATCCT
GTAGAACCAA GGATTTTACG CTGGCGTAGC TGA
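
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base grouping spaces intact; `clean` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here for illustration, so the printed value is not expected to match the whole-gene figure.

```python
def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGACTTGG CACAAAAGCC GGAGTTAGTG CGTCAACATC CCCATGAGGA GGAAGACTAC"
print(len(clean(first_row)), round(gc_percent(first_row), 1))
```

Passing the full 1113-base sequence to `gc_percent` would give the value to compare against the GC content field.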
 
Protein sequence
MDLAQKPELV RQHPHEEEDY PMSSLLRNAW YVALPGKQLK PGKMTHKKML GEPVLVGRRE 
DGEVFAMRDI CPHRGIPLQY GWLEGDGVCC CYHGWKFNTS DGRCSEIPSL TEYDDLDISR
IRVPTYPCRE VQGNIWVYFA EDSKKEINPS ELPPVPTIPD FGKVEPGISE TIHFACHIDH
AIIGLMDPAH GPYVHSSWWW RSGPRKFRVK EKQYEPVAQG FRLVPYDMPV SARPYKILGN
QVSIEIVFEL PSVRTEILRG DRYSACLLTT ITPIDENECE AFQSIYWTIP WMGLFKPLLS
LLTRQFLAQD RDVVIQQQEG LKYNPALMLI DDADTQAKWY FRLKQEYQKS QAENRPFKNP
VEPRILRWRS
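
The protein sequence can be related back to the gene sequence by codon translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). `clean` and `translate` are hypothetical helper names, and only the first and last rows of the gene sequence are translated.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGACTTGG CACAAAAGCC GGAGTTAGTG"))  # prints MDLAQKPELV

# Last row of the gene sequence; it starts in frame because 1080 bases precede it.
print(translate("GTAGAACCAA GGATTTTACG CTGGCGTAGC TGA"))  # prints VEPRILRWRS
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final TGA codon is the stop.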