Gene PCC8801_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2998 
SymbolhemE 
ID7104490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3104861 
End bp3105925 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content47% 
IMG OID643476027 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_002373141 
Protein GI218247770 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAG GTAATAATAC CCCCTATTTA CTCCGTGCTG CACGGGGAGA GATATTAGAC 
AGACCCCCCG TGTGGATGAT GCGACAAGCG GGGCGTTATA TGAAGGTTTA TCGAGATTTA
CGCGACAAAT ACCCCAGTTT TCGGGATAGA TCGGAGAATG CAGACTTAGC CATCGAAATT
TCCCTACAAC CCTGGAAAGC ATTCCGACCC GATGGGGTGA TCATGTTCTC TGATATTCTA
ACCCCTCTCC CTGGCATTGG CATTTCCTTT GAAATTCCTG AAAGCAAGGG TCCGATGATT
GATTCTCCTA TTCGTACCCA GGAACAGGTC GATAATCTCC ATTCTCTCGA CCCTGAAGCG
TCTTTACCCT TCATAAAGAC TATTTTAAAG GCTTTACGCG ACGAAGTGAA GAACGAAGCT
ACGGTTTTAG GGTTTGTCGG TTCTCCTTGG ACTCTTGCAG CTTATGCTAT TGAGGGGAAA
AGTTCCAAAG ATTATGCCAA CATTAAACGG ATGGCTTTCT CTGAACCAGC CATTTTACAC
CAGTTTTTGA GTAAATTAGC CGATGCGATC GCGGTTTATG TTCGCTATCA GATCGATTGT
GGTGCTCAAG TAGTGCAATT GTTCGACTCT TGGGCGGGTC AATTGAGTCC CCAAGATTAC
AAAGTGTTTG CACTTCCCTA CCAGCAACAG GTCGTCCGTC AGGTGAAAGC AACCCATCCT
GATACCCCGC TTATTCTCTA TATTAGCGGC AGTGCCGGGG TTTTGGAACT GATGGGTCAG
TCGGGGGTAG ACATCGTTAG CGTTGACTGG ACGGTGGATA TGGCTGACGC TAGACAACGG
TTAGGACGTA ATATGATGGT ACAGGGGAAT ATCGATCCAG GTATCTTATT TGGGTCAAAA
CAGGTAATCC GCGATCGCAT TTTAGACACA GTTCAAAAAG CGGGTAAAGG TGGCCATATC
TTGAATTTAG GTCATGGTGT CTTGGTGGGA ACTCCTGAAG AGAATGTTGG TTACTTCTTT
GAAACGGCTA AGCAGGTTGA TCAATTACTC GCGGTTCCCG TTTAG
 
Protein sequence
MTQGNNTPYL LRAARGEILD RPPVWMMRQA GRYMKVYRDL RDKYPSFRDR SENADLAIEI 
SLQPWKAFRP DGVIMFSDIL TPLPGIGISF EIPESKGPMI DSPIRTQEQV DNLHSLDPEA
SLPFIKTILK ALRDEVKNEA TVLGFVGSPW TLAAYAIEGK SSKDYANIKR MAFSEPAILH
QFLSKLADAI AVYVRYQIDC GAQVVQLFDS WAGQLSPQDY KVFALPYQQQ VVRQVKATHP
DTPLILYISG SAGVLELMGQ SGVDIVSVDW TVDMADARQR LGRNMMVQGN IDPGILFGSK
QVIRDRILDT VQKAGKGGHI LNLGHGVLVG TPEENVGYFF ETAKQVDQLL AVPV