Gene Syncc9605_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1017 
SymbolhemE 
ID3735264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp964299 
End bp965357 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content62% 
IMG OID637775609 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_381330 
Protein GI78212551 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CTTTACCCCT GCTCCTGCGT GCTGCGCGCG GTGAATCGGT GGAGCGTCCT 
CCGGTGTGGA TGATGCGCCA GGCCGGTCGC TATATGAAGA TCTACCGGGA CCTGCGGGAC
AAGTACCCGA GCTTCCGTGA GCGTTCCGAG AACCCCGATC TCTCCTATGA GATCTCGATG
CAGCCGTTTC ACGCTTTCAA GCCCGATGGA GTGATCCTGT TCTCCGACAT CCTCACGCCC
CTGCCGGGGA TGGGAATCGA TTTCGACATC ATCGAGAGCA AAGGCCCCCA GATCGGCGAT
CCAATCCGGA GCATGGCCCA GGTGGATGCT TTGCGCCCGC TGAACCCCTC CGAGTCGATG
CCCTTTGTGG GTGAAGTGCT GGGTCGTCTC CGCCAGAGTG TTGGCAACGA AGCAGCCGTT
CTCGGCTTCG TTGGTGCCCC TTGGACCCTG GCCGCCTACG TGGTGGAGGG CAAGAGCAGC
AAGAACTACG CTGTGATCAA GGCCATGGTC TTCCGCGAAC CCGAGATCCT GCACAAGCTG
CTCGACCACT TCGCCGAGTC GATCGCCAAC TACCTGCGCT ATCAGATTGA TTCCGGTGCC
CAGGTGGTGC AGATGTTTGA TTCCTGGGCT GGCCAGCTGA GCCCAGCGGA CTACGACACC
TTCGCTGCGC CCTATCAGAA GAAGGTGGTG GATCTGGTCA AGAAGACGCA CCCCGACACT
CCCTTCATCC TCTACATCTC CGGCAGCGCC GGCGTGATCG AGCGGATGGC CAACACCGGT
GTTGACATCG TCTCGCTGGA TTGGACCGTG GACATGGCCG AGGCCCTGGC ACGTCTGCCG
GAGCACATTG GTGTTCAGGG CAATGTGGAC CCCGGCCTGC TGTTCGGCAC CCCGGAAGCG
ATTGAGGCCC GCATCGATGA CTGCGTGCGC AAGGCCCGCG GCCGGAAGCA CATCCTCAAC
CTCGGCCATG GAATCCTGCC GGGTACACCG GAAGAGAACG GCGAAGCCTT CTTCCGGGCC
GGCAAGAGCG TGATGGACCG TCTCGGGGCT GTCGTTTGA
 
Protein sequence
MSDSLPLLLR AARGESVERP PVWMMRQAGR YMKIYRDLRD KYPSFRERSE NPDLSYEISM 
QPFHAFKPDG VILFSDILTP LPGMGIDFDI IESKGPQIGD PIRSMAQVDA LRPLNPSESM
PFVGEVLGRL RQSVGNEAAV LGFVGAPWTL AAYVVEGKSS KNYAVIKAMV FREPEILHKL
LDHFAESIAN YLRYQIDSGA QVVQMFDSWA GQLSPADYDT FAAPYQKKVV DLVKKTHPDT
PFILYISGSA GVIERMANTG VDIVSLDWTV DMAEALARLP EHIGVQGNVD PGLLFGTPEA
IEARIDDCVR KARGRKHILN LGHGILPGTP EENGEAFFRA GKSVMDRLGA VV