Gene Sare_1474 details

Gene Information

Locus tag: Sare_1474
Symbol: hemE
ID: 5706067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1703981
End bp: 1705078
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 72%
IMG OID: 641270982
Product: uroporphyrinogen decarboxylase
Protein accession: YP_001536363
Protein GI: 159037110
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
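
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch that assumes 1-based inclusive coordinates and a single stop codon at the end of the CDS; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1703981, 1705078)
print(length_bp)                  # 1098
print(protein_length(length_bp))  # 365
```

Both values match the Gene Length and Protein Length fields of this record.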


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.514226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000238489
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACCG ACACCACGGG CACCGCCGCC CGAGACGAGG GTCCTCGCCC CGGCGATCCG 
GCCGACTCAC CTTTCGTCCG CGCGTGCCGG CGCGAACCCG GGCCGCACAC CCCGGTCTGG
TTCATGCGAC AGGCGGGCCG CTCGCTCCCG GAGTACCGGA AGATCCGGTC CGAGGTGGCG
ATGCTGGAGT CCTGCCGCCG GCCGGACCTG ATTACCGAGA TCACCCTCCA GCCGGTGCGC
CGACACAAGG TCGACGCGGC GATCCTGTTC AGCGACATCG TGGTGCCGGT CGCCGCCGCC
GGGGTGGCGT TGGACATCGT CCCCGGCACC GGCCCGGTGG TGACCGACCC GGTGACCACC
AGGGCAGACG TGGAGCGAAT CCGGCTGATC GACCGCGACG ATGTCCACTA CGTGGACGAG
GCGGTCCGGA TGCTCGTCGA CGAGCTGGGC GGCACCCCGC TGATCGGCTT CGCCGGTGCT
CCGTTCACGC TGGCCAGCTA CCTCGTCGAG GGAGGCCCGT CCCGCACCCA CGTGCGGACC
AAGGCCCTGA TGTACGGCGA CCCGGACCTG TGGCACGCCC TGGCCGGCCG GCTCGCCGAG
ATGACGCTCG CGTTCCTGAA GGTGCAGATC GACGCCGGCG TCTCCGCGGT GCAGCTCTTC
GACTCCTGGG CGGGTGCGCT CTCCGAAGCC GACTACCGCC GGTACGTGCT GCCGCACTCG
CGGGCGGTGC TCGCCGGGCT CGCCGACGCC GGAGTCCCCC GTATCCACTT CGGGGTGGGC
ACCGGCGAGC TGATCGCCGC GATGGGCGAG GCGGGCGCCG ACGTGGTGGG CGTCGACTGG
CGTACGCCGC TGGACGTCGC CACTCGCCGG ATCGGTCCCG AGCGGGCCGT GCAGGGCAAC
CTCGACCCGT GCCTGCTGTT CGCCCCGTGG CCGGTCATCG AGGCCGAGGT ACGGCGGGTG
CTGGCCCAGG GGCGTGCCGC CCCCGGGCAC ATCTTCAATC TCGGCCACGG AGTGCTGCCG
GAGACCGACC CCGAGGTGCT GACCCGGGTG GTGGCCCTGG TCCACGAGCT GACCGTGCGT
CCGGATGGAA GGAGCTGA
 
Protein sequence
MSTDTTGTAA RDEGPRPGDP ADSPFVRACR REPGPHTPVW FMRQAGRSLP EYRKIRSEVA 
MLESCRRPDL ITEITLQPVR RHKVDAAILF SDIVVPVAAA GVALDIVPGT GPVVTDPVTT
RADVERIRLI DRDDVHYVDE AVRMLVDELG GTPLIGFAGA PFTLASYLVE GGPSRTHVRT
KALMYGDPDL WHALAGRLAE MTLAFLKVQI DAGVSAVQLF DSWAGALSEA DYRRYVLPHS
RAVLAGLADA GVPRIHFGVG TGELIAAMGE AGADVVGVDW RTPLDVATRR IGPERAVQGN
LDPCLLFAPW PVIEAEVRRV LAQGRAAPGH IFNLGHGVLP ETDPEVLTRV VALVHELTVR
PDGRS
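
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is an illustrative example, not the database's own pipeline: it assumes the standard codon-to-residue assignments (which translation table 11 shares for internal codons), strips the spaces used to group bases in blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order (standard assignments;
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between base groups and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last stretches of the gene sequence listed above.
print(translate("ATGAGCACCG ACACCACGGG CACCGCCGCC"))  # MSTDTTGTAA
print(translate("CCGGATGGAA GGAGCTGA"))               # PDGRS
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of this record.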