Gene Sbal195_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4042 
SymbolhemE 
ID5755861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4758400 
End bp4759464 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content48% 
IMG OID641290388 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_001556462 
Protein GI160877146 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.984022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAT TAAAGAATGA TCGTTATTTA CGCGCCCTAC TAAAAGAGCC AGTCGATGTG 
ACCCCTGTGT GGATGATGCG TCAAGCTGGC CGTTATCTTC CTGAATATAA AGCTACGCGC
GCACAAGCGG GTGACTTTAT GTCTTTGTGT AAAAACCACG AGCTTGCCTG TGAAGTCACG
CTGCAACCTC TACGCCGATA TGATCTTGAT GCGGCGATTC TGTTTTCCGA TATTTTAACT
GTGCCTGATG CTATGGGTTT AGGTTTGTAT TTTGAGGCGG GTGAAGGCCC ACGTTTCGAG
CGCCCTACAG ACACTATCGA TGCAATTAAA AAGCTATCAA TCCCAGATCC AGAAGATGAG
CTTGGCTATG TGATGAAAGC CGTGAGCACT ATTCGCCGTG AGCTTAATGG CGCAGTGCCG
TTAATTGGCT TTTCTGGCTC GCCATGGACC TTAGCCACTT ATATGGTTGA AGGTGGCTCG
AGCAAAACTT TCGAAAAAAT TAAAAAGATG GCTTACGCTG AGCCAATGGC ATTACACATG
CTGTTAGACA AGCTAGCTGA TTCAGTGATC TTATACCTGA ATGCCCAAGT CGCCAACGGC
GCACAATCTT TGATGATTTT TGACTCATGG GGCGGCGCGT TATCGCACTC AGCTTATCGT
GAGTTCTCTT TGCGTTACAT GCAGAAGATT ATTGATGGTC TGACACGCTT TGCCGATGGA
CGTAAAGTGC CTGTGACGCT GTTCACTAAA GGCGGCGGTT TATGGTTAGA AGCCATGGCA
GAAACAGGTT GTGATGCGCT CGGTTTAGAT TGGACGGTAG ACATTGCCGA TGCTCGTCGC
CGTGTAGGCC ATAAAGTGGC CCTGCAAGGC AACATGGACC CTTCAATGTT GTATGCACCT
ATTCCACGCA TCGAAGAAGA AGTGGGCCAT ATCCTCGCGG GTTATGGTGA AGGTACTGGT
CATGTATTTA ACTTAGGCCA TGGTATACAT CAGCATGTCG ATCCAGAGCA TGCTGGCGCC
TTTATTAAGG CGGTTCACGC ACAATCAAAG CAATACCATA AGTAA
 
Protein sequence
MAELKNDRYL RALLKEPVDV TPVWMMRQAG RYLPEYKATR AQAGDFMSLC KNHELACEVT 
LQPLRRYDLD AAILFSDILT VPDAMGLGLY FEAGEGPRFE RPTDTIDAIK KLSIPDPEDE
LGYVMKAVST IRRELNGAVP LIGFSGSPWT LATYMVEGGS SKTFEKIKKM AYAEPMALHM
LLDKLADSVI LYLNAQVANG AQSLMIFDSW GGALSHSAYR EFSLRYMQKI IDGLTRFADG
RKVPVTLFTK GGGLWLEAMA ETGCDALGLD WTVDIADARR RVGHKVALQG NMDPSMLYAP
IPRIEEEVGH ILAGYGEGTG HVFNLGHGIH QHVDPEHAGA FIKAVHAQSK QYHK