Gene Mmar10_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2973 
SymbolhemE 
ID4286869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp3255552 
End bp3256589 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content61% 
IMG OID638142469 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_758192 
Protein GI114571512 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC TTACAAAAAA ACCGATCCTC CAGGTCCTGG CCGGAGAAAC CGTCTCACCG 
CCCCCTGTCT GGCTGATGCG CCAGGCCGGT CGCTATCTGG CTGAATATCG ACAAGTCCGC
TCTCGAGCGA AGAACTTCAT CGATTTCTGT TTCTCTCCGG ATCTAGCCGC AGAAGTGACT
CTGCAGCCCA TACGCCGCTT TGGCTTTGAC GCTGCGATCC TGTTTGCCGA CATCCTGCTG
GTGCCGATCG CGCTCGGTCG CAAGGTCTGG TTCGTGACCG GCGAGGGTCC CCAGCTGGAG
CCGTTCGATC CGCGCCAATT TGAAGAGCTC CGGCTCGACC AGACCGAGGC GGTTCTCGGT
TCAATTGGCG AGACGCTCAA GCGCGTGGTG CCGGAACTGC CGGACACGAC CACCATGATC
GGGTTTGCCG GATCACCCTG GACAGTCGCA ACCTACATGG TCGAAGGCGG CGGATCGAAA
GATCGTTTCC GGACCCGGGT CGCTGCCTGG GAGTATCCCG AAGCCTTTGA TGGCATGCTC
GATCGGATCG CCGACGTCAC GGCAGAGTAT CTGATCATGC AGGCCCGTAG CGGTGCTGAA
GTCCTCAAAC TGTTTGACAG CTGGGCCGAG GGTCTGCCGG AACCGCTGTT CGAACGGGTC
GTGATCCGGC CGACCAAACG GATCGTTGAT GCTGTGCGGG CTGCCGGCAT TGACGTGCCG
ATCATCGGGT TTCCACGCGG CGCCGGGACG CTCTACCCGC GATATGCGCG CGAGACCGGT
GTGACGGCGA TTGCTGTCGA TACGGGTGTT GATCCGGCCT GGATCCAGTC AGTCCTGCCT
GCAGGCATGC CGGTCCAGGG ACATCTCGAC CCGTCTGTCC TGCGGGCAGG CGGCGCAGCA
CTGGATGGCG AGGTCGATCG ATTGTTGGAC CAATGGGCTG GTCGGCCCCA TATCTTCAAT
CTTGGCCACG GCATCACGCC GGACGTGCCG GTAGCTCATG TCGAACAGCT TCTGGCGCGT
ATTCGCGATC GCAGCTGA
 
Protein sequence
MNDLTKKPIL QVLAGETVSP PPVWLMRQAG RYLAEYRQVR SRAKNFIDFC FSPDLAAEVT 
LQPIRRFGFD AAILFADILL VPIALGRKVW FVTGEGPQLE PFDPRQFEEL RLDQTEAVLG
SIGETLKRVV PELPDTTTMI GFAGSPWTVA TYMVEGGGSK DRFRTRVAAW EYPEAFDGML
DRIADVTAEY LIMQARSGAE VLKLFDSWAE GLPEPLFERV VIRPTKRIVD AVRAAGIDVP
IIGFPRGAGT LYPRYARETG VTAIAVDTGV DPAWIQSVLP AGMPVQGHLD PSVLRAGGAA
LDGEVDRLLD QWAGRPHIFN LGHGITPDVP VAHVEQLLAR IRDRS