Gene Smed_3208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3208 
SymbolhemE 
ID5324087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3383576 
End bp3384607 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content64% 
IMG OID640792156 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_001328867 
Protein GI150398400 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGAA CGCATCGCAA AGTGCTAGAG GTTTTGAACG GGAGATCGCT CACGCCTCCC 
CCCATCTGGC TGATGCGGCA GGCTGGACGC TACCTCCCCG AATACAGGGC AACCCGGATA
AAGGCCGGAA GCTTCCTCGA CCTCTGCTAC ACACCCGAGC TTGCAGTCGA AGTGACATTG
CAGCCGATCC GCCGCTACGG CTTCGATGCC GCGATCCTCT TCTCCGATAT TCTGGTCGTT
CCCGATGCAC TCAATCGAAA TGTTCGCTTC GAAGAGGGGC AGGGACCGCG GATGGATCCG
ATCGACGAGG ACGGTATAGC GCAGCTGAGC CAGACAGGCG TCATCGAGCA CCTTGCCCCG
GTCTTCGAGA CGGTCTCTCG ACTCAGGGGC GAATTGGCGG CGGAAATCAC GCTGCTCGGC
TTTTGCGGGG CGCCCTGGAC CGTGGCGACC TATATGATCG CCGGTCGCGG GACGCCAGAC
CAGGCGCCGG CGCGCCTCTT CGCCTATCGT CATCCCAAGG CCTTTGAACG GCTTCTGGCG
CTGCTTGCCG ATATTTCCGC CGACTACCTG GTCGAACAGA TCGATCGCGG TGCGGATGCG
GTGCAAATCT TCGACTCCTG GGCCGGTGTG CTCGGCGAGG AAGAATTCCA ACGTTACGCG
GTGGAGCCTG TCCGGCGCAT CATCGCCTCG GTCCGGTCCC GCCGGCCTTC GGCGAAAATC
ATCGCCTTTG CGAAAGGCGC CGGCATCCTG TTGAAGAACT ATCGGCAAGC GACCGGCGCG
GACGCAATCG GCCTCGATTG GTCGGTGCCG CTCTCCTTCG CAGCGGAGCT GCAGAAGGAC
GGGCCGGTTC AGGGCAATCT CGATCCGGTG CGGGTCGTGG CCGGTGGCGC GGCGCTGGAG
CATGGAATCG ACCGTATCCT GGACGTCCTC GGGCAAGGGC CGCTGATCTT CAATCTGGGC
CACGGCATCA CGCCGGACGC GGATCCGGAG CATGTCGCCG CGCTTGTGTC TCGCGTCCGA
GGACGCCGAT GA
 
Protein sequence
MSGTHRKVLE VLNGRSLTPP PIWLMRQAGR YLPEYRATRI KAGSFLDLCY TPELAVEVTL 
QPIRRYGFDA AILFSDILVV PDALNRNVRF EEGQGPRMDP IDEDGIAQLS QTGVIEHLAP
VFETVSRLRG ELAAEITLLG FCGAPWTVAT YMIAGRGTPD QAPARLFAYR HPKAFERLLA
LLADISADYL VEQIDRGADA VQIFDSWAGV LGEEEFQRYA VEPVRRIIAS VRSRRPSAKI
IAFAKGAGIL LKNYRQATGA DAIGLDWSVP LSFAAELQKD GPVQGNLDPV RVVAGGAALE
HGIDRILDVL GQGPLIFNLG HGITPDADPE HVAALVSRVR GRR