Gene Smed_4522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4522 
Symbol 
ID5318497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1007760 
End bp1008920 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID640776323 
Productsiroheme synthase 
Protein accessionYP_001313255 
Protein GI150376659 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGCCT CACCGTCGCA GCAGAGTGAG CGGCCCGCAA AGCGAGAGCG CGTGGCCGCG 
CTCGCGACGC TGCCGCTTTT CTGGCCTCTG AAGAGCAAAC GGGTGCTCGT CGCCGGCGGC
AGCGATGCAG CCGCCTGGAA AGCCGAACTG CTGTCGGCCT GCGGTGCGGA AGTTCACGTC
TATGCGCCGC GCGCGATACT GAGCGGTCTC TTCCTCGACG TCCTCGCCCG TGGAGCCGCG
CATGAGCTCG GCCGTTTCGT TCACCACGAT AAGGCCTGGC ATGCAGACGC ATTTCGAGAC
GCGGCAATCG CCATTGCCGA TTGTGACGAG CAATCCGAGG CGGAAGCGTT CTTCCATGCC
GCACGAACCG CAGGCGTGCC CGTCAACGTC ATCGACAAGC CGGCCTTCTG CGAATTTCAG
TTCGGATCGA TCGTGAACCG CTCGCCCGTG ATCGTCGCGA TCTCGACCGA TGGCGCGGCG
CCGATACTGG CGCAGGCGAT CCGCAGACGG ATCGAGGCCC TGCTGCCGCC AGCGCTCAAG
CATTGGGCGT CGATTGCCCA GGCTATCCGC GATCGCGTGA ATGCCCGTCT GAGCCCTGGG
GCTGCCCGCC GCATCTTCTG GGAACGTTTC GTCGACCGGG CCTTTCTCGG CAAACCGGAG
CAGGGTGTGG AGATGCGGCT GATGGCGGAG GCGGATCGTC AGGTCACGCG CCCCTCCGCC
ATCGGCCGCG TCACCATCGT GGGGGCAGGA CCCGGCGACG CGGAACTCCT CACCTTGAAG
GCGGTCCGCG CGCTGCAGGC TGCCGACGTG ATACTCTTTG ACGAATGCAT TCAGGACGAG
GTCCTTGAAC TGGCGCGCCG GGAGGCGAGG CGCGTCCCTG TCGCAGGCAG CGACAAAGAT
CGCAGCAGCA GCAACGCGAT TCTCGTCCAG GGAGACATTG CGGCGCTGGT CCGGAAGGGA
AAGAACGTGG TGCGCCTCCG GTCCGGGAAT CCGATGGCAG TCGACGAGGA ATTCGCGGCA
TTCGAACGCC TCGGACTGCC TGTGCAGATC GTACCTGGCG TCGAAGCCGA GGCTTACCGC
CCCGACATGG GCTCCGATAT GGGCGAGCTA GCATTCAGCG GGAGACCGGT CCAGCCGGGA
CTGCACACCA CCAACCACTG A
 
Protein sequence
MLASPSQQSE RPAKRERVAA LATLPLFWPL KSKRVLVAGG SDAAAWKAEL LSACGAEVHV 
YAPRAILSGL FLDVLARGAA HELGRFVHHD KAWHADAFRD AAIAIADCDE QSEAEAFFHA
ARTAGVPVNV IDKPAFCEFQ FGSIVNRSPV IVAISTDGAA PILAQAIRRR IEALLPPALK
HWASIAQAIR DRVNARLSPG AARRIFWERF VDRAFLGKPE QGVEMRLMAE ADRQVTRPSA
IGRVTIVGAG PGDAELLTLK AVRALQAADV ILFDECIQDE VLELARREAR RVPVAGSDKD
RSSSNAILVQ GDIAALVRKG KNVVRLRSGN PMAVDEEFAA FERLGLPVQI VPGVEAEAYR
PDMGSDMGEL AFSGRPVQPG LHTTNH