Gene YpsIP31758_3305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3305 
SymbolcysG2 
ID5386338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3714795 
End bp3716213 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content56% 
IMG OID640866320 
Productsiroheme synthase 
Protein accessionYP_001402262 
Protein GI153949295 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACTATC TACCTCTATT TGCCGACTTG AAACAACGCC CGGTATTGAT CGTTGGCGGC 
GGCGAAGTTG CTGCGCGAAA AATTGAACTA CTGCACCGCG CAGGTGCGCA GGTATGGGTA
GTGGCGCAAA CCCTCTCATC TGAACTCGAA CAGCAGTATC AGGATGGCCG TATCCATTGG
CTGGCACAGG ATTTTCTGCC TGAACAGTTG GACAACGTGT TTCTGGTGAT TGCCGCAACT
AACGATACCG TACTGAATGC CGCTGTTTTC GCGGCGGCAG ATCAACGGTG TATTTTGGCG
AATGTGGTGG ATGACCAACC CCTTTGTTCG TTTATTTTCC CTTCGATTGT TGATCGTTCA
CCGCTGGTGG TCGCGATCTC GTCTTCCGGC CAGGCACCGG TGCTGGCGCG GATACTGCGT
GAGAAGTTAG AAGCCTTACT GCCCACTCGT CTCAGTGATA TGGCCGCTAT AGCTGGGCGC
TGGCGTGGGC GGGTTAAGCA GCATATGGCC TCTATGGGGG AGCGCCGCCG CTTTTGGGAA
CATGCATTCA GCGGGCGCTT TGCCAGCCTT ATCAGCCGTG GTCAGTTAAC CGAGGCTGAA
AATGAATTAC AACTGTCGCT AGAGGGCCAA CACCGTGCAC TCGGCGAAGT GGCTTTGGTC
GGTGCAGGTC CTGGCGATGC AGGGTTATTA ACGTTACGTG GTTTGCAAGT GATGCAGCAG
GCAGATGTGG TGCTGTATGA CCATTTAGTC AGTCCGGAAG TCTTGGATTT GGTGCGTCGC
GATGCTGAAC GCATCTGTGT GGGGAAACGG GCGGGGGCGC ATTCGGTTAC CCAAGAGGCA
ACCAATCAGT TGTTGGTCAC TCTGGCACAG CAAGGAAAAC GGGTCGTACG GCTAAAAGGT
GGCGATCCCT TTATTTTTGG TCGGGGGGGG GAAGAGTTAC AAGTTGTGGC GCAAGCGGGG
ATCCCGTTTC AGGTGGTACC GGGGGTGACG GCTGCTGCGG GGGCGACAGC CTATGCCGGT
ATTCCCCTGA CACACCGCGA TCATGCGCAA AGCGTGACAT TTATTACCGG GCATTGCCGT
CCTGATGGCG ATGATCTTGA TTGGCAGGCG CTGGCCCGTG GTCGTCAGAC GCTGGCGATC
TACATGGGAA CAGTGAAAGC GGCGGCAATC AGCCAACAGT TGATCGCCCA TGGCCGGTCC
AGTACCACGC CGGTGGCAGT GATTGGCCGT GGAACCCGTG TGGACCAGCA GGTGCTGATC
GGTACGTTGG CACAACTTGA ATCACTCGCA CAGCAGGCAC CGACACCAGC ATTACTGGTG
ATTGGTGAAG TGGTGAATTT ACATCACCAA ATTGCCTGGT TTGGGCAACA ACCGCAGACT
GAATCGGCGA TCAACCCATC AGTCGTGAAT TTGGCATAA
 
Protein sequence
MDYLPLFADL KQRPVLIVGG GEVAARKIEL LHRAGAQVWV VAQTLSSELE QQYQDGRIHW 
LAQDFLPEQL DNVFLVIAAT NDTVLNAAVF AAADQRCILA NVVDDQPLCS FIFPSIVDRS
PLVVAISSSG QAPVLARILR EKLEALLPTR LSDMAAIAGR WRGRVKQHMA SMGERRRFWE
HAFSGRFASL ISRGQLTEAE NELQLSLEGQ HRALGEVALV GAGPGDAGLL TLRGLQVMQQ
ADVVLYDHLV SPEVLDLVRR DAERICVGKR AGAHSVTQEA TNQLLVTLAQ QGKRVVRLKG
GDPFIFGRGG EELQVVAQAG IPFQVVPGVT AAAGATAYAG IPLTHRDHAQ SVTFITGHCR
PDGDDLDWQA LARGRQTLAI YMGTVKAAAI SQQLIAHGRS STTPVAVIGR GTRVDQQVLI
GTLAQLESLA QQAPTPALLV IGEVVNLHHQ IAWFGQQPQT ESAINPSVVN LA