Gene YpAngola_A0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0970 
SymbolcysG2 
ID5799433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp989956 
End bp991374 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content56% 
IMG OID641338962 
Productsiroheme synthase 
Protein accessionYP_001605534 
Protein GI162419435 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTATC TACCTCTATT TGCCGACTTG AAACAACGCC CGGTATTGAT CGTTGGCGGC 
GGCGAAGTTG CTGCGCGAAA AATTGAACTA CTGCACCGCG CAGGTGCGCA GGTATGGGTA
GTGGCGCAAA CCCTCTCATC TGAACTCGAA CAGCAGTATC AGGATGGCCG TATCCATTGG
CTGGCACAGG ATTTTCTGCC TGAACAGTTG GACAACGTGT TTCTGGTGAT TGCCGCAACT
AACGATACCG TACTGAATGC CGCTGTTTTC GCGGCGGCAG ATCAACGGTG TATTTTGGCG
AATGTGGTGG ATGACCAACC CCTTTGTTCG TTTATTTTCC CTTCGATTGT TGATCGTTCA
CCGCTGGTGG TCGCGATCTC GTCTTCCGGC CAGGCACCGG TGCTGGCGCG GATACTGCGT
GAGAAGTTAG AAGCCTTACT GCCCACTCGT CTCAGTGATA TGGCCGCTAT AGCTGGGCGC
TGGCGTGGGC GGGTTAAGCA GCATATGGCC TCTATGGGGG AGCGCCGCCG CTTTTGGGAA
CATGCATTCA GCGGGCGCTT TGCCAGCCTT ATCAGCCGTG GTCAGTTAAC CGAGGCTGAA
AATGAATTAC AACTGTCGCT AGAGGGCCAA CACCGTGCAC TCGGCGAAGT GGCTTTGGTC
GGTGCAGGTC CTGGCGATGC AGGGTTATTA ACGTTACGTG GTTTGCAAGT GATGCAGCAG
GCAGATGTGG TGCTGTATGA CCATTTAGTC AGTCCGGAAG TCTTGGATTT GGTGCGTCGC
GATGCTGAAC GCATCTGTGT GGGGAAACGG GCGGGGGCGC ATTCGGTTAC CCAAGAGGCA
ACCAATCAGT TGTTGGTCAC TCTGGCACAG CAAGGAAAAC GGGTCGTACG GCTAAAAGGT
GGCGATCCCT TTATTTTTGG TCGGGGGGGG GAAGAGTTAC AAGTTGTGGC GCAAGCGGGG
ATCCCGTTTC AGGTGGTACC GGGGGTGACG GCTGCTGCGG GGGCGACAGC CTATGCCGGT
ATTCCCCTGA CACACCGCGA TCATGCGCAA AGCGTGACAT TTATTACCGG GCATTGCCGT
CCTGATGGTG ATGATCTTGA TTGGCAGACG CTGGCCCGTG GTCGTCAGAC GCTGGCGATC
TACATGGGAA CAGTGAAAGC GGCGGCAATC AGCCAACAGT TGATCGCCCA TGGCCGGTCC
AGTACCACGC CGGTGGCAGT GATTGGCCGT GGAACCCGTG TGGACCAGCA GGTGCTGATC
GGTACGTTGG CACAACTTGA ATCACTCGCA CAGCAGGCAC CGACACCAGC ACTACTGGTG
ATTGGTGAAG TGGTGAATTT ACATCACCAA ATTGCCTGGT TTGGGCAACA ACCGCAGACT
GAATCGGCGA TCAGCCCATC AGTCGTGAAT TTGGCATAA
 
Protein sequence
MDYLPLFADL KQRPVLIVGG GEVAARKIEL LHRAGAQVWV VAQTLSSELE QQYQDGRIHW 
LAQDFLPEQL DNVFLVIAAT NDTVLNAAVF AAADQRCILA NVVDDQPLCS FIFPSIVDRS
PLVVAISSSG QAPVLARILR EKLEALLPTR LSDMAAIAGR WRGRVKQHMA SMGERRRFWE
HAFSGRFASL ISRGQLTEAE NELQLSLEGQ HRALGEVALV GAGPGDAGLL TLRGLQVMQQ
ADVVLYDHLV SPEVLDLVRR DAERICVGKR AGAHSVTQEA TNQLLVTLAQ QGKRVVRLKG
GDPFIFGRGG EELQVVAQAG IPFQVVPGVT AAAGATAYAG IPLTHRDHAQ SVTFITGHCR
PDGDDLDWQT LARGRQTLAI YMGTVKAAAI SQQLIAHGRS STTPVAVIGR GTRVDQQVLI
GTLAQLESLA QQAPTPALLV IGEVVNLHHQ IAWFGQQPQT ESAISPSVVN LA