Gene YpAngola_A1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1785 
SymbolmenD 
ID5800256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1848041 
End bp1849744 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content57% 
IMG OID641339719 
Product2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene-1-carboxylate synthase 
Protein accessionYP_001606274 
Protein GI162421028 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1165] 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase 
TIGRFAM ID[TIGR00173] 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylic-acid synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.468181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACAA GTGTTTTTAA CCGTCGTTGG GCGGCACTGC TACTGGAGGC TTTGACCCGC 
CACGGTGTGC GTCATATCTG TATTGCCCCA GGGTCTCGCT CTACACCGTT AACGTTAGCG
GCTGCCGCTA ATCCGTCGTT GGTTTGCCAT ACCCATTTTG ATGAGCGCGG TCTGGGCCAT
CTGGCTCTGG GGCTGGCGAA GGCCTCAACA GAACCGGTGG CGGTCATTGT CACCTCGGGT
ACGGCGGTGG CGAATCTTTA TCCTGCGCTA ATCGAAGCGG GGTTAACGGG TGAGCGGCTG
ATATTATTGA CCGCTGATCG CCCACCGGAG TTAATCGATT GCGGTGCCAA TCAGGCTATC
CGCCAGCAGG GGCTGTTTGC CAGCCATCCA ACCCTAAGTG TTAATCTCCC CAGACCGACG
CCCGATATCT CTGCCCGTTG GTTGGTTTCC ACACTCGACA GCGCGATGGC GCAATTGCAA
CATGGCGCTT TGCACATTAA TTGTCCCTTT GCTGAACCAC TGTATGGGGG GGATGAGCAA
CAGTATGCCG ACTGGTCTGC CTCGCTGGGC GATTGGTGGC AGGATTGTCA CCCATGGCTG
CGTCAAACCT GTTACCCGCC CTCTCTTTAT CAACCACCAG CACAGCAAGC TGACTGGTTT
TTCTGGCGGC AAAAACGTGG CGTCGTGATT GCCGGACGGA TGGGCGCGCA AGAGGGTAGG
CAACTGACCG CATGGGCTGC AATGCTGGGC TGGCCACTGA TTGGCGATGT ATTGTCACAA
ACCGGTCAGC CATTGCCTTG TGCTGACTTA TGGCTGGCGC ACCCACGGGC GCAAGAAACC
CTCGCTCAAG CACAGATGGT GTTGCAATTT GGCAGCAGTC TGACCAGTAA GCGCCTTTTA
CAATGGCAGA CCGCATGTCA GCCACAGGAA TACTGGCTGG TGGATAGCGC CCCAGGCCGC
CTTGATCCTG CTAACCATCG TGGCCGACGT ATTATTTGCC CGGTGGGCGA GTGGTTGAGT
CGGCACCCCG CGCAGCGGCG TACACCTTGG GCCACGGAAC TGGCGGCATA TTCGGAAAGT
GCGCAAGCAC AGGTGATCGA GACGCTGGCC GGCCAATTCA GTGAGGCCGC CGTGGCACAT
CAGCTCGCGG AATTATTACC GGATAACGGC CAGCTATTTG TTGGCAACAG TTTGATTGTC
CGGTTGATTG ATGCGCTGGG GCAGCTACCC GCAGGCTATC CGGTTTACAG CAATCGGGGG
GCCAGCGGTA TCGATGGTTT GCTGTCAACC GCCGCCGGTG TACAGCGCGC GACGGCGAAA
CCGACATTAG CTATCGTCGG TGATTTGTCT GCCTTATATG ACCTTAATGC GCTGGCACTT
TTGCGCCAGA GCTCGGCACC GATGGTGCTA CTGGTGATCA ATAACAATGG GGGGCAGATT
TTCTCTTTAT TGCCAACGCC AGAGGCCGAG CGCCAGCGTT TCTACTGTAT GCCGCAGGAT
GTGAACTTCG AACACGCGGC GGTTATGTTC AGCTTAGGTT ATGCTCGCCC TAATAGCTGG
CCTCAGCTAC GGGAGCTAGT GCATCAATGC TGGCTACGGG GGGGAACCAC GCTGATTGAA
GTCCAGGTGC CACCAAGTCA GGGGGCAGAA ACACTGCAAC AGTTAGTACA ACAGGTGACA
TTAATACCGC AGGTGGCCCC ATGA
 
Protein sequence
MSTSVFNRRW AALLLEALTR HGVRHICIAP GSRSTPLTLA AAANPSLVCH THFDERGLGH 
LALGLAKAST EPVAVIVTSG TAVANLYPAL IEAGLTGERL ILLTADRPPE LIDCGANQAI
RQQGLFASHP TLSVNLPRPT PDISARWLVS TLDSAMAQLQ HGALHINCPF AEPLYGGDEQ
QYADWSASLG DWWQDCHPWL RQTCYPPSLY QPPAQQADWF FWRQKRGVVI AGRMGAQEGR
QLTAWAAMLG WPLIGDVLSQ TGQPLPCADL WLAHPRAQET LAQAQMVLQF GSSLTSKRLL
QWQTACQPQE YWLVDSAPGR LDPANHRGRR IICPVGEWLS RHPAQRRTPW ATELAAYSES
AQAQVIETLA GQFSEAAVAH QLAELLPDNG QLFVGNSLIV RLIDALGQLP AGYPVYSNRG
ASGIDGLLST AAGVQRATAK PTLAIVGDLS ALYDLNALAL LRQSSAPMVL LVINNNGGQI
FSLLPTPEAE RQRFYCMPQD VNFEHAAVMF SLGYARPNSW PQLRELVHQC WLRGGTTLIE
VQVPPSQGAE TLQQLVQQVT LIPQVAP