Gene YpAngola_A2602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2602 
SymbolaroH 
ID5801074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2725087 
End bp2726133 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content51% 
IMG OID641340471 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001607010 
Protein GI162420455 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.823041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAGA TAGATGAACT GCGGACCGCT CGCATCGATA GCTTGATTAC ACCGCAGCAA 
CTGGCTGAAA AGTTACCGAT TTCTGAGGTT ATTGCAGATA ACGTGACGGC GTCACGTAAA
CGAATAGAGA AAATACTTAT TGGTGAAGAC CCACGTCTAC TCGTGGTGAT TGGCCCCTGC
TCTATTCACG ACCTTGATGC AGCCGTTGAT TATGCCACCC GGCTCAAGGT GCTACGAGAA
CGCTATCAAG ACCGGCTGGA AATCGTGATG CGCACCTATT TCGAGAAACC ACGGACTGTA
GTGGGTTGGA AGGGGCTGAT TTCTGATCCG GCACTTGACG GCTCATGCCA GGTGAACTTG
GGTATTGAAC TGGCACGTAA GCTACTGTTA GCCGTGAATG AACTCGGGCT GCCGACCGCT
ACCGAGTTCC TCAATATGGT AACAGGCCAA TATATTGCCG ACCTCATCAG TTGGGGGGCA
ATAGGCGCAC GTACCACCGA AAGCCAGATC CACCGAGAGA TGGCCTCGGC ACTCTCCTGC
CCCGTGGGTT TCAAAAATGG TACAGATGGC AATGTGCGTA TTGCTATTGA TGCCATTCGC
GCCGCACAAG CCAGCCATAT GTTCCTTTCT CCGGATAAAA CCGGCCAAAT GACGATTTAC
CAAACCAGTG GTAACCCCTA TGGGCATATT ATTATGCGGG GTGGAAAGCA ACCTAACTAT
GATGCCTCTG ATATCGCAGC CGCCTGTGAC AGCTTGCGGG AATTTGATTT GCCAGAACAT
CTGGTGGTAG ATTTTAGCCA CGGCAATTGC CAGAAGATGC ATCGCCGCCA GTTGGATGTT
GCCGAAAATA TCGGGCTACA GATCCGTGCG GGTTCAACAG CGATTGTCGG TGTTATGGCT
GAGAGTTTCC TGATTGAGGG CACACAGAAG ATTGTTGCCG GACAGCCCTT AACTTATGGG
CAATCCATCA CTGACCCTTG CCTGAATTGG GATGATACTG AACAACTGTT AAGCCTATTG
GCAGATGCAG TAGACAGCCG GTTTTAA
 
Protein sequence
MYKIDELRTA RIDSLITPQQ LAEKLPISEV IADNVTASRK RIEKILIGED PRLLVVIGPC 
SIHDLDAAVD YATRLKVLRE RYQDRLEIVM RTYFEKPRTV VGWKGLISDP ALDGSCQVNL
GIELARKLLL AVNELGLPTA TEFLNMVTGQ YIADLISWGA IGARTTESQI HREMASALSC
PVGFKNGTDG NVRIAIDAIR AAQASHMFLS PDKTGQMTIY QTSGNPYGHI IMRGGKQPNY
DASDIAAACD SLREFDLPEH LVVDFSHGNC QKMHRRQLDV AENIGLQIRA GSTAIVGVMA
ESFLIEGTQK IVAGQPLTYG QSITDPCLNW DDTEQLLSLL ADAVDSRF