Gene YpAngola_A3484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3484 
SymbolaroF 
ID5801960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3699561 
End bp3700631 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content49% 
IMG OID641341301 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001607814 
Protein GI162420963 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.242874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAG ACTCGCTCAA TAACGTCCAT ATCAGTGCCG AACAAATCCT GATAACCCCG 
GAAGAACTGA AAAATCAGTT TCCACTGAGC GAAAATGATC AGTATTCGAT AGAGCGCGCA
CGTAAAACCA TTGCTGACAT TATTCAGGGG CGAGATCCGC GTCTGTTGGT CGTTTGTGGG
CCCTGTTCAA TTCATGATGT GGATGCGGCA CTGGATTACG CGCGTCGTTT GAAAAAACTC
TCTGTGGAAT TGGATGACAG CTTATATATC GTTATGCGTG TCTATTTTGA GAAGCCAAGA
ACTACCGTGG GTTGGAAAGG CCTGATCAAT GACCCTGCAA TGGATGGTTC ATTTGATGTA
GAGGCAGGTT TACACATTGC CCGTCGTTTA TTGCTGGATT TAGTGGGCAT GGGGTTGCCG
TTAGCGACTG AAGCTCTGGA TCCTAATAGC CCACAATATT TAGGTGACCT GTTCAGTTGG
TCGGCCATTG GTGCCCGTAC AACGGAGTCA CAGACCCACC GTGAAATGGC ATCAGGCTTG
TCTATGCCGG TTGGATTTAA AAATGGCACT GACGGTAGCC TAGGCACGGC AATCAATGCA
ATGCGCGCCG CTGCCATGCC ACATCGCTTT ATGGGGATCA ATCAGTCGGG CCAGGTCTGC
CTGTTACAAA CTCAGGGTAA CCCACACGGC CATGTCATTC TACGGGGAGG TAAAACACCA
AACTACAGTG CACAAGATGT CGCTCAGTGT GAAAAACAGA TGCAGGATGC GGGACTCATC
CCATCCTTAA TGATAGATTG CAGTCACGGT AATTCAAATA AAGACTACCG CCGTCAGGTT
GCGGTGGCTG AATCTGTGGT TGAACAGATC AAGGCGGGCA ATCGTTCAAT TACAGGTGTG
ATGCTGGAAA GCCACATCCA CGAAGGAAAT CAGTCATCTG AACAGCCACG TGCTGATATG
CGCTACGGTG TTTCTGTGAC TGACGCCTGT ATTAACTGGG AAAGCACTGA AACCCTGTTA
CGTGGTATGC GCCAAGAATT GCTTGCAGCA CTGACGGCAC GGACTGCATG A
 
Protein sequence
MQKDSLNNVH ISAEQILITP EELKNQFPLS ENDQYSIERA RKTIADIIQG RDPRLLVVCG 
PCSIHDVDAA LDYARRLKKL SVELDDSLYI VMRVYFEKPR TTVGWKGLIN DPAMDGSFDV
EAGLHIARRL LLDLVGMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLGTAINA MRAAAMPHRF MGINQSGQVC LLQTQGNPHG HVILRGGKTP
NYSAQDVAQC EKQMQDAGLI PSLMIDCSHG NSNKDYRRQV AVAESVVEQI KAGNRSITGV
MLESHIHEGN QSSEQPRADM RYGVSVTDAC INWESTETLL RGMRQELLAA LTARTA