Gene YpAngola_A1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1953 
SymbolaroA 
ID5800423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2030318 
End bp2031604 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID641339877 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001606427 
Protein GI162418202 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000347896 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.405463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAAT CCCTGACCTT ACAACCCATT GCCCTAGTTA ATGGCACCGT TAATTTACCT 
GGTTCGAAGA GTGTCTCTAA CCGCGCACTG CTTCTGGCCG CGTTGGCCGA AGGGACCACT
CAGTTGAATA ACGTGTTAGA CAGCGATGAC ATCCGCCACA TGCTCAATGC ATTACAGGCA
TTAGGGGTGG ACTTCCGCCT TTCTGCTGAT CGCACATGCT GTGAGGTTGA TGGTCTGGGG
GGGAAATTAG TGGCTGAACA GCCATTGTCG CTTTTCTTGG GCAATGCCGG CACAGCCATG
CGTCCTTTGG CCGCGGTGTT ATGTTTGGGT AATAGCGATA TCGTACTGAC GGGTGAGCCT
CGGATGAAGG AGCGGCCAAT TGGCCATTTG GTGGATGCGC TACGTCAGGG CGGTGCACAG
ATCGATTATC TGGAACAAGA AAATTACCCG CCATTACGTT TACGTGGTGG TTTCCGAGGG
GGGGAGTTAA CTGTTGATGG GCGTGTCTCT AGCCAGTTCC TGACTGCTTT ATTGATGACC
GCCCCGCTGG CGGAGCAAGA TACGACTATT CGGATTATGG GTGATCTGGT TTCCAAACCT
TATATCGATA TTACTCTGCA CTTGATGAAA GCATTTGGTA TTGACGTGGG GCATGAGAAC
TACCAAATTT TCCACATCAA AGGGGGCCAG ACCTACCGCT CACCAGGGAC TTATTTGGTT
GAGGGCGATG CCTCGTCGGC TTCCTACTTC TTAGCGGCTG CGGCTATTAA GGGGGGAACA
GTGCGTGTCA CTGGTATTGG CAAGAAAAGT GTACAGGGCG ACACTAAATT TGCCGATGTG
TTGGAAAAAA TGGGCGCGAA AGTGACGTGG GGGGATGATT ATATCGAGTG CAGTCGTGGT
GAATTACAGG GCATTGACAT GGATATGAAC CACATTCCTG ATGCTGCAAT GACCATTGCG
ACTACGGCAT TATTTGCCAC GGGCCCAACG ACGATCCGCA ATATCTACAA CTGGCGGGTA
AAAGAAACTG ACCGGCTGAC GGCGATGGCA ACCGAGTTGA GAAAAGTAGG TGCTGAAGTG
GAAGAGGGGG AAGATTACAT CCGCGTGGTT CCACCCTTGC AGCTAACTGC TGCAGATATT
GGTACCTACG ATGACCACCG TATGGCGATG TGTTTCTCGC TGGTCGCGTT ATCAGATACC
CCCGTGACGA TCCTTGACCC GAAATGTACC GCAAAAACCT TCCCTGATTA TTTTGAACAG
TTTGCGCGTC TGAGCCAACT GGCCTGA
 
Protein sequence
MLESLTLQPI ALVNGTVNLP GSKSVSNRAL LLAALAEGTT QLNNVLDSDD IRHMLNALQA 
LGVDFRLSAD RTCCEVDGLG GKLVAEQPLS LFLGNAGTAM RPLAAVLCLG NSDIVLTGEP
RMKERPIGHL VDALRQGGAQ IDYLEQENYP PLRLRGGFRG GELTVDGRVS SQFLTALLMT
APLAEQDTTI RIMGDLVSKP YIDITLHLMK AFGIDVGHEN YQIFHIKGGQ TYRSPGTYLV
EGDASSASYF LAAAAIKGGT VRVTGIGKKS VQGDTKFADV LEKMGAKVTW GDDYIECSRG
ELQGIDMDMN HIPDAAMTIA TTALFATGPT TIRNIYNWRV KETDRLTAMA TELRKVGAEV
EEGEDYIRVV PPLQLTAADI GTYDDHRMAM CFSLVALSDT PVTILDPKCT AKTFPDYFEQ
FARLSQLA