Gene YpAngola_A2508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2508 
Symbol 
ID5800978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2624441 
End bp2626045 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content46% 
IMG OID641340378 
Productchorismate-binding domain-containing protein 
Protein accessionYP_001606921 
Protein GI162419062 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000325173 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCTGCT CTGGAGAACC GCTTTACTAT GCGCAGTGTC AACAAACGAT CTGTGCTCCC 
CCACTGACAA TGGCGACAAA ATTAGTCCAG CGTTATCAAG AGGGTGCCCG CAATATCTCT
CACAGTAAAA GCGGCTCAGA AGGATATTCG TCACTCACTG ACAATGAAGA ATGGCAACCC
TACGCTATCT ATGAAAAAGA CGGTGAGTTC TCGGTCGGTA TTGGGCTGGC GGCACTGATA
ACTGCCTATC CTAATTATGT TCACATACGT TACCAGCAGC AAAATGCTCA TGGCGATAGC
TTTAGCGGCG AACAATTTCA AGAGCGTGTT TGGTTATCCT CCGACATCGT CGGTAATATT
TCATCCGCAC TGGCAAGTAT CCCGATTAAA GAGTGGCGAG CTTATGGTCT CAGCCAATTG
GAATTGGTTC ATCTCTTCCA TAACCCGGCG TCTCACGCGG CCCCCGGTAC CGCACTACTG
CAAATTTTCC TTCCGCTGCA TGAATATAGA CTCAATCGCG GCAGCGCTAT AATACGCAGT
TTGCTTCCCT CTCACTTACC CCAACTGCTT ACGATGCTGC AGCAATGTGA CAGTGACCAA
TATGACAACG CGCCCTCTAT CAACAAGTCA TCTATCTATA AATCATCTAG TTGTGAACCA
TCTATTTATG AGCCATCTAT TTATGAACAA CCTGGCGATG AATATGATGA AAATAATCAA
CGTGCCCATA AGCAAGACAG CCACCTGCAG ACACAAGTAG AAAGAAGCGC CATCGAAATG
CAGATACGCC AAACTGACCC TACTATTTTC TGTGATCGGG TAGCAAAAAC CGTTAATGAA
ATCAGGCAAG GTAAATATCA GAAAGCCATT TTATCGCGCC AGATACCGCT GCCAAATAAT
ATTAATTTAC TCGCAAGCTA CCAACGTGGG CGCATAAATA ACACCTCGGC ACGTTCCTAT
GCTTTCCGGA TGCAAGGATT TGAATTAATG GGCTTTAGCT CAAAGACTGC CGTGACTGTC
TCAGCCAATG GCTGTCTGAT CACCCAACTT TTAACCGATA CACATGCACT GTCGTCAGAC
CAAACTCAAT CGGTGCCACT TCATCATGAA TTACGAATTA ATACTAAAGA TATCACTGAA
CATACCAGCT CAATCCTCTC TGTTGTTGCA ACGCTAACAC CAATCTGTGT ACCAGGCTCA
GTTGCCATTG TTCCGTATAT GAAGGTGCTT ACTTGCGGTA AGGTGCAAAA TCTGGCCTCC
TGTCTGCAAG GCCAGCTACA AAAAGGCATC AGCCACTGGC AAGCTATGCA ATCCCTGTAT
CCTGTAGCTG CCGATATTCC TAAAGATCAT CTGATGCAAG CAACTCTTCA TGAACAGGGA
TCATGGGAGG CCTATAGCAG CAGCGTGCTA ATGGTTGACA GCAATGGTGC ATTGGATGCG
ACGCTAATTT CAGAGAGCCT TTCCCGTAAG AATAAAAGAT TTGGATTACG AGCCGGAACC
GAAATCACCC ATCAAGCAGA CCCCCTACTC AAGCTAGAAG AGACTCATGA GACGCTGATC
GCGATTGCCC GTTATCTGGT GTTGCAAACA GCAATGACCG ATTAA
 
Protein sequence
MPCSGEPLYY AQCQQTICAP PLTMATKLVQ RYQEGARNIS HSKSGSEGYS SLTDNEEWQP 
YAIYEKDGEF SVGIGLAALI TAYPNYVHIR YQQQNAHGDS FSGEQFQERV WLSSDIVGNI
SSALASIPIK EWRAYGLSQL ELVHLFHNPA SHAAPGTALL QIFLPLHEYR LNRGSAIIRS
LLPSHLPQLL TMLQQCDSDQ YDNAPSINKS SIYKSSSCEP SIYEPSIYEQ PGDEYDENNQ
RAHKQDSHLQ TQVERSAIEM QIRQTDPTIF CDRVAKTVNE IRQGKYQKAI LSRQIPLPNN
INLLASYQRG RINNTSARSY AFRMQGFELM GFSSKTAVTV SANGCLITQL LTDTHALSSD
QTQSVPLHHE LRINTKDITE HTSSILSVVA TLTPICVPGS VAIVPYMKVL TCGKVQNLAS
CLQGQLQKGI SHWQAMQSLY PVAADIPKDH LMQATLHEQG SWEAYSSSVL MVDSNGALDA
TLISESLSRK NKRFGLRAGT EITHQADPLL KLEETHETLI AIARYLVLQT AMTD