Gene YpAngola_A4056 details


Gene Information

Locus tag: YpAngola_A4056
Symbol: dppA
ID: 5802535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4316457
End bp: 4318064
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 49%
IMG OID: 641341839
Product: periplasmic dipeptide transport protein
Protein accession: YP_001608345
Protein GI: 162420607
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
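
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4316457
end_bp = 4318064

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1608
print(protein_length_aa)  # 535
```

Both results match the Gene Length (1608 bp) and Protein Length (535 aa) reported in the record.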


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATTT CCTTGAGAAG AACAGGGATA CTGAAATTCG GTATTGGGCT GGTTGCACTG 
ACTATAGCGG CCAGTGTACA GGCAAAAACG CTAGTATATT GTTCTGAGGG GTCACCGGAA
GGTTTCAACC CACAACTCTT TACTTCTGGC ACAACTTATG ACGCAAGCTC GGTACCTATT
TACAATCGTT TAGTGGAGTT CAAAATCGGG ACTACCGAAA TTGAACCCAG CCTGGCGGAG
CGGTGGGAAG TGAGCGAAGA TGGTAAAACC TACACTTTCT ACCTGCGTAA GGGTGTGAAA
TGGCAGGATA ATAAAGATTT CAAACCCACC CGCGATTTCA ATGCTGATGA TGTTATCTAT
TCATTCATGC GTCAGAAAGA TGATAAAAAC CCGTACCATA AAGTCTCTGG TGGCAGCTAT
GAATACTTCC AGGGTATGGG AATGGGCGAC TTAATCACCA ATGTGGTGAA AGTTGACGAC
AATACCGTTC GCTTTGAACT GACCCGTCCG GAATCGCCAT TCTTGGCAGA CCTGGCGATG
GATTTCGCCT CTATCCTGTC CGCTGAATAC GCCGACAATA TGCTGAAAGC AGGCACCCCG
GAAAAAGTGG ATTTGAACCC AATCGGTACC GGCCCATTCC AACTGCAACA GTACCAAAAA
GACTCCCGTA TTCTGTATAA AGCGTTTCCT GGTTTCTGGG GCACCAAACC AAAAATTGAT
CGCCTGGTCT TCTCTATCAC CCCAGATGCC TCCGTTCGCT ATGCTAAATT GCAGAAAAAC
GAATGTCAGA TTATGCCTTA CCCGAACCCG GCTGACATCG CCCGGATGAA AGAAGACAAA
ACGATTAACC TGATGGAACA ACCGGGTCTG AACGTCGGTT ACCTCTCCTT CAACATTGAG
AAGAAACCAC TCGATAACCT TAAGGTCCGT CAGGCACTGA CCATGGCGGT TAACAAAGAC
GCGATTATTG ATGCGGTTTA TCAAGGCGCG GGGCAAGCGG CCAAAAACCT GATCCCACCA
ACGATGTGGG GCTACAACGA TGATGTGAAA GATTACGCTT ACGATCCCGC TAAAGCGAAA
GAACTGCTGA AAGAAGCGGG TCTGCCAGAT GGCTTCTCCA TCGACCTGTG GGCCATGCCG
GTTCAACGTC CGTATAACCC AAATGCGCGT CGTATGGCGG AAATGATCCA GTCTGATTGG
GCGAAAATTG GTGTGAAAGC CAAGATCGTG ACCTATGAGT GGGGCGAATA CCTCAAGCGT
GCCAAAGATG GCGAACATGA AACTGTGATG ATGGGTTGGA CCGGGGACAA TGGGGACCCA
GACAACTTCT TCGCTACGCT GTTCAGTTGT GATGCGGCCA AGCAGGGTTC TAACTACTCT
AAATGGTGTT ATAAGCCGTT TGAAGATTTG ATCCAACCAG CCCGTGCTGA AGCTGACCAT
GACAAACGTG TCGCACTTTA CAAACAAGCT CAGGTTGTGA TGAACGAGCA GGCCCCGGCG
CTGATCATTG CTCACTCAAC GGTGTACGAG CCAGTGCGTA AAGAAGTGAA AGGCTATGTT
GTGGACCCAT TGGGTAAACA TCATTTCGAT AACGTGTCCC TGGATTAA
 
Protein sequence
MTISLRRTGI LKFGIGLVAL TIAASVQAKT LVYCSEGSPE GFNPQLFTSG TTYDASSVPI 
YNRLVEFKIG TTEIEPSLAE RWEVSEDGKT YTFYLRKGVK WQDNKDFKPT RDFNADDVIY
SFMRQKDDKN PYHKVSGGSY EYFQGMGMGD LITNVVKVDD NTVRFELTRP ESPFLADLAM
DFASILSAEY ADNMLKAGTP EKVDLNPIGT GPFQLQQYQK DSRILYKAFP GFWGTKPKID
RLVFSITPDA SVRYAKLQKN ECQIMPYPNP ADIARMKEDK TINLMEQPGL NVGYLSFNIE
KKPLDNLKVR QALTMAVNKD AIIDAVYQGA GQAAKNLIPP TMWGYNDDVK DYAYDPAKAK
ELLKEAGLPD GFSIDLWAMP VQRPYNPNAR RMAEMIQSDW AKIGVKAKIV TYEWGEYLKR
AKDGEHETVM MGWTGDNGDP DNFFATLFSC DAAKQGSNYS KWCYKPFEDL IQPARAEADH
DKRVALYKQA QVVMNEQAPA LIIAHSTVYE PVRKEVKGYV VDPLGKHHFD NVSLD
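
As a quick illustration of how the gene sequence maps to the protein sequence, the sketch below translates the first three 10-base blocks of the gene sequence. It assumes the sequence is read 5' to 3' in frame from the first base, and it uses a deliberately partial codon table containing only the codons that occur in this fragment. For these codons, translation table 11 gives the same amino acids as the standard genetic code. The names `first_blocks` and `codon_table` are illustrative.

```python
# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGACGATTT CCTTGAGAAG AACAGGGATA"

# Remove the display spacing to recover a contiguous DNA string.
dna = "".join(first_blocks.split())

# Partial codon table: only the codons present in this fragment.
codon_table = {
    "ATG": "M", "ACG": "T", "ATT": "I", "TCC": "S", "TTG": "L",
    "AGA": "R", "ACA": "T", "GGG": "G", "ATA": "I",
}

# Split into consecutive codons and translate each one.
codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
peptide = "".join(codon_table[codon] for codon in codons)

print(peptide)  # MTISLRRTGI
```

The result agrees with the first ten residues of the protein sequence listed above. Translating the full gene would need a complete codon table, for example from a sequence-analysis library.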