Gene YpAngola_A2140 details


Gene Information

Locus tag: YpAngola_A2140
Symbol:
ID: 5800610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2236766
End bp: 2237983
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 46%
IMG OID: 641340048
Product: putative endopeptidase
Protein accession: YP_001606593
Protein GI: 162421091
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
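
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 2236766
END_BP = 2237983
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2237983 - 2236766 + 1 gives 1218 bp, and 1218 / 3 - 1 gives 405 aa, matching the listed values.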


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.332227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.318783
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTAAAG AAAAAATTGA GCATTTAGAA GCAGTAAGCC GTAAAGCCCG CGTGGTCATG 
GAGCGTGAGG GGATTGATGC GCTGGTGGTG ACTGTTTGTG ACAATTTCTA TTATCTCACG
GGTTTTGCCA GCTTCTTTAT GTATACCTTC CGGCATACCG GTGCGGCGGT TGCGATCATG
TTTCGTGATG CGAATATCCC TTCACAAATC ATCATGAATG AATTTGAGGC TGCCAGTACG
CATTTTGATA TGCCGAATAG TGTACTGAAA ACGTTTCCGG TGTGGGTTGA TGTTGATGAC
CCGCGTAATC CGCATCATCA TTATAAAAAA CGTGATCGGC CTATTGGCCC ACCGGTGGAA
GCGGTCTTTA GTTTAGTTAA AAACGCACTT GAAGATGCGG GAGTGCTGGA TAAGACGATT
GCCATTGAAT TACAGGCGAT GTCAAACGGC GGTAAAGGTG TATTAGATAA AGTTGCACCT
GGGCTGAAAT TAGTCGATTC AACGGCATTG TTCAATGAAA TAAGAATGAT TAAAAGCCCG
TGGGAAATTG AACACCTACG AAAAAGAGCT GAAATCACTG AATATGGTAT TGCCAGCGCG
GCTAAAAAAA TACGGGTAGG GTGTACGGCA GCTGAATTGA CTGCTGCATT TAAAGCGGCG
GTAATGTCGT TCCCAGAAAC GAACTTTTCA CGCTTTAATC TGATCTCGGT GGGGGACAAT
TTCTCACCAA AAATAATCGC AGATACGACA CCGGCAAAAG TGGGGGATTT GATTAAGTTT
GACTGCGGGA TCGATGTTGC TGGCTACGGT GCTGATCTGG CAAGAACGTT TGTGCTCGGT
GAGCCGGATA AACTGACGCA GCAGATATAT GACACCATCA GAACGGGTCA TGAGCATATG
CTATCAATGG TGGCACCGGG GGTGAAATTA AAAGCGGTTT TTGACTCCAC GATGGCGGTG
ATTAAGACGT CAGGTTTACC TCATTATAAC CGGGGCCATC TTGGGCACGG TGATGGTGTG
TTTCTGGGCC TTGAAGAAGT GCCTTTTGTT AGCACACAGG CAACTGAAAC GTTTTGTCCC
GGTATGGTCT TAAGCCTTGA AACGCCTTAT TACGGCATTG GGGTTGGCTC AATTATGTTA
GAAGACATGA TCTTAATTAC TGACAGTGGC TTTGAGTTTT TAAGCAAACT GGATCGTGAC
TTACGTCGGT ATTTCTAA
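
The gene sequence above is laid out in space-separated blocks of 10 bases, six blocks per full line. Before any downstream use the whitespace has to be stripped. The sketch below is an illustration only, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the sequence are reproduced to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGGGTAAAG AAAAAATTGA GCATTTAGAA GCAGTAAGCC GTAAAGCCCG CGTGGTCATG"
last_line = "TTACGTCGGT ATTTCTAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

# The record has 20 full lines of 60 bases plus the final partial line.
total_length = 20 * len(first) + len(last)
assert total_length == 1218
```

Applying `gc_fraction` to the full cleaned sequence would give a value to compare against the GC content field listed under Gene Information.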
 
Protein sequence
MGKEKIEHLE AVSRKARVVM EREGIDALVV TVCDNFYYLT GFASFFMYTF RHTGAAVAIM 
FRDANIPSQI IMNEFEAAST HFDMPNSVLK TFPVWVDVDD PRNPHHHYKK RDRPIGPPVE
AVFSLVKNAL EDAGVLDKTI AIELQAMSNG GKGVLDKVAP GLKLVDSTAL FNEIRMIKSP
WEIEHLRKRA EITEYGIASA AKKIRVGCTA AELTAAFKAA VMSFPETNFS RFNLISVGDN
FSPKIIADTT PAKVGDLIKF DCGIDVAGYG ADLARTFVLG EPDKLTQQIY DTIRTGHEHM
LSMVAPGVKL KAVFDSTMAV IKTSGLPHYN RGHLGHGDGV FLGLEEVPFV STQATETFCP
GMVLSLETPY YGIGVGSIML EDMILITDSG FEFLSKLDRD LRRYF