Gene YpAngola_A4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4016 
Symbol 
ID5802495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4273666 
End bp4274979 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content48% 
IMG OID641341802 
Productcarbohydrate ABC transporter periplasmic-binding protein 
Protein accessionYP_001608309 
Protein GI162418278 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGC AAATGATAAT GTCAAAAGTA AAAATAAAGT CTTTGCTGTT GTTAACTTAT 
TTAGCCATGG GGATCCCTTC TCTTCACGCC GCAGAAACAT TGCAAGTGTG GATCCGTGCA
AGTAATGACT CTAAAAACAT TTATAAAAAA GAAGCTGACA CGTTTGAGAA AAAAACCGGT
ATCAAAATTG AATATTTCAA TGCCACGACT GATTTTGAAC AACGTCTGGC ACGGGCCGCT
GCGGGCAACG CGCTACCCGA TCTGATTTTC AACGATGCCG TGGCGATTGG TCAGTTTGTG
CAGCTAGGGA TTGTCGAACC GATTGAACCA AAAAATATTA TTGGCGGAGA GAACATCTTC
GATACCGCCT GGAAAAGTAC CCAATATATT GATGGAAAAT ATTACGGGGT TCCCACCTCC
GCTCAAACCT TTGCCTTATT TGTTCGCAAA GACTGGCGGG AAAAGTTGGG CTTAGAGCAA
CCGAAGAACT GGCAAGACAT TAGCACGCTG GCGAAAGCCT TTACCTTTAA TGACCCCGAT
GGCAACGGTA AAAATGATAC TTACGGCTTT ATTTTGCCTG GTTCAACAAC CCGTGGCTAC
GCCAGTTGGT TTATCAGCTC CTACCTTTGG CAGGCGGGGG GTGACTTTAT TCGCCCAGCC
GGTGAAGGTA AATTTATTGC CTCGCTGGAG GAGCCAGCAG CCACTGAAAC CCTGAGTTTC
ATGCGCGGAA TGGTGTGCGA TAAAACAGTC CAGCCGGGGG CTATCAATGC GACCACCGCT
GATGCGATCC CTTCATTCCG TTCTGGCCAA AGTGGCATGT TCTTCTCCGG CCCATATCAC
ATCGCGTTGT TTGATAAAGA TCCCGGCACC GAGGCTTTTG AAGTGATCCC GCCACCGACA
GGGCCAAAAG GACAGGCTAC GTTGGCTGAA GGCACTACGG TCTTTATGAT GAAAAGCAGC
CAGAAAAAAG CAGCCGCTCA GAAATTCATT GAGTTCATGA TCTCCCCTGA AGGGCAGCAG
ATCGGCATGG GAATGGGCAC CAAAAACATG CCAGTCGTGC GCTTGTCGAT CAATAAATTG
GTGGATACCA AAGCGGTCTA TGACGATCCA CGTTGGGCCA TATTCGCCGA TCTGTATGCC
GAACAGGGCC GCTATATTCC ACAAGTTCCT AACTGGACAC CTATTCGTCA GGTCACTGCT
GAAGGTTTCA ACCGTATTTT AGCCAACTGT GACAGTGATA TCGCCGCAGA ATTAAAAGCG
GTGAATCAGA AAGTTAATGA TGAGTTGGCT AAACAGAACG TCTTAGGGCA GTGA
 
Protein sequence
MKTQMIMSKV KIKSLLLLTY LAMGIPSLHA AETLQVWIRA SNDSKNIYKK EADTFEKKTG 
IKIEYFNATT DFEQRLARAA AGNALPDLIF NDAVAIGQFV QLGIVEPIEP KNIIGGENIF
DTAWKSTQYI DGKYYGVPTS AQTFALFVRK DWREKLGLEQ PKNWQDISTL AKAFTFNDPD
GNGKNDTYGF ILPGSTTRGY ASWFISSYLW QAGGDFIRPA GEGKFIASLE EPAATETLSF
MRGMVCDKTV QPGAINATTA DAIPSFRSGQ SGMFFSGPYH IALFDKDPGT EAFEVIPPPT
GPKGQATLAE GTTVFMMKSS QKKAAAQKFI EFMISPEGQQ IGMGMGTKNM PVVRLSINKL
VDTKAVYDDP RWAIFADLYA EQGRYIPQVP NWTPIRQVTA EGFNRILANC DSDIAAELKA
VNQKVNDELA KQNVLGQ