Gene YpAngola_A0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0476 
Symbol 
ID5798939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp495607 
End bp497097 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID641338481 
Productputative sugar ABC transporter periplasmic sugar-binding protein 
Protein accessionYP_001605073 
Protein GI162419105 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.251819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0934917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATAT TGTTAGAAGT CCGTGGCTTA TCTGTTGAGT TTCCTGGTGT TAAGGCATTG 
GACTCCGTAG ATTTTTCTCT GCAACGCGGT GAGGTTGTTG CGTTATTAGG AGAGAATGGC
GCAGGAAAAT CGACATTAAT AAAAGCATTA ACCGGGGTTT ATAAACGATC CGCTGGTGAG
GTGCTGTTGG ACGGCAAGGC TGTGTGCCCC ATTGATACCG CAGATGCCCA ACTCATGGGG
ATTGGTACTG TCTATCAAGA GGTCAACTTA CTGCCGAACA TATCTGTTGC TGCGAATTTA
TTTATTGGCC GTGAGCCGCT ACGTTGGGGG CTAATTGATC ACAATAAAAT GAACCAGCAA
GCAGCAAAAT TGCTGACGGG TTATGGTTTA ACGTTAGATG TTCAGCAGCC TTTGGCTAAT
TTCTCTATTG CAATACAACA GATTGTAGCA ATCGCCCGCG CCGTTGACCT TTCCGCGAAG
GTGCTGATTC TGGATGAGCC TACAGCAAGT TTAGATGCTA AAGAAGTCAG TATGTTACTG
GATATCTTGT GCCAGCTACG GGACCAGGGT ATCGGTATGG TTTTTGTGAC CCATTTTCTC
GATCAGGTTT ATCGAATCAG TGATCGCATT ACAGTTTTGC GAAATGGCAA GCTCGTCGGT
ACCAAAACTG TGGCAGAACT TCCCCGGATT GAGCTAGTGC AAATGATGCT GGGGCATAGT
TTTGATGAAC AATTACTGAA ACGTGGTGAA CACAGTATTA CGAATAAGAA TCCATTGGTT
GAGTTTAAAA ACTATGGCCG ACGGGGTGTG GTAGAGAATT TTGATTTGTC TGTATCTCCC
GGTGAAATTG TCGGGTTGGC CGGGTTATTA GGTTCAGGAC GGACTGAAAC AGCACAGCTA
ATTTTTGGCA TAACGACACC TGATACGGGG GAAGCCAAAA TACAAGGCAA ACCGGTTAAA
ATTAGAACAC CACGTAAGGC ATCAAAATTT GGTTTTGGCT ACTGCCCAGA AGATCGCAAA
ACAGAGGGTA TTGTTGGTGC GGCAACCGTA AGGGAAAACA TCATTTTAGC CTTACAAGCG
CAGCGTGGGT GGCTGAGGCC CCTTTCTATG CGTGAGCAGA CACAAATTGC AGACGATTTT
ATCCAGCAAC TGGGTATTCG TACCCCAAGC CCTGAACAGC AAATTCAATA TCTCTCCGGT
GGAAACCAAC AAAAAGTATT ACTTGCCCGC TGGCTTGCCA CGAAACCCCG ATTTTTAATT
TTAGATGAAC CGACTCGTGG TATCGATGTA GGTGCTCACG CCGAAATTAT TCGGTTGATC
GAAAAGTTGT GTGATGAAGG GTTGGCGCTG TTAATTATCT CTTCCGAATT AGAAGAATTG
GCGGGTTACG CTGATCGGAT CATCGTTCTT CGTGATCGTC GGCATGTTGC TCAACTCGAC
CACGATGAAA TTTCCGTTCC TGCCATTATG CAGGCGATCG CGGTGCAATA A
 
Protein sequence
MEILLEVRGL SVEFPGVKAL DSVDFSLQRG EVVALLGENG AGKSTLIKAL TGVYKRSAGE 
VLLDGKAVCP IDTADAQLMG IGTVYQEVNL LPNISVAANL FIGREPLRWG LIDHNKMNQQ
AAKLLTGYGL TLDVQQPLAN FSIAIQQIVA IARAVDLSAK VLILDEPTAS LDAKEVSMLL
DILCQLRDQG IGMVFVTHFL DQVYRISDRI TVLRNGKLVG TKTVAELPRI ELVQMMLGHS
FDEQLLKRGE HSITNKNPLV EFKNYGRRGV VENFDLSVSP GEIVGLAGLL GSGRTETAQL
IFGITTPDTG EAKIQGKPVK IRTPRKASKF GFGYCPEDRK TEGIVGAATV RENIILALQA
QRGWLRPLSM REQTQIADDF IQQLGIRTPS PEQQIQYLSG GNQQKVLLAR WLATKPRFLI
LDEPTRGIDV GAHAEIIRLI EKLCDEGLAL LIISSELEEL AGYADRIIVL RDRRHVAQLD
HDEISVPAIM QAIAVQ