Gene YpAngola_A4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4156 
Symbol 
ID5802636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4446712 
End bp4447896 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content52% 
IMG OID641341929 
Productsugar transport system permease 
Protein accessionYP_001608432 
Protein GI162421562 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0316098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAAG CTAATCAATC TGAATTTAAC TCACCAGAAA ATGGGGATAA AAAGCCATTT 
TTTCGGCTTA AATCCTTAAA TTTACAAGTT TTTGTCATGC TGGCCGCGAT CGCCATTATT
ATGTTGTTCT TCACCTTCAC GACAGAAGGG GCCTATCTCA GCGCCAGAAA TATCTCTAAC
TTGCTGCGCC AGACGGCGAT CACCGGCATC TTGGCGGTCG GTATGGTGTT TGTCATTATC
TCCGCCGAAA TTGACTTGTC TGTTGGCTCA ATGATGGGGT TATTGGGTGG CATAGCGGCC
ATTTTTGATG TCTGGCTCGG CTGGCCGTTG CCGCTGACCA TTGTCGTTAC GCTGGCGCTG
GGGCTGGTAT TAGGGGCATG GAACGGTTGG TGGGTCGCGT ATCGCAAAGT GCCTTCGTTT
ATTGTTACGC TCGCGGGCAT GTTGGCTTTT CGTGGCATTT TAATTGGTAT CACCAATGGC
ACGACGGTTT CGCCGACCAG TAATGCGATG TCGCAGATTG GCCAGAGCTA TCTGCCAAGT
GGCATTGGTT TTGGCATCGG TGCTATTGGC CTGATGTTGT TTGTGGCCTG GCAATGGCGT
CGGCGTAATC ACCGCATCCG CTTGGGGTTA CCGGTTGCCG CCCCGCAAGG GGATGTGACT
CGCCAAACTA TTACCGCAGT TATCGTTCTG GGTGCCATAT ATTTATTGAA TGATTATCGT
GGTGTACCTA CGCCGGTACT AATCCTTACT GCATTAATGC TGGCGGGGGT ATTTATGGCT
ACTCGCACCG CTTTTGGTCG CCGGATCTAC GCTATTGGCG GCAATATTGA TGCGGCGCGC
CTGTCTGGAA TTAATGTAGA GCGCACTAAA CTGGCAGTAT TCGCGATTAA CGGACTGATG
GTGGCGATGG CTGGTTTGAT CCTCAGTTCA CGTTTAGGCG CAGGTTCGCC TTCTGCGGGC
AATATCGCTG AACTGGATGC GATTGCGGCG TGTGTCATTG GCGGTACCAG CCTGGCCGGA
GGCGTTGGCA GTGTGGCGGG GGCCGTAATG GGCGCATTTA TTATGGCTTC TCTCGATAAT
GGGATGAGCA TGCTAGATGT GCCGACGTTC TGGCAGTACA TTGTCAAAGG CGCAATTTTG
CTGCTGGCCG TGTGGATGGA TTCCGCCACC AAACGGCGGG TGTGA
 
Protein sequence
MSQANQSEFN SPENGDKKPF FRLKSLNLQV FVMLAAIAII MLFFTFTTEG AYLSARNISN 
LLRQTAITGI LAVGMVFVII SAEIDLSVGS MMGLLGGIAA IFDVWLGWPL PLTIVVTLAL
GLVLGAWNGW WVAYRKVPSF IVTLAGMLAF RGILIGITNG TTVSPTSNAM SQIGQSYLPS
GIGFGIGAIG LMLFVAWQWR RRNHRIRLGL PVAAPQGDVT RQTITAVIVL GAIYLLNDYR
GVPTPVLILT ALMLAGVFMA TRTAFGRRIY AIGGNIDAAR LSGINVERTK LAVFAINGLM
VAMAGLILSS RLGAGSPSAG NIAELDAIAA CVIGGTSLAG GVGSVAGAVM GAFIMASLDN
GMSMLDVPTF WQYIVKGAIL LLAVWMDSAT KRRV