Gene YpsIP31758_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4106 
Symbol 
ID5387340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4631862 
End bp4634354 
Gene Length2493 bp 
Protein Length830 aa 
Translation table11 
GC content46% 
IMG OID640867136 
Productfimbrial usher protein 
Protein accessionYP_001403050 
Protein GI153949755 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATATA ATTCTAAAAG AAAAAAAACC ATCTTCCTAA TGGTGAAAGT ATTAACAATA 
ATTTTAGTAT GGTTATTTTT GCCTGAATCG ACTGCAGTAG TAAAATTCAA TACCAATATT
ATCGATGCTA AAGACCGTAG TAATATCGAT CTCTCTCGTT TTGAGGTTGA TGATTACACC
CCACCGGGGA ATTATCTCCT TGATATTCTT ATTGATGATA GGCTGTTACC TGAACGTTAT
TTAGTGACTT ATCTCGCTGT TGATGAAGGG AAATCAACGA AGCTCTGTTT GACGCCTGAC
TTAGTTAATC TATTTGGTTT ATCTACGGAA GTACGTGAGT CGATGACATT ATGGAATAAC
GACAAGTGTG TTGCTATTGA TGAAAAAAAA GAGATAAAAA TTCAGTACGA TAAAGAGAAG
CAATCCTTAA TTATTTCTAT TCCCCAAGCT TGGCTGGCTT ATAACGATCC CAATTGGGTG
CCGCCTTCAC AATGGGGAAA CGGTGTTGCT GGTACTTTAT TGGATTACAA TTTGTTTGGT
TATCATTACT CACCGAATAT GGGCGGCAGT ACCACAAATT TCAGCAGCTA CGGCACTACC
GGGGCTAATA TGGGACCATG GCGTATTCGT GCAGATTACC AATATATCAA TACAGAAACG
GCGGGCGAGC ACTACCGTAA TTTTGATTGG TCGCAAGTGT ATGCTTTTCG AGCCATCCCC
TCGATAGGGG CTAAATTTGT CGGTGGCCAG ACTTATCTCA ACTCCAGTAT TTTTGATTCA
TTCCGTTTTC TGGGCACTTC ACTTTCCAGT GATGAGCGCA TGTTGCCACC GACGTTACGC
GGCTATGCAC CACAGGTAAT GGGCATTGCT CACACCAACG CCCGCGTGGT ATTGAGCCAG
AATGGGCGGG TGCTTTATCA GACTAACGTT GCACCAGGCC CCTTTGTTAT TCAAGACATT
AGCGAAGCCG TACAAGGGAA TATTGATGTC CGCGTGGAAG AGGAAGATGG CCGGGTCACC
GTGTTTCAAG TCAACGCCGC GAGCGTGCCA TTTTTAACCC GTAAAGGTGC CGTGCGCTAT
AAAGCTGCGT TGGGTCGCCC GATGCTGGGT AATTCAGCCA GTAATCCGAC ATTCTTTAGT
GGCGAATTCT CGTGGGGCGC ATTTAACCAT GTTTCATTAT ACGGTGGGCT GATGACGACT
TCGCAGGATT ACACCTCGGC TGCGTTGGGC ATCGGGCAAA ATTTATATGA CTTCGGCGCA
CTGTCTATTG ATATCACCCA TTCCCGTGCG CAGTTACCAA ATGAAGAACA GCAGAACGGG
GAAAGCTATC GCGTTAATTA TTCCAAACGT TTTGAGCAGA CTGACAGCCA GATTAGCTTT
GCCGGATACC GTTTCTCGAA GAAGAATTTT ATGAGTATGA GCCAGTATTT GGATTGGCTA
AACGGCAATA CTGCTCTGCA ATATGACAAG CAGGCTTATA CCGTGGCAGC TAACCAGTAT
CTGGCCTGGC CGGATATCAC GATGTATTTA TCGGTGACAC GTAGAACCTA TTGGAATGCG
GCCTCCAGTA ACAACTACAG TCTATCCATG AGCAAGATTT TTGATATCGG TACTTTTAAG
GGTATTTCGG CGACGATATC CGCTAATAAG GTGAATAATC AGTATGCCAA TGAGAATCAA
ATGTTCTTCT CACTCAGCGT ACCGATCGGC ATAGGCCAGC AGGCCAGCTA TGATGCACAG
CGAGGCCGCA ATACCGGCTA CACGCAAAAT ATCTCCTATT TCAACAACCA GAATCCGAAA
AATATTTGGC GTATCAGCGC GGGTGGCGGT AACCCAGAAC TGCAAAAAGG TAATGGTGTG
TTCCGTGGTG GCTATCAACA TAGCTCGCCT TATGGTGAAT TTGGTCTTGA TGGCAGTCAT
AAAAATAATG AGTACAACTC AATCAATACC AACTGGTATG GCTCAATTAC GGCAACCGCT
TATGGGGTTG CTGCCCACCA GAATAAAGCG GGCAATGAAC CAAGAATAAT GGTTGATACC
GGGGATGTCG CAGGGGTGTC GCTGAATAAT AACTCGGCAG TGACGAACCG TTTTGGTGTG
GCGGTGGTCA GTGGCGCAAC CAGCTACCAA CAGTCTGATA TTCGGGTGGA TGTGCAGAAT
CTGCCGGATG ATATTGAGGT CTACAACACC GTTATCCAAA AAACGCTGAC CGAAGGGGCA
ATTGGTTACC GTGAGATAAG GGCGGTAAAA GGTCGGCAAA TGATGGCGAT TATTCGCCTG
AAAGATGGCA GTTCCCCCCC CTTGGGGGCA TCCGTTATCA CGGACAAAAC GGGCGCTGAA
GTTGGAATTG TGGGAGACGA TGGCCTGACG TATTTGGCGG GATTACAAGA CACTGAGAGG
CTGACTGTTC AATGGGGGAA AAAACAGTGC ACGCTCATAT TGCCAAAAGA TAAAGGAATG
AACTCAGGAA AGGTACTACT GCCCTGCCAG TAA
 
Protein sequence
MPYNSKRKKT IFLMVKVLTI ILVWLFLPES TAVVKFNTNI IDAKDRSNID LSRFEVDDYT 
PPGNYLLDIL IDDRLLPERY LVTYLAVDEG KSTKLCLTPD LVNLFGLSTE VRESMTLWNN
DKCVAIDEKK EIKIQYDKEK QSLIISIPQA WLAYNDPNWV PPSQWGNGVA GTLLDYNLFG
YHYSPNMGGS TTNFSSYGTT GANMGPWRIR ADYQYINTET AGEHYRNFDW SQVYAFRAIP
SIGAKFVGGQ TYLNSSIFDS FRFLGTSLSS DERMLPPTLR GYAPQVMGIA HTNARVVLSQ
NGRVLYQTNV APGPFVIQDI SEAVQGNIDV RVEEEDGRVT VFQVNAASVP FLTRKGAVRY
KAALGRPMLG NSASNPTFFS GEFSWGAFNH VSLYGGLMTT SQDYTSAALG IGQNLYDFGA
LSIDITHSRA QLPNEEQQNG ESYRVNYSKR FEQTDSQISF AGYRFSKKNF MSMSQYLDWL
NGNTALQYDK QAYTVAANQY LAWPDITMYL SVTRRTYWNA ASSNNYSLSM SKIFDIGTFK
GISATISANK VNNQYANENQ MFFSLSVPIG IGQQASYDAQ RGRNTGYTQN ISYFNNQNPK
NIWRISAGGG NPELQKGNGV FRGGYQHSSP YGEFGLDGSH KNNEYNSINT NWYGSITATA
YGVAAHQNKA GNEPRIMVDT GDVAGVSLNN NSAVTNRFGV AVVSGATSYQ QSDIRVDVQN
LPDDIEVYNT VIQKTLTEGA IGYREIRAVK GRQMMAIIRL KDGSSPPLGA SVITDKTGAE
VGIVGDDGLT YLAGLQDTER LTVQWGKKQC TLILPKDKGM NSGKVLLPCQ