Gene YPK_4044 details


Gene Information

Locus tag: YPK_4044
Symbol:
ID: 6089673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 4460676
End bp: 4463327
Gene Length: 2652 bp
Protein Length: 883 aa
Translation table: 11
GC content: 47%
IMG OID: 641599141
Product: fimbrial biogenesis outer membrane usher protein
Protein accession: YP_001722759
Protein GI: 170026254
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
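
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative, not part of the database):

```python
# Values copied from the Gene Information record.
START_BP = 4460676
END_BP = 4463327


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2652 883
```

Both results agree with the Gene Length and Protein Length fields of the record.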


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.318083
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCAGG CACGGGTCAT TCTGAAAAAA AATTTCTCAG GCCGTAGGAA AGCATTAACA 
TTATGTATAA CTTTAATTTT ACATATTGAT ACTGCTTTTG GGCAGGAAGA ACCACAAAAT
TTCGAATTTG ATGAATCTCT TTTTTTAGGG ACAAAATATG CATCAGGTTT GACCCAGCTT
AATAAAAAAA ATTCTATAAC TGCCGGGAAT TATGATGCTG TAGATGTTTT GGTTAATAAC
AAATTGTTCA AACGTATGAG CGTGCAATTT ATTAAAGATG CCAACTCATC TGAGGTTTAT
CCCTGTCTGA GCGACGAGTT ATTAACCGCG GCTGGCGTTG AACTGGGTAG AGAAAATAGC
ACCCCTCCCA AAGAACCCCA TGTTACCGAG GCTAATACTC CCATAACTGA AACTCACGCC
CCAACGAACC AATGTTTACC TCTGTCTACG CGGGTAAAAG GGGCATCCTT TCGATTTGAT
CAGGCTAAAT TACGTTTAGA GCTTTCGATT CCACAGGCTC TATTACAAAA ACGGCCCAGG
GGCTACATTG AGCGGGCGGA ATGGCAGGAA GGCGAAAAAC TCGCATTCAT TAACTACAGC
GCCAACGCTT ATCGCAGTGA TACTCGCGGC CAGCAAAAGA GAACCTCTGA TTTTGGTTTT
ATTGGCCTTA AAAGTGGGAT CAATCTGGGA TTGTGGCAAG TACGCCAGCA ATCGAACGTC
CGCTACGCCA GTAATGATAG TGGCAGCGAT ACCCAGTGGA ATAGCATCCG CACTTATGTG
CAACGCCCGA TCCCACAGTT GGATAGCCAG TTAACGTTAG GCGAAACTTT CACTGACAGT
ACATTATTTG GCAGCATGTC ATTCCTCGGT GCGAAAATGG CTACCGACCA ACGCATGTGG
CCAGTGTCAA TGCGCGGTTT CTCCCCAGAG GTCCGGGGTG TCGCCAGTAC CAACGCACGC
GTTATTATTC GTCAGAATGG ACGTGAAATT TATGAGACGA ACGTCGCTCC TGGCCCGTTC
GTTATTAACG ATTTGTTCAG CACCAGTAGC CAAGGCGACC TTAATGTTGA GGTTATTGAG
GCCAATGGTA GCCGCTCGAC ATTTACGGTG CCATTCAGTG CCGTTCCTGA TTCAATGCGC
CCAGGCGTTA GCCGCTATAA TGCGGTGATC GGGGAATCAC GCGATTTCAC CAATATTGAT
AATTATTTCA CTGACTTCAC CTATGAACGC GGCCTCACCA ATCAACTGAC GGCCAACAGT
GGTGTCCGTT TAGCAAAAGA CTACACGGCA TTACTTGCCG GTGGCGTATT AGGGACACCG
GTCGGTGCTT TGGGCCTCAA TGCCACCTAT TCTCACGCCA AAGTTGAAAA TGATAAAACG
CAAGATGGTT GGCGGATGCA GGCCACCTAT AGCCAGACCT TTAATCAAAC CGGAACCACG
TTCTCTTTAG CCGGGTATCG TTACTCAACG AAAGGGTATC GCGATTTAAA TGATGTGTTC
GGGGTGCGTT CGATGCAAAA AAACGGGGGA ACGTGGGATT CATCAACCTA TAAACAGCGT
AGCCAATTCA CCACTACTAT TAATCAAGAC CTCGGTAATT GGGGCCAGTT ATACACCTCT
GCCTCAACCA GTGATTATTA TAACGATACG GCGAGAGATA CCCAGTTACA GCTAGGCTAT
TCAAATAGTT ATCAGCAAAT CAGTTACAAT TTGGCTGTCA GCCGCCAGCG CAGTGTTTAT
ACCTCCACTC TGTATAACTG GGATAGCCCT GATACCGACG AGACGGCAAC CACCACGCGC
TACGGTAATA CCGAGAACAT CGCAACATTC ACCGTTTCTA TTCCACTGAA TATTGGTAGC
AACAACCAAT ATCTATCAAT GTCAACCAGC CGTAATCCGA AAAGCGGTAA TAACTATCAA
ACATCATTAT CAGGCACCGC CGGTGAGCGA AATTCCTTCA ACTATGCATT AAATGCAGGC
TATGACGACA GCAACTTCGG TAGTAGCTCA AATAATTGGG GGGCCAACGT ACAGAAGCAA
TTCCCCAATG CTACCGTTAA TGGCAGCTAT TCTCGTGGCA ATAATTACAC CCAATATGGC
GCAGGCGCTC GTGGTGCGGC GGTAATACAT CGACAAGGCG TGACTCTGGG CCCTTATCTG
GGTGAGACCT TTGGTTTAAT TGAAGCTAAT GGTGCTCAAG GGGCGACAGT TCGTAATGCC
CAAGGGGCAA GAATTGATAG TAACGGTTTT GCTCTGGTGC CCGCGCTAAC GCCTTATAAC
TACAACACAA TAGGGCTGGA TACCAAGGGG ATTAACCGTA ACACCGAGTT GAAAGAGAAT
CAGGGCCGTG TCGTCCCTTA TGCCGGGGCA GCGGTAAAAG TGAAATTTGA AACACTGACT
GGCTATGCGG TGTTAATTCA AGCCGAAGGT GAGGGGTTAC CACTGGGTGC CGATGTGTAT
AACAGCAAAG ATGAACTGGT TGGAATGGTA GGTCAGGGGA ATCAGATCTA CGCACGGATA
GCCGATAACA AAGGTACACT TGATGTCCGT TGGGGCGAAA GCAGCGGTGA TCAGTGTCAA
TTACCTTATG CTTTTAATCG CCAGGATACC GAGCAAGATA TCATTCATAT AACCGCGAGT
TGCCGCCGTT AA
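
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any calculation. A small sketch of how a GC percentage like the one in the record could be computed, assuming it is simply the share of G and C bases over the full sequence (the record does not say how the value is derived or rounded, and the helper names are hypothetical):

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, used only as sample input.
first_row = "ATGGTGCAGG CACGGGTCAT TCTGAAAAAA AATTTCTCAG GCCGTAGGAA AGCATTAACA"
print(f"{gc_percent(first_row):.1f}% GC over {len(clean_sequence(first_row))} bases")
```

Passing the complete gene sequence instead of the first row gives the value for the whole gene.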
 
Protein sequence
MVQARVILKK NFSGRRKALT LCITLILHID TAFGQEEPQN FEFDESLFLG TKYASGLTQL 
NKKNSITAGN YDAVDVLVNN KLFKRMSVQF IKDANSSEVY PCLSDELLTA AGVELGRENS
TPPKEPHVTE ANTPITETHA PTNQCLPLST RVKGASFRFD QAKLRLELSI PQALLQKRPR
GYIERAEWQE GEKLAFINYS ANAYRSDTRG QQKRTSDFGF IGLKSGINLG LWQVRQQSNV
RYASNDSGSD TQWNSIRTYV QRPIPQLDSQ LTLGETFTDS TLFGSMSFLG AKMATDQRMW
PVSMRGFSPE VRGVASTNAR VIIRQNGREI YETNVAPGPF VINDLFSTSS QGDLNVEVIE
ANGSRSTFTV PFSAVPDSMR PGVSRYNAVI GESRDFTNID NYFTDFTYER GLTNQLTANS
GVRLAKDYTA LLAGGVLGTP VGALGLNATY SHAKVENDKT QDGWRMQATY SQTFNQTGTT
FSLAGYRYST KGYRDLNDVF GVRSMQKNGG TWDSSTYKQR SQFTTTINQD LGNWGQLYTS
ASTSDYYNDT ARDTQLQLGY SNSYQQISYN LAVSRQRSVY TSTLYNWDSP DTDETATTTR
YGNTENIATF TVSIPLNIGS NNQYLSMSTS RNPKSGNNYQ TSLSGTAGER NSFNYALNAG
YDDSNFGSSS NNWGANVQKQ FPNATVNGSY SRGNNYTQYG AGARGAAVIH RQGVTLGPYL
GETFGLIEAN GAQGATVRNA QGARIDSNGF ALVPALTPYN YNTIGLDTKG INRNTELKEN
QGRVVPYAGA AVKVKFETLT GYAVLIQAEG EGLPLGADVY NSKDELVGMV GQGNQIYARI
ADNKGTLDVR WGESSGDQCQ LPYAFNRQDT EQDIIHITAS CRR
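
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with the first and last rows of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons; table 11 differs in its alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not affect the result). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGGTGCAGG CACGGGTCAT TCTGAAAAAA AATTTCTCAG GCCGTAGGAA AGCATTAACA"
last_row = "TGCCGCCGTT AA"
print(translate(first_row))  # MVQARVILKKNFSGRRKALT
print(translate(last_row))   # CRR
```

The two outputs match the first 20 and the last 3 residues of the protein sequence listed in the record.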