Gene YPK_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0572 
Symbol 
ID6087864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp631066 
End bp633042 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content55% 
IMG OID641595634 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001719328 
Protein GI170022823 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA ACCTGTACCG TATCGTTTTT AATCAAGCGC GCGGCATGTT GATGGTGGTC 
GCCGATATTG CGGCCTCCGG TCGGGCTGCA TCGTCGCCAT CGTCAGGTGT TGGGCATACT
CAGCGCCACC GGGTCAGTGC CTTATCACCA TTGAGTTTTC GTCTGTTAAT GGCTTTGGGG
GGCATCTCGT TATCCGCTCA GGCCGCAATT GTGGCGGATG GCAGTGCGCC GGGTAATCAG
CAGCCGACCA TTATCAGCAG CGCCAACGGC ACGCCTCAGG TTAATATCCA AACCCCCAGT
AGCGGCGGTG TCTCGCGCAA CGCTTACCGC CAGTTTGATG TCGATAATCG CGGGGTGATC
CTCAATAACG GTCGCGGTGT CAACCAGACC CAGATTGCCG GATTGGTTGA TGGTAATCCG
TGGCTGGCGC GGGGCGAAGC CAGCGTGATC CTCAACGAAG TCAACAGTCG CGACCCGAGT
CAGCTCAACG GCTATATCGA AGTGGCCGGA CGTAAAGCGC AGGTGGTGAT CGCTAACCCA
GCAGGCATTA CCTGCGAGGG CTGTGGTTTT ATCAACGCCA ACCGCGCAAC CCTCACCACC
GGTCAGGCGC AGCTCAATAA CGGCCAACTC ACTGGCTACG ATGTTGAACG GGGTGAAATT
GTTATTCAGG GAAAGGGCCT GGACAGCCGT GGTCAGGATC ATACCGATCT GATCGCCCGT
TCCGTTAAAG TAAATGCGGG CATCTGGGCC AATGAACTGA ATATCACTAC CGGGCGCAAT
CAGGTTGATG CCGCACACCA GAATATCAAT ACCAACGCCG CCGATGGCCG CCCTCGTCCT
GCCGTGGCGG TCGATGTGGC TAATTTGGGG GGGATGTACG CCGGTAAAAT CCGCCTAATC
GGTACTGAAA CCGGTGTTGG CGTGCACAAT GCGGGCGAGA TAGGGGCTTC TGCAGGTGAT
ATCGTGATTA CGGCCGACGG TATGCTGGTG AACCGCGGCC AGATCAGCAG TGCTCAACAA
CTGGCGGTGA ATACCCCCTC AGGCATAGAG AATAGCGGTG TGCTCTATGG GAAGGGCAAT
ACCCAACTGA CCACGGCGGG TAAACTGAGC AACAGTGGCA CGGTCGCGGC GGCGGGTGAC
ACCTTGATCC GCGCGGCAGA GGTTAACAGC AGCCGTAATT CTGTTTTGGG TGCTGGCATT
AAATCCGATA ACAGTGCCAT TACCCGTGGC ACCCTTGATA TTAAAGCCCG TGGGCAGCTA
ACCGCCCAAG GGAAAAATAT TAGCGGCACG GCGCAGACAT TTAATGCGAA CCGTATTGAT
CTCAGCGGTA GCCAAACTCA GAGCGGTGAT CTGACGTTCA CCACCGAAGG TGGCGACATC
GATTTGACAG GGGCTAACCT GTTCGCCAAT CGTCGTCTGT CTGTTTCGAC CCCTTCTTTG
CTACGCACCG ATAAAGCTAA CTTGTTCGCG GAGCAAATCG CGCTCGACGC ACAAGCGCTC
GCCAATGTGG GTGGCGTGAT AACGCAAACT GGGCTGACCG ACTTCAACTT GAATCTACCG
GGTTATATTG ATAACCGTGG TGGCTCTCTC CTCACCCGCG GCAACTTTTT GCTGCAAGCT
GAACACTTGA CCAGTAATAG CCAGAGTTTA CTGGGTGCTG GCATACAGAG TGATGGCAAA
CTGGCTCCGC GTGGTGATCT CAATGTCACT ACGCGGCACG CCTTGATTGC TCAAGGGAAA
ACGCTAGCCG CAGGCACTCT GGCACTCTCC GGCAGCCGGC TTGATCTTAC TGATAGCCTG
ACACAAGCCA AGTATATGCG GCTGACAGCC ACAGAAGGCG ATATTGCGTT AACCGGTGCC
ACGGTGATGG CGGCTAACAC GTTGTTTGCG GATAGCCAAA TCAGACGGTT TATTGACTGG
GCATGTTACA AAAACAACTC CTCAAATGAC AAAAGAAAAT ATCCGCTCAC TCAATAG
 
Protein sequence
MNKNLYRIVF NQARGMLMVV ADIAASGRAA SSPSSGVGHT QRHRVSALSP LSFRLLMALG 
GISLSAQAAI VADGSAPGNQ QPTIISSANG TPQVNIQTPS SGGVSRNAYR QFDVDNRGVI
LNNGRGVNQT QIAGLVDGNP WLARGEASVI LNEVNSRDPS QLNGYIEVAG RKAQVVIANP
AGITCEGCGF INANRATLTT GQAQLNNGQL TGYDVERGEI VIQGKGLDSR GQDHTDLIAR
SVKVNAGIWA NELNITTGRN QVDAAHQNIN TNAADGRPRP AVAVDVANLG GMYAGKIRLI
GTETGVGVHN AGEIGASAGD IVITADGMLV NRGQISSAQQ LAVNTPSGIE NSGVLYGKGN
TQLTTAGKLS NSGTVAAAGD TLIRAAEVNS SRNSVLGAGI KSDNSAITRG TLDIKARGQL
TAQGKNISGT AQTFNANRID LSGSQTQSGD LTFTTEGGDI DLTGANLFAN RRLSVSTPSL
LRTDKANLFA EQIALDAQAL ANVGGVITQT GLTDFNLNLP GYIDNRGGSL LTRGNFLLQA
EHLTSNSQSL LGAGIQSDGK LAPRGDLNVT TRHALIAQGK TLAAGTLALS GSRLDLTDSL
TQAKYMRLTA TEGDIALTGA TVMAANTLFA DSQIRRFIDW ACYKNNSSND KRKYPLTQ