Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_0572 |
Symbol | |
ID | 6087864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 631066 |
End bp | 633042 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641595634 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001719328 |
Protein GI | 170022823 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA ACCTGTACCG TATCGTTTTT AATCAAGCGC GCGGCATGTT GATGGTGGTC GCCGATATTG CGGCCTCCGG TCGGGCTGCA TCGTCGCCAT CGTCAGGTGT TGGGCATACT CAGCGCCACC GGGTCAGTGC CTTATCACCA TTGAGTTTTC GTCTGTTAAT GGCTTTGGGG GGCATCTCGT TATCCGCTCA GGCCGCAATT GTGGCGGATG GCAGTGCGCC GGGTAATCAG CAGCCGACCA TTATCAGCAG CGCCAACGGC ACGCCTCAGG TTAATATCCA AACCCCCAGT AGCGGCGGTG TCTCGCGCAA CGCTTACCGC CAGTTTGATG TCGATAATCG CGGGGTGATC CTCAATAACG GTCGCGGTGT CAACCAGACC CAGATTGCCG GATTGGTTGA TGGTAATCCG TGGCTGGCGC GGGGCGAAGC CAGCGTGATC CTCAACGAAG TCAACAGTCG CGACCCGAGT CAGCTCAACG GCTATATCGA AGTGGCCGGA CGTAAAGCGC AGGTGGTGAT CGCTAACCCA GCAGGCATTA CCTGCGAGGG CTGTGGTTTT ATCAACGCCA ACCGCGCAAC CCTCACCACC GGTCAGGCGC AGCTCAATAA CGGCCAACTC ACTGGCTACG ATGTTGAACG GGGTGAAATT GTTATTCAGG GAAAGGGCCT GGACAGCCGT GGTCAGGATC ATACCGATCT GATCGCCCGT TCCGTTAAAG TAAATGCGGG CATCTGGGCC AATGAACTGA ATATCACTAC CGGGCGCAAT CAGGTTGATG CCGCACACCA GAATATCAAT ACCAACGCCG CCGATGGCCG CCCTCGTCCT GCCGTGGCGG TCGATGTGGC TAATTTGGGG GGGATGTACG CCGGTAAAAT CCGCCTAATC GGTACTGAAA CCGGTGTTGG CGTGCACAAT GCGGGCGAGA TAGGGGCTTC TGCAGGTGAT ATCGTGATTA CGGCCGACGG TATGCTGGTG AACCGCGGCC AGATCAGCAG TGCTCAACAA CTGGCGGTGA ATACCCCCTC AGGCATAGAG AATAGCGGTG TGCTCTATGG GAAGGGCAAT ACCCAACTGA CCACGGCGGG TAAACTGAGC AACAGTGGCA CGGTCGCGGC GGCGGGTGAC ACCTTGATCC GCGCGGCAGA GGTTAACAGC AGCCGTAATT CTGTTTTGGG TGCTGGCATT AAATCCGATA ACAGTGCCAT TACCCGTGGC ACCCTTGATA TTAAAGCCCG TGGGCAGCTA ACCGCCCAAG GGAAAAATAT TAGCGGCACG GCGCAGACAT TTAATGCGAA CCGTATTGAT CTCAGCGGTA GCCAAACTCA GAGCGGTGAT CTGACGTTCA CCACCGAAGG TGGCGACATC GATTTGACAG GGGCTAACCT GTTCGCCAAT CGTCGTCTGT CTGTTTCGAC CCCTTCTTTG CTACGCACCG ATAAAGCTAA CTTGTTCGCG GAGCAAATCG CGCTCGACGC ACAAGCGCTC GCCAATGTGG GTGGCGTGAT AACGCAAACT GGGCTGACCG ACTTCAACTT GAATCTACCG GGTTATATTG ATAACCGTGG TGGCTCTCTC CTCACCCGCG GCAACTTTTT GCTGCAAGCT GAACACTTGA CCAGTAATAG CCAGAGTTTA CTGGGTGCTG GCATACAGAG TGATGGCAAA CTGGCTCCGC GTGGTGATCT CAATGTCACT ACGCGGCACG CCTTGATTGC TCAAGGGAAA ACGCTAGCCG CAGGCACTCT GGCACTCTCC GGCAGCCGGC TTGATCTTAC TGATAGCCTG ACACAAGCCA AGTATATGCG GCTGACAGCC ACAGAAGGCG ATATTGCGTT AACCGGTGCC ACGGTGATGG CGGCTAACAC GTTGTTTGCG GATAGCCAAA TCAGACGGTT TATTGACTGG GCATGTTACA AAAACAACTC CTCAAATGAC AAAAGAAAAT ATCCGCTCAC TCAATAG
|
Protein sequence | MNKNLYRIVF NQARGMLMVV ADIAASGRAA SSPSSGVGHT QRHRVSALSP LSFRLLMALG GISLSAQAAI VADGSAPGNQ QPTIISSANG TPQVNIQTPS SGGVSRNAYR QFDVDNRGVI LNNGRGVNQT QIAGLVDGNP WLARGEASVI LNEVNSRDPS QLNGYIEVAG RKAQVVIANP AGITCEGCGF INANRATLTT GQAQLNNGQL TGYDVERGEI VIQGKGLDSR GQDHTDLIAR SVKVNAGIWA NELNITTGRN QVDAAHQNIN TNAADGRPRP AVAVDVANLG GMYAGKIRLI GTETGVGVHN AGEIGASAGD IVITADGMLV NRGQISSAQQ LAVNTPSGIE NSGVLYGKGN TQLTTAGKLS NSGTVAAAGD TLIRAAEVNS SRNSVLGAGI KSDNSAITRG TLDIKARGQL TAQGKNISGT AQTFNANRID LSGSQTQSGD LTFTTEGGDI DLTGANLFAN RRLSVSTPSL LRTDKANLFA EQIALDAQAL ANVGGVITQT GLTDFNLNLP GYIDNRGGSL LTRGNFLLQA EHLTSNSQSL LGAGIQSDGK LAPRGDLNVT TRHALIAQGK TLAAGTLALS GSRLDLTDSL TQAKYMRLTA TEGDIALTGA TVMAANTLFA DSQIRRFIDW ACYKNNSSND KRKYPLTQ
|
| |