Gene YpsIP31758_4077 details


Gene Information

Locus tag: YpsIP31758_4077
Symbol:
ID: 5388026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4595913
End bp: 4597304
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 37%
IMG OID: 640867105
Product: putative hemagglutinin/hemolysin
Protein accession: YP_001403021
Protein GI: 153949762
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
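
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4595913, 4597304)
print(length)                  # 1392
print(protein_length(length))  # 463
```

Both values agree with the Gene Length and Protein Length fields of the record.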


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGA CATATATTAA AACGAAACAG GGATTGTCTA TATTGCCTCT CTCTATCATA 
TTGTCGCTCT ATGGGAGTTC GGTTGCTTAT GCTGATAATA TTATTGCTGA TGTACAGGCT
CCAATTGGAC AGCAAGCGGA AGTCTCTATT ATAAAAACGC CGCCCAGTGT ATGCCGAGCG
CTTACCTCTT ACTGTGTCGG TATGACAGAG ACCGTCGTTA ATATTCAAGC ACCTGATGAA
AATGGTTTAT CACATAACAA GTATTCTAAA TTTGATGTGG TCGCTAATGG CTTATTCGAT
GTCACGACAC TGAATAATCG TTTAGCACAA GAGGTTGATG GTAACTCTTT TTTACAAGAC
AAGTTAGCAA CCATTATATT AAATGAAGTC AATTCATCAC AGGCTAGTCT ATTAGATGGG
AATCTCCATG TTGGTGGGCA AGATGCGCAT GTCATTATTG CTAATCCAGC GGGTATTAAT
TGTCGAGGGT GCTCCTTTAC CAATACCTCT CATGTGACAT TGACTACCGG GGCACCATCG
TTTAGTAACA ATAAGCTAAA TAATTTCATT GTTGAGCAGG GTAATATTAA TATTGAAAAA
GATCCCTCTT ACTATATGAA AAGTGGCTTG CGAAATAAAA GTATGGATAC GACTTACCTT
GATTTATTTG CGGAAAAAAT CACTGTCAAT GGTGATATCA ATGCGGACGA TGTTTATATT
GTCACGGGAA AAAATAAAGT AGGTTTCTCT TTGCCTGGGC AACCATTGCA CGTGTCGCGT
TTAGACAATG AAAATACACC AGTACCAGAT ACAGTTAGTT TGGATGTCAG TGAAATTGGG
GGAATGTACG CCAATAAAAT TCGTATCTAT ACAACCGATA GCACGATTAA AAATAAGGGG
GCAATACGCG CCAACGATAC ACTAAGCCTC AGTTCTGCAG CCAATATAGA TAACAGTAAT
GGGAATATAT CAGGTAAAAT GGTGTTACTG AGCAGCGAGG GTGTTATAAA TAACTCTGGT
GGCACAATAT TAAATAATGG TGAGTATGAT TTATTACCTT CTCAAGGTAT TAAAATAACA
TCTCGTGGGT TAAATAATGA AGGTGGAAAA ATAGAGTATA AAAATGGTAG CGTTGAAATA
GCAACAGTTA ACACCATCAA AAATGGTAAA GGTACAATTA AAGCAACATC AACGCAAGGG
CGGGTAAAAA TGAACCTTCA CAGTAATCAT CTTAATAATA CTGGAGGGAG TGTCATTTCT
TCAGGAAAAG TAGAGGGTAA AGTTAATAAT ATACGAAACA ATAGAGGGTC CATTATAGGT
TTAGGGGGAG TGGATTTGAA TGAAACTGTT TTAATTAATA GTACCGGTAA AATAATTTCT
GGTTTTAATT GA
 
Protein sequence
MKLTYIKTKQ GLSILPLSII LSLYGSSVAY ADNIIADVQA PIGQQAEVSI IKTPPSVCRA 
LTSYCVGMTE TVVNIQAPDE NGLSHNKYSK FDVVANGLFD VTTLNNRLAQ EVDGNSFLQD
KLATIILNEV NSSQASLLDG NLHVGGQDAH VIIANPAGIN CRGCSFTNTS HVTLTTGAPS
FSNNKLNNFI VEQGNINIEK DPSYYMKSGL RNKSMDTTYL DLFAEKITVN GDINADDVYI
VTGKNKVGFS LPGQPLHVSR LDNENTPVPD TVSLDVSEIG GMYANKIRIY TTDSTIKNKG
AIRANDTLSL SSAANIDNSN GNISGKMVLL SSEGVINNSG GTILNNGEYD LLPSQGIKIT
SRGLNNEGGK IEYKNGSVEI ATVNTIKNGK GTIKATSTQG RVKMNLHSNH LNNTGGSVIS
SGKVEGKVNN IRNNRGSIIG LGGVDLNETV LINSTGKIIS GFN
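
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that translates the first 60 bases of the gene. It is not the tool used to produce this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 for all non-start codons; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order. Table 11 uses the same assignments as the
# standard code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGAAATTGA CATATATTAA AACGAAACAG "
    "GGATTGTCTA TATTGCCTCT CTCTATCATA"
)
print(translate(first_line))  # MKLTYIKTKQGLSILPLSII
```

The output matches the first 20 residues of the protein sequence. This gene starts with ATG, so the sketch does not need the table 11 rule that alternative start codons are read as methionine.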