Gene YpsIP31758_2026 details


Gene Information

Locus tag: YpsIP31758_2026
Symbol:
ID: 5385932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2333013
End bp: 2334329
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 49%
IMG OID: 640865010
Product: hypothetical protein
Protein accession: YP_001400999
Protein GI: 153949611
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
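
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the protein length excludes the stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 2333013
END_BP = 2334329


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1317 438
```

Both results match the Gene Length (1317 bp) and Protein Length (438 aa) fields listed in the record.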


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000148125
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGCAGA TAGTCAGAAC TATCACTTTA GCATATAACA ATCTTCCTCG ACCCCATCGC 
ATCATGTTGG GGTCGTTGAC TGTAATGACA CTGGCCGTCG CTGTTTGGCG GCCTTTTATC
TATCACCCAG AGCATGAACC TGTTGCCAAG AGCGTTGTGC TAGATACCAG TCAATCCCGC
ATTTTATTGC CAGAAGCCAG TGAACCTCTC GATCAGCCAA CGCCTGACGA TGCAATCCCA
CAAGATGAAT TAGACACCAA AGATGCCAAT GACACTGGTG TCCATGAGTA TGTTGTGTCA
ACGGGTGATA CGTTGAGCAG CATTCTTACC CAGTATGGGA TTGATATCTC TGATGTTTCT
TTATTAGCGA ACCAAAACCG TGACTTACGT AATCTGAAGA TCGGGCAGCA AATCTCCTGG
ACAGTGAATG ACACCGGTGA TTTACAAAGC CTGACTTGGG AAGTGTCGCG TCGCGAAACT
CGGACGTATA ACCGTGTTGG GAACAATTTC AAAGAGACTA AAGAGCTGCA GAAAGGCGAG
TGGAAGAACA GTGTTCTGAC GGGCCGTCTT GATGGTAGTT TTGTCGGAAG TGCGAAGAAA
GCAGGGCTGA CAGCCGCTGA AATTCGTGCC GTGACCAAGG CACTACAGTG GCAATTAGAT
TTCAGTAAGC TGCGTAAAGG CGATCAGTTT GCGGTCTTAA TGTCACGGGA AATGCTGGAT
GGTCGCAGTG AGCAAAGCCA GCTCGTTGGG GTACGCATGC GGTCTGGAGG TAAAGATTAT
TATGCTATCC GGGCTGAAGA TGGCAAATTC TATGATCGGC AGGGCTCTGG CCTTGCACGT
GGTTTCCTAC GTTTCCCTAC ACTGAAACAA TTCCGTGTCT CATCCAATTT TAACCCTCGT
CGGTTAAATC CGGTGACTGG GCGTATTGCG CCGCATAAAG GCGTGGATTT CGCGATGCCT
GTCGGGACAC CGGTATTAGC CGTGGGCGAT GGCGAAGTCT TAATCTCAAA ATTCAGTGGT
GCTGCCGGTA ACTATGTGGT TATCCGCCAT GGTCGCCAAT ACACCACGCA TTACATGCAT
TTGAAAAAAT TGCTGGTTAA ACCCGGGCAG AAAGTAAAAC GTGGTGACCG TATCGCGTTG
TCGGGTAATA CCGGCCGTTC GACTGGGCCA CATCTGCATT ATGAATTTTG GATGAACCAG
CAAGCGGTTA ATCCACTGAC AGCTAAATTG CCACGTTCAG AAGGGTTGAG TGGTAAAGAT
CGCAGTGAGT ATTTGGCCAT TGTTAAGCAA GTTATTCCGC AGTTACAACT GGATTAG
 
Protein sequence
MQQIVRTITL AYNNLPRPHR IMLGSLTVMT LAVAVWRPFI YHPEHEPVAK SVVLDTSQSR 
ILLPEASEPL DQPTPDDAIP QDELDTKDAN DTGVHEYVVS TGDTLSSILT QYGIDISDVS
LLANQNRDLR NLKIGQQISW TVNDTGDLQS LTWEVSRRET RTYNRVGNNF KETKELQKGE
WKNSVLTGRL DGSFVGSAKK AGLTAAEIRA VTKALQWQLD FSKLRKGDQF AVLMSREMLD
GRSEQSQLVG VRMRSGGKDY YAIRAEDGKF YDRQGSGLAR GFLRFPTLKQ FRVSSNFNPR
RLNPVTGRIA PHKGVDFAMP VGTPVLAVGD GEVLISKFSG AAGNYVVIRH GRQYTTHYMH
LKKLLVKPGQ KVKRGDRIAL SGNTGRSTGP HLHYEFWMNQ QAVNPLTAKL PRSEGLSGKD
RSEYLAIVKQ VIPQLQLD
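
The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where an alternative initiation codon is read as methionine. The sketch below illustrates that relationship under stated assumptions: the input is a complete CDS on its coding strand, the first codon is the initiation codon, and the codon table is written out by hand (table 11 assigns the same amino acids as the standard code and differs in its start codons). `clean` and `translate_cds` are hypothetical helper names, not part of any database tool.

```python
from itertools import product

# Amino-acid assignments in TCAG order; table 11 shares these with the
# standard code. Written out by hand for this sketch.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, assuming its first codon is the initiation codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()      # drop the terminal stop codon
    if residues:
        residues[0] = "M"   # assumption: initiator codon is read as Met
    return "".join(residues)


# First 21 bases of the gene sequence above, in the record's block layout.
print(translate_cds("GTGCAGCAGA TAGTCAGAAC T"))  # MQQIVRT
```

The printed residues match the first seven residues of the protein sequence in the record. Passing the full gene sequence to the same function is one way to compare the translation against the listed protein.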