Gene YpsIP31758_3733 details


Gene Information

Locus tag: YpsIP31758_3733
Symbol:
ID: 5385187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4207845
End bp: 4209182
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 51%
IMG OID: 640866757
Product: pilin accessory protein PilO
Protein accession: YP_001402687
Protein GI: 153948492
COG category:
COG ID:
TIGRFAM ID:
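The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4207845
end_bp = 4209182
gene_length_bp = 1338
protein_length_aa = 445

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # 1338 0 445
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.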


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA ATATTGCTGC TCGCCCTATC ATGCCAGCCC AACAGGTAAT GAGAATGGGG 
CGCTTGGGAT GGGTGGCGGG TATGAACTGG CAGATGCACG ATCTGGTCGC CCAAAAAAAA
CGGTTTTTTA GTCGTGCTGA TGGAGCCACT CATCGGCTGA AATTGTTGGC TAAAGCGCAA
CAGATTACCG GTCAGGCGCG CCCTCTTGGT GTTCAACCGG GGCTCCGGCT TTGTTCGCTT
GCTGCGGCCT ATCTGGCTCG CGAGGGCGGG AGTCATTACG GCATTTATCA ACTCGATGCA
AAAAATGATA AGTGGCTTTT TCTGGCAACG ACTGGCGGTT TGCCGTCTGT CATGGGCGAT
ATCATCGGAA CCTTGAATGA GGTATTGAGC GCTCAACAGC GGTTTTTAGA TTTTAATGCA
CCTGAATCAT CTACTTCACT GACATCTCTT TCACTGACAT CTACTGCATT GAATTGTACG
GCGACATCTG AGACACCTGT TTGTTGGCAG ACGTTGACGA CAGGACTGTC TCGTCAGGCA
ATTCGTGCAA CAACCCTGGG CCGTCTCCTT TCTGGCCGCA CCATCGCGTG TTTTGCTCTA
CCCGTCTTGC TTGGTTCAAC GGCTTTCTGG TACTGGGGAA ATCAGCTTGA TAAGGCTGAA
ATGGTAGGCA AAGTCGCTCA GGCTAAGGCG TTACTTGAAT TGCAAAGCCA GAACGTCACC
GAATCGACCA AAGAAGAGAA CCTGCCGCAC CCTTGGGCGA GCATCTGGCC AACACCTTAT
TTTCTCAGTC AGTGCCTTGT TGTTCGCCAG CGACTGCCCA TAACTCTCAC TGGCTGGCGG
CTTGCTTACG GCGAGTGTGT GAGTGAGGGG ATGCGCCTGC GTTATGTGGC AACGGCTGCC
AGTACCATCG CAGATTTCTC CCGTCGGGCT CGGGAATTGC TCGGGCAAGA TGCGTATTTT
AACCTGGAGG AAGGGGGCCA AAATGGTGAT GTGTTGATCC CTTTTGTTAC CACCGATTCT
CCATCGTTAT GGCGAGATGA AAAGATCCCT CCGTCAGCCG TACAGTTGAT GCAGTTCATT
TCACATTTTC AGCGACGAAA TATCAGTGTG CTTCTCAATG AAGTCAAACC ACCTCCGGTG
GTTCCGGGGC AGGAAAATAC GATCCCTTCG CAAGGCTGGC GAGAGTTTAC CTTTACTTTC
ACGACAAAAT CATCACCAGA AGGGGTGTTA GCCAGCATAG ATGATATCGG TCTGCGTCTG
ACAAGCATCG CGTTTACGCT CAACCCGCAG AGTCAATTTG AATACACTAT CAAGGGGAGC
CTGTATGCAC AAAATTAA
 
Protein sequence
MNKNIAARPI MPAQQVMRMG RLGWVAGMNW QMHDLVAQKK RFFSRADGAT HRLKLLAKAQ 
QITGQARPLG VQPGLRLCSL AAAYLAREGG SHYGIYQLDA KNDKWLFLAT TGGLPSVMGD
IIGTLNEVLS AQQRFLDFNA PESSTSLTSL SLTSTALNCT ATSETPVCWQ TLTTGLSRQA
IRATTLGRLL SGRTIACFAL PVLLGSTAFW YWGNQLDKAE MVGKVAQAKA LLELQSQNVT
ESTKEENLPH PWASIWPTPY FLSQCLVVRQ RLPITLTGWR LAYGECVSEG MRLRYVATAA
STIADFSRRA RELLGQDAYF NLEEGGQNGD VLIPFVTTDS PSLWRDEKIP PSAVQLMQFI
SHFQRRNISV LLNEVKPPPV VPGQENTIPS QGWREFTFTF TTKSSPEGVL ASIDDIGLRL
TSIAFTLNPQ SQFEYTIKGS LYAQN
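
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch of that relationship, the snippet below translates the first line of the gene sequence without any external library. The helper `translate` and the constant names are hypothetical, introduced only for illustration. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard assignments are used here.

```python
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above, spaces included.
first_line = "ATGAACAAGA ATATTGCTGC TCGCCCTATC ATGCCAGCCC AACAGGTAAT GAGAATGGGG"
print(translate(first_line))  # MNKNIAARPIMPAQQVMRMG
```

The output corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the whole protein, ending at the terminal TAA stop codon.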