Gene YPK_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3551 
Symbol 
ID6089438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp3909212 
End bp3910822 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content52% 
IMG OID641598635 
Producthypothetical protein 
Protein accessionYP_001722271 
Protein GI170025766 
COG category[S] Function unknown 
COG ID[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.178777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAT TTGAACGCCA GATCCGTGCA GCCATTTCCG CAGCACGCAA TGGCGCAAAA 
CATGCGGAAC AGTCACTGAC TACACCAATG TGGCAAGCCA AAAGCACCGT AGCCTCATTG
GGTGGGATTG TCCCTAGAAG TGGCTCTTCG TCAACGTCAC AGGCGGAGAA CTATAAGGAA
GGTCTCGCGG ACCAGGCTGC CTCGGGCAAC AACATGGCGC GCACGAGTGC GCCACCGGTC
ACTTTGTATC AGCAACAGCC AAATGCGAAT GACAGCTATC CAAACGGGAA TAACAACAAT
CCAAACGGGG ATAACAACAA TCCAAACGGG AGTAACAACA ATATAGCGAG AGTACAGCGT
ATGCCGCATG GCATTTCCAG GGGCTTATAT GAGCGCCCTG GGATGTTATT GGGTGCCTGG
GATAACGCCT ATATTGCTGC GGCTATGCCT TTGCTGCTGC TGGTGGAAAA TATTCGTAGC
TGGCCGACGC GTAACGCCGC AGAGGTCAGG CCACCGATTG TGCGGGAATT ACAATATTTC
CAGCAACATT TGCAGAAAAA GAACTACCCG CAAGAAGACA TTAACCACCT GTCTTACCTG
CTATGTACCT ATATCGATGG CATTTTTAAC GGGCTGCAAA CCCCAGGCTC CTACAACCAA
AGTCTGTTAG TGGAGTTTCA CCGTGATGCC TGGGGGGGTG AGGACTGCTT CGAACATCTG
CGGGTCTATA TGAACTCGCC GAAACAGTAC CGGGAAGTTC TGGAATTCTA TGATCTGATT
ATGTGCCTTG GTTTTGACGG TAAATACCAG ATGATAGAGC ATGGTGCGGT TCTGCTGATG
GATTTACGCA GCCGTCTCCA CACGCAACTC TACGGTCAGG ACGCCACACA ATCTTTGGCT
ATCGCGCAAG CGGTCAAAGG TTCTCCGCGT CGCCAATATA TCAAGGCGCT GAAAATCTTC
ACCTATGGTT TCGCACTGTG CCTTTGTGCT TACGGCGTCA CGGCGTGGTA TCTGCACCAG
CAATCCCAAC AGATCCGCAG CAACATTCTG ACGTGGGTAC TGCCTGAACC GCGGAAAATC
AACATCATGG AGACCTTGCC GAATCCGCTA TCCAACATCC TGAATGAAGG GTGGCTGGAG
GTCAGGAAAG ATCCGCGTGG ATGGCTATTA ATCTTCACCT CCGACGGCGC GTTCCGCACG
GGTGAAGCGA CCCTCTCGGA AGAGTTTATC AACAAGAAGA ATATCGAACG TCTTGGGCTG
GCATTAGCCC CATGGCCGGG AGATATCGAG GTTATTGGTC ATACGGATAA CAAACCGTTC
CGTAGCACTT CCGGTAACAA CAACCTCAAA CTTTCCGCGG CCAGAGCATC GGTGGTGGCA
GATAAACTGC GGGAATCCAC TCAAATCAAC GAAACCCATC AGCGAGAAAT AAGTGCCATC
GGACGGGGGG AGAGCGATCC TTTAGCTGAC AATGCAACGG AAGAAGGGCG CAAGCGTAAC
CGGCGTGTGG ATATCCTATG GAAAATTGGT CAGCGCGATG CCGATAAGGC CATGAAGCAA
TTCCTGGAGA ACCCAACACC AGAAGTTCAA GGAACGAATA CCCAACAATA G
 
Protein sequence
MNEFERQIRA AISAARNGAK HAEQSLTTPM WQAKSTVASL GGIVPRSGSS STSQAENYKE 
GLADQAASGN NMARTSAPPV TLYQQQPNAN DSYPNGNNNN PNGDNNNPNG SNNNIARVQR
MPHGISRGLY ERPGMLLGAW DNAYIAAAMP LLLLVENIRS WPTRNAAEVR PPIVRELQYF
QQHLQKKNYP QEDINHLSYL LCTYIDGIFN GLQTPGSYNQ SLLVEFHRDA WGGEDCFEHL
RVYMNSPKQY REVLEFYDLI MCLGFDGKYQ MIEHGAVLLM DLRSRLHTQL YGQDATQSLA
IAQAVKGSPR RQYIKALKIF TYGFALCLCA YGVTAWYLHQ QSQQIRSNIL TWVLPEPRKI
NIMETLPNPL SNILNEGWLE VRKDPRGWLL IFTSDGAFRT GEATLSEEFI NKKNIERLGL
ALAPWPGDIE VIGHTDNKPF RSTSGNNNLK LSAARASVVA DKLRESTQIN ETHQREISAI
GRGESDPLAD NATEEGRKRN RRVDILWKIG QRDADKAMKQ FLENPTPEVQ GTNTQQ