Gene YpsIP31758_3422 details


Gene Information

Locus tag: YpsIP31758_3422
Symbol:
ID: 5384334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3854136
End bp: 3855746
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 51%
IMG OID: 640866435
Product: hypothetical protein
Protein accession: YP_001402377
Protein GI: 153950101
COG category: [S] Function unknown
COG ID: [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
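
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3854136
END_BP = 3855746
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)              # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the reported gene length (1611 bp) and protein length (536 aa) agree with the coordinates.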


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAT TTGAACGCCA GATCCGTGCA GCCATTTCCG CAGCACGCAA TGGCGCAAAA 
CATGCGGAAC AGTCACTGAC TACACCAATG TGGCAAGCCA AAAGCACCGT AGCCTCATTG
GGTGGGATTG TCCCTAGAAG TGGCTCTTCG TCAACGTCAC AGGCGGAGAA CTATAAGGAA
GATCTCGCGG ACCAGGCTGC CTCGGGCAAC AACATGGCGC GCACGAGTGC GCCACCGGTC
ACTTTGTATC AGCAACAGCC AAATGCGAAT GACAGCTATC CAAACGGGAA TAACAACAAT
CCAAACGGGG ATAACAACAA TCCAAACGGG AGTAACAACA ATATAGCGAG AGTACAGCGT
ATGCCGCATG GCATTTCCAG GGGCTTATAT GAGCGCCCTG GGATGTTATT GGGTGCCTGG
GATAACGCCT ATATTGCTGC GGCTATGCCT TTGCTGCTGC TGGTGGAAAA TATTCGTAGC
TGGCCGACGC GTAACGCCGC AGAGGTCAGG CCACCGATTG TGCGGGAATT ACAATATTTC
CAGCAACATT TGCAGAAAAA GAACTACCCG CAAGAAGACA TTAACCACCT GTCTTACCTG
CTATGTACCT ATATCGATGG CATTTTTAAC GGGCTGCAAA CCCCAGACTC CTACAACCAA
AGTCTGTTAG TGGAGTTTCA TCGTGATGCC TGGGGGGGTG AGGACTGCTT CGAACATCTG
CGGGTCTATA TGAACTCGCC GAAACAGTAC CGGGAAGTTC TGGAATTCTA TGATCTGATT
ATGTGCCTTG GTTTTGACGG TAAATACCAG ATGATAGAGC ATGGTGCGGT TCTGCTGATG
GATTTACGCA GCCGTCTCCA CACGCAACTC TACGGTCAGG ACGCCACACA ATCTTTGGCT
ATCGCGCAAG CGGTCAAAGG TTCTCCGCGT CGCCAATATA TCAAGGCGCT GAAAATCTTC
ACCTATGGTT TCGCACTGTG CCTTTGTGCT TACGGCGTCA CGGCGTGGTA TCTGCACCAG
CAATCCCAAC AGATCCGCAG CAACATTCTG ACGTGGGTAC TGCCTGAACC GCGGAAAATC
AACATCATGG AGACCTTGCC GAATCCGCTA TCCAACATCC TGAATGAAGG GTGGCTGGAG
GTCAGGAAAG ATCCGCGTGG ATGGCTATTA ATCTTCACCT CCGACGGCGC GTTCCGCACG
GGTGAAGCGA CCCTCTCGGA AGAGTTTATC AACAAGAAGA ATATCGAACG TCTTGGGCTG
GCATTAGCCC CATGGCCGGG AGATATCGAG GTTATTGGTC ATACGGATAA CAAACCGTTC
CGTAGCACTT CCGGTAACAA CAACCTCAAA CTTTCCGCGG CCAGAGCATC GGTGGTGGCA
GATAAACTGC GGGAATCCAC TCAAATCAAC GAAACCCATC AGCGAGAAAT AAGTGCCATC
GGACGGGGGG AGAGCGATCC TTTAGCTGAC AATGCAACGG AAGAAGGGCG CAAGCGTAAC
CGGCGTGTGG ATATCCTATG GAAAATTGGT CAGCGCGATG CCGATAAGGC CATGAAGCAA
TTCCTGGAGA ACCCAACACC AGAAGTTCAA GGAACGAATA CCCAACAATA G
 
Protein sequence
MNEFERQIRA AISAARNGAK HAEQSLTTPM WQAKSTVASL GGIVPRSGSS STSQAENYKE 
DLADQAASGN NMARTSAPPV TLYQQQPNAN DSYPNGNNNN PNGDNNNPNG SNNNIARVQR
MPHGISRGLY ERPGMLLGAW DNAYIAAAMP LLLLVENIRS WPTRNAAEVR PPIVRELQYF
QQHLQKKNYP QEDINHLSYL LCTYIDGIFN GLQTPDSYNQ SLLVEFHRDA WGGEDCFEHL
RVYMNSPKQY REVLEFYDLI MCLGFDGKYQ MIEHGAVLLM DLRSRLHTQL YGQDATQSLA
IAQAVKGSPR RQYIKALKIF TYGFALCLCA YGVTAWYLHQ QSQQIRSNIL TWVLPEPRKI
NIMETLPNPL SNILNEGWLE VRKDPRGWLL IFTSDGAFRT GEATLSEEFI NKKNIERLGL
ALAPWPGDIE VIGHTDNKPF RSTSGNNNLK LSAARASVVA DKLRESTQIN ETHQREISAI
GRGESDPLAD NATEEGRKRN RRVDILWKIG QRDADKAMKQ FLENPTPEVQ GTNTQQ
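
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch rather than the database's own method: it assumes the 10-base blocks are joined by stripping whitespace, and it uses the standard codon assignments, which translation table 11 shares for non-initiation codons. Table 11 also allows alternative start codons; that is not handled here, since this gene begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGAACGAAT TTGAACGCCA GATCCGTGCA"))  # MNEFERQIRA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared with the GC content field in the Gene Information section.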