Gene YpsIP31758_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3811 
Symbol 
ID5386005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4296675 
End bp4297745 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content49% 
IMG OID640866835 
Productsecretion system apparatus protein SsaU 
Protein accessionYP_001402765 
Protein GI153948750 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4792] Type III secretory pathway, component EscU 
TIGRFAM ID[TIGR01404] type III secretion protein, YscU/HrpY family 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCCA CTGAAAAGAA TGAAAAACCC ACGCCCAAGC GCCTTAAGGA AGCCAAAGAA 
AAAGGGCAGG TAGTCAAAAG TGTTGAGATA ACCTCCGGCG TACAGTTGGT GGCGCTGGTT
ATCTACTTTT TACTGACAGG ATATAGCTTG GTCGAGCAGG CTAAAGCCCT TATCCGCAGT
TCAATCATAC AATTACAGCA ACCACTTACT CTGGCTTTAG CCCGTATCGG TGCCGAGTGC
ATGACGGTAT TGATGCATAT CGTGGTGGTC TTGGGCGGGG CGCTGATCGT GGTCACCATT
ATTGCCGGTA TTGCTCAGGT TGGGCCGTTG TTGGCGACCA AAGCGGTGTC GTTTAAAGGT
GAGCGAATTA ACCCCATTCA AAATGCTAAA CAACTTTTCT CATTGCGCAG CGTGTTTGAG
CTGATGAAAT CATTATTGAA AGTGGGGGTG CTAACGCTGA TTTTTGGTTA CTTATTGATG
CAGTATGCGC CCTCTTTTGG TTATTTGACC CACTGTGGCA GTCGCTGTGC TCTCCCGGTC
TTTTCGACAC TAATGGGGTG GTTATTAGGC TCACTGATTG CCTGCTATCT GGTTTTTTCA
TTGATGGATT ATACTTTTCA GCGCTATACC ATCATGAAAC AACTGAAAAT GTCCCATGAT
GAGGTCAAGC GGGAGCATAA AGACAGTAAT GGGGATCCGC ACATTAAGCA GAAGCGGCGG
CAGCTACAGC ATGAAGTACA AAGTGGTAGT TTTGCCACTA ACGTGCGGCG TTCCACTGCG
GTAGTCCGTA ACCCGACACA TTTTGCCGTT TGTCTGGTTT ATCACCCAGA AGAGACGCCG
CTACCGATAG TGATTGAAAA AGGTCATGAT GAGCAGGCTG CATTGATTGT GAGCCTCGCA
GAGCAGAGCG GTATCCCTGT GGTAGAAAAC ATTGCGCTGG CGCGCGCATT ACACCGTGAT
GTTGCTTGTG GTGACACCAT CCCAGAACAA TTCTTCGAGC CAGTTGCTGC GCTCTTACGT
ATGGCACTGG AGTTGGATTA TCAGCCATCA AGTGATGATC CACCGCGCTA G
 
Protein sequence
MMSTEKNEKP TPKRLKEAKE KGQVVKSVEI TSGVQLVALV IYFLLTGYSL VEQAKALIRS 
SIIQLQQPLT LALARIGAEC MTVLMHIVVV LGGALIVVTI IAGIAQVGPL LATKAVSFKG
ERINPIQNAK QLFSLRSVFE LMKSLLKVGV LTLIFGYLLM QYAPSFGYLT HCGSRCALPV
FSTLMGWLLG SLIACYLVFS LMDYTFQRYT IMKQLKMSHD EVKREHKDSN GDPHIKQKRR
QLQHEVQSGS FATNVRRSTA VVRNPTHFAV CLVYHPEETP LPIVIEKGHD EQAALIVSLA
EQSGIPVVEN IALARALHRD VACGDTIPEQ FFEPVAALLR MALELDYQPS SDDPPR