Gene YpsIP31758_2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2200 
Symbol 
ID5386051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2520904 
End bp2522103 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content47% 
IMG OID640865184 
Producthypothetical protein 
Protein accessionYP_001401170 
Protein GI153948534 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.243728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCA TTAATCGTTA CGGAGTTACT GGCACTACGT TGAGTGAATA TCTGGATACC 
ATGCGCCAGC ATTATCTTGC TATTGATGAT GGCTGGAATA TTAATCCGGA ATCACCGGAT
GGTTTGGCAA TCGCTATTTG GTGTGAGGTA TTAGCCAATT TGGATGAAGC GGTGATTAAT
GCCTATCACG CCGCAGATCC CCATTCAGCG ATTGACCAAC AATTAGATCG CATCGCCGCA
TTTGCTGGAA TCAAACGCAA AAGTGCGACC TATTCAACCG TGACCGTTAA TTTTCACGGT
ATCGCTTTTA CCCCTGTCAG GGCCGGAACA TTAATCAGAA ACAGAGTAAC CAATACTTTA
TGGGCGACCG ATGGTGATGT TGTCACCGAC GCGGCAGGTA ATGCGACGGT TAACGCCACT
TGTACGTTGG CTGGAGCGCA GGGGGCTAAT AGCGATAACC TGACAATCAT CGCGACCCCC
ATCGGCGGTA TTACAGCAGT GACTAATGGC GCGACGGCTT CAATGGGGCT GGATAAAGAA
ACCAATAATG CGTTTCGCAT TCGACGCAAT GAGTCAGTCG CGTTACCAGG ATCAAACCAG
ATTGATAATA TTTATGCGGC ACTGGTCAAT ATTGAGGATG TTAAACGGGC GCGTATCTAT
GAAAACGTTG AGGATCAAGC GGATGAGAAT GGGATATTCG GCCACTCAAT GGCGATATTT
GTTGATGGTG GCAGCATTGA GGATGTGATC TGCAGTATCG CAACCAATAA AAGCCCCGGC
TGCGGGCTGA ACCGCTACAA CACATTTCCC AATAAAATCT CGTTGGATAC TGTTACACCA
AAAGGTAACC CGATCACCGT AACCTTCTTT CGCCCACAAC TGGTACCGGT TTATGCAAAG
GTAGAGGTTG TCAGCAATGC TGAATTCATT GACGATGAAA TAAAGCAGGC CATTGTTGAG
TACAGCATTA CTGGCTTTGA TCAGACCAAT GGGTTTTCTA AGCTGGGCTT TAAAATTGGT
GAAAATATTG GAGCTGGCCG CTTATTTACC CCTGTCAATC ATTTGGTGGC TGGTAATGGC
TTTGTCAATG CAATCACTGT AGGCACCACT ATCGAACAGG CCAGCAATAG CGTAGTGAGG
ATCGCCTTTA ACCAGTTAGG CATATTTAGT GCCGAAAATA TCGAGGTGGT CTATGTATAA
 
Protein sequence
MATINRYGVT GTTLSEYLDT MRQHYLAIDD GWNINPESPD GLAIAIWCEV LANLDEAVIN 
AYHAADPHSA IDQQLDRIAA FAGIKRKSAT YSTVTVNFHG IAFTPVRAGT LIRNRVTNTL
WATDGDVVTD AAGNATVNAT CTLAGAQGAN SDNLTIIATP IGGITAVTNG ATASMGLDKE
TNNAFRIRRN ESVALPGSNQ IDNIYAALVN IEDVKRARIY ENVEDQADEN GIFGHSMAIF
VDGGSIEDVI CSIATNKSPG CGLNRYNTFP NKISLDTVTP KGNPITVTFF RPQLVPVYAK
VEVVSNAEFI DDEIKQAIVE YSITGFDQTN GFSKLGFKIG ENIGAGRLFT PVNHLVAGNG
FVNAITVGTT IEQASNSVVR IAFNQLGIFS AENIEVVYV