Gene YpsIP31758_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0549 
Symbol 
ID5388221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp646198 
End bp647376 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content47% 
IMG OID640863520 
ProductHNH endonuclease domain-containing protein 
Protein accessionYP_001399542 
Protein GI153950788 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.663462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACTC ATTACTCTGT TATGGATTTA GGGGAAAGAT ATGGCGCACG ATTTTCCACT 
TTGGAGGAAA TGGTTCCTTG CTTCGGTATA GATGCTAATG TGTTCATCCA GGCGTGGCTT
GAAGGGCGGC TTCCTCTTTA CATTTACTTC GGTAATGAGA GCCGACCATG TACGATCAGG
CGCTGTGTTT CCGCTAAAGT ACATGAACAC GTCATGCATG ACATACTTTA TGGCCGTGAT
TTCTATCAGA GCAAAGCGAG TCCAGACAAC GATGTATTGA TGTTTGTACC AGAAACTCCG
TTAGTCTGTA AGACTAAATT CAGGGGGGAT TACAGATTAT CAACCGGAAC TCATGCCAAC
GTGGAGAAAG GAACACATGG ACACGTTAAC AACCGCTACC CCATCGGTGT TAGCGCCCGT
GGTCGTATTG CTGATGGCGC GGTAGGTACC CTTGAGGGGC TAGGTACTTT GATGGGGCCT
TCCGCCCAAG AATATATGGC TGGTGCCTTT AATCCAGAGC AGGCCGCAAT AAATAAAGTC
CGACAGCAAA ATCAACAAGC CGCTGGCAAG GCTATTTATG ATAATACGAA AGGGGCGGTG
ACAGACGCTT ATCAGCGCAA TGGATTAGCC GGTGCGGCCG CCATGGTAGT CACGGCATCC
GTGGCGGAGT TGGCGGGTAC TAAGGGGTTG GGAACGGTAG AAAAAGTTGG CACATTAGGC
GATGTCGCTA AGTTAGGGAA AGCTATTGAG CTGGAAAAAC TAGAGGGGTA CCTTGGCACT
TATAAAGGTC AGAAAGTATT GCTACAAAAC GTCGATGTTG TGAAGATGGA TTATTTCCGA
CGAGACCGTG CAGAGGCCGC TATGTTGCGA AGCCAATTCC GCTCTGTTCG GACTAAATTT
GTTAAATCTA TAGCAAATAA TCCCGACGTT GCTAAGCGCT TTACTTTAGA GCAAATAGAC
GGCTTGTCTA ATGGCATTAC ACCTAGCGGC TGGGTTGTGC ATCACAAACT ACCCTTAGAC
GACAGCGGAA CTAATGCGTT AGATAATCTA GTGCTTATCA AAGACAGCCC AGAGCATACT
GTTCTGACTA ATGCGCAAAA GAAAATCACT AACGGATTGC CACACGAGGC TTCGAAAGAA
GTGCTTTGGC CGATTCCTCA AGGTCTTGTT TACCCATAG
 
Protein sequence
MRTHYSVMDL GERYGARFST LEEMVPCFGI DANVFIQAWL EGRLPLYIYF GNESRPCTIR 
RCVSAKVHEH VMHDILYGRD FYQSKASPDN DVLMFVPETP LVCKTKFRGD YRLSTGTHAN
VEKGTHGHVN NRYPIGVSAR GRIADGAVGT LEGLGTLMGP SAQEYMAGAF NPEQAAINKV
RQQNQQAAGK AIYDNTKGAV TDAYQRNGLA GAAAMVVTAS VAELAGTKGL GTVEKVGTLG
DVAKLGKAIE LEKLEGYLGT YKGQKVLLQN VDVVKMDYFR RDRAEAAMLR SQFRSVRTKF
VKSIANNPDV AKRFTLEQID GLSNGITPSG WVVHHKLPLD DSGTNALDNL VLIKDSPEHT
VLTNAQKKIT NGLPHEASKE VLWPIPQGLV YP