Gene YpsIP31758_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1779 
Symbol 
ID5384972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2056191 
End bp2057471 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content50% 
IMG OID640864762 
Producthypothetical protein 
Protein accessionYP_001400754 
Protein GI153948117 
COG category[S] Function unknown 
COG ID[COG4950] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01926] uncharacterized peroxidase-related enzyme 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCAAT TTCGCAGGAG AAATAATGCC CATTGGTATC ATGAGACTCA GTGTAGCGGC 
AGCCTGGAGC ATTGTAGCGG TAGCCCAGTG AATATTTCCA CGACAGTGAA TGTTCCCACG
ACAGTGAATA ATGAAAACCC CCCTGTTGAT ATTCGGCCAC GTGATCCCAG CGAGAGTGAC
AACATGAGCA CTGGCAGTAA TATAACTGAA CAGAGCATCA TAACTGAACA GGGCATCTTT
CTACTTGGCG TGACAGAAAA CATCGCTCCT ACATTACAAG ACACCCTCTA CCATGAGCAG
CCTATTCTTA CTGCCTCCGA CGCCATGTAT CAGGCCCTGT TCCCAACGAT TATCGAGATC
AACCACACCA ATACCTTCTC ACTTTATGAT CGGTTAAGTA CTGCGCTGAC GGTCGCTCAG
GTTACCGGGA TTCAGCGGCT ATGTAGCCAC TATGCTCTCC GTCTCGCGCC GCTCCCCAGC
CCGGATGCCT CAAGGGAAAG CAATATTAGG CTAACGCAAA TTACGCAATA TGCCCGCCAA
TTGGCCAGCC AACCTACGTT GATCGATAGG CATGCTTTAG CGCAATTGCA TGACGTGGGT
TTAACTGATA GCGATATCAT TATTTTATCG CAAATTATTG GATATGTGGG ATATCAAGCC
CGAGTGGTCG CTGGCATCTC TGCACTGGCT GGTTACCCTA CCGTGATGCT CCCCGGTTTC
CCCCGCATGG AAGATGCCGC CCCCAACCCA TTACCAGATG TCATGCCCAA TTGGCAAGGT
TGGCTACCGT CTCATGCGGC AAACGACGAT CAATCCGATA AAGAACCTGA CGAAACGGCC
AGCACACTGA CTGAACTGTT GGGCCATCAC CAGCAAAGTT TGCTCGCTTA TCACGCCATT
ACCACTCACC AGCCCAACTC ACCTCAATTG CAACGTGACT GGCTGGAACT GGTGGCATTG
GTCAGCGCAC GAATCAATGG CAGCCTCTAC TGCCAAGCCC GTCACAGGCA ACATTTACAG
CAACTGACGG AGCAGCCCCT GTTGGTCACT GAGCTGTTAA AAGGGATTGA TCACGCGTTA
TTCTTGTTAC CCGAACAACA AATACCCCAT CAGCTAATCA GTGTAACCGC CGAGCTCACT
CGCGCCCCGG AACGCTTTAA TCATCAGCAT GTTAAACGTC TACAGACCCT TGGCGTCAGT
GATACTCAAG TCATGCGTAT TATTTTCAGT ATCGCCATTA CTGGTTGGAC CAACCGCCTA
CGACATACGT TAGGAAAATA G
 
Protein sequence
MVQFRRRNNA HWYHETQCSG SLEHCSGSPV NISTTVNVPT TVNNENPPVD IRPRDPSESD 
NMSTGSNITE QSIITEQGIF LLGVTENIAP TLQDTLYHEQ PILTASDAMY QALFPTIIEI
NHTNTFSLYD RLSTALTVAQ VTGIQRLCSH YALRLAPLPS PDASRESNIR LTQITQYARQ
LASQPTLIDR HALAQLHDVG LTDSDIIILS QIIGYVGYQA RVVAGISALA GYPTVMLPGF
PRMEDAAPNP LPDVMPNWQG WLPSHAANDD QSDKEPDETA STLTELLGHH QQSLLAYHAI
TTHQPNSPQL QRDWLELVAL VSARINGSLY CQARHRQHLQ QLTEQPLLVT ELLKGIDHAL
FLLPEQQIPH QLISVTAELT RAPERFNHQH VKRLQTLGVS DTQVMRIIFS IAITGWTNRL
RHTLGK