Gene YpsIP31758_1606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1606 
Symbol 
ID5386681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1864959 
End bp1866080 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content48% 
IMG OID640864587 
Productcupin family protein 
Protein accessionYP_001400583 
Protein GI153947083 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTACC AACTCGATCT CGATTGGCCT GACTTTCTAC AACGCTATTG GCAAAAGCGC 
CCTGTTATCC TCAAACGTGG CTTCAAAAAT TTTATTGACC CACTCTCCCC AGATGAACTT
GCCGGGTTAG CCATGGAAAA TGAAGTCGAT AGCCGCTTGG TAAGCCATGA AAATGGGCGT
TGGCATGTTA GCCACGGGCC ATTTGAAAGC TTTGATCATT TAGGTGAAAA CAACTGGTCA
TTGTTAGTTC AGGCGGTAGA CCATTGGCAT GAACCCGCAG CGGCGCTAAT GCGCCCTTTC
CGTTCACTCT CTGACTGGCG TATGGATGAT TTAATGATCT CCTTCTCCGT GCCTGGCGGT
GGTGTTGGGC CTCATTTTGA TCAATATGAT GTTTTTATTA TTCAGGGTTC AGGTCGTCGC
CGCTGGCGGG TGGGCGAAAA AACTGAAATG AAACAACATT GCCCGCACCC AGATTTGCTC
CAAGTGGGGC CTTTCGACGC TATCATTGAT GAAGAAATGG AGCCAGGTGA TATTCTTTAT
ATTCCACCGG GCTTCCCTCA TGAAGGCTAT TCCCTTGAAA ATGCGCTGAA TTATTCCGTT
GGTTTCCGCG CCCCAAGTGG TCGAGAACTG GTCAGTGGTT TTGCGGATTA TGTATTGGCT
CGAGAACTGG GTAGCTATCG TTATAGCGAT CCAGATTTAC AGCTACGCGA GCATCCAGCC
GAAGTATTAC CGTCCGAAGT TGATAAATTG CGCACAATGA TGCTGGATCT GGTCCAGCAA
CCTGAACATT TCCAAAACTG GTTTGGTGAA TTTATTTCCC AATCACGCCA TGAGTTGGAT
ATTGCACCGC CGGAGCCGCC TTATCAGACC GGCGATATCT ATGAACTATT GAAGCAAGGC
GATGAATTAC AACGCCTTAG TGGATTACGG GTTCTGCGGG TTGGTGATCG TTGCTTTGCT
AATGGTGAGT TGATTGATAC GCCACACTTA CAGGCCGCCA ATGCACTGTG CCAGCATTTT
AGCGTGAATG CAGAGATGTT GGGTGATGCA CTTGAAGACC CTTCTTTCCT GGCAATGCTT
GCAGCACTGG TCAATAGCGG TTATTGGTAT TTTAACGACT GA
 
Protein sequence
MDYQLDLDWP DFLQRYWQKR PVILKRGFKN FIDPLSPDEL AGLAMENEVD SRLVSHENGR 
WHVSHGPFES FDHLGENNWS LLVQAVDHWH EPAAALMRPF RSLSDWRMDD LMISFSVPGG
GVGPHFDQYD VFIIQGSGRR RWRVGEKTEM KQHCPHPDLL QVGPFDAIID EEMEPGDILY
IPPGFPHEGY SLENALNYSV GFRAPSGREL VSGFADYVLA RELGSYRYSD PDLQLREHPA
EVLPSEVDKL RTMMLDLVQQ PEHFQNWFGE FISQSRHELD IAPPEPPYQT GDIYELLKQG
DELQRLSGLR VLRVGDRCFA NGELIDTPHL QAANALCQHF SVNAEMLGDA LEDPSFLAML
AALVNSGYWY FND