Gene YpsIP31758_3427 details

Gene Information

Locus tag: YpsIP31758_3427
Symbol:
ID: 5387172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3858683
End bp: 3859786
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 50%
IMG OID: 640866440
Product: pentapeptide repeat-containing protein
Protein accession: YP_001402382
Protein GI: 153948573
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
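
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the gene's final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3858683, 3859786)
print(length, protein_length_aa(length))  # prints "1104 367"
```

Both values agree with the Gene Length (1104 bp) and Protein Length (367 aa) fields of this record.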


Plasmid Coverage information

Num covering plasmid clones: 65
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCGT CCGACACCCC TATTATGCCT CCCGACAGTC CACGTCGGAT TATTGGCCGG 
CACTTTACAC AGCGTCAGTT GCAGCAGTTA ACGCTGTCTG AGGTTTATTT TATTCAATGC
ACCTTCACGG ATATTTCGTT TGCCGAGATT GCGGTACGCA ATATTCATTT TGAAAGTTGT
CATTTTACTC ACCTGCGGCT GGACGGTGGC AGAGTGGAAA ACTGCAGTTT CCGTTTCTGT
AAGTTCAATG ACGTTTCAGC GCAAGGCGTT TCCATACTCA ATACCTCATG TATAGAGGTG
CAATGGCAGG GATGCTATCT CACGGGGTGC TTAATTGAGC GCTGGTTGCT CACCAGTTGT
CAGCTCGATG ATGCCCAATT GCAGGATATC CAGCTTAATT ATTGGACCGT GCAAGACACC
CCGATGACTC ATCTGACGAT GAGTGACAGC AAAATGCAAG ATTCCAGTTG GCATGGCTGC
ACTATCCAGC AGTCAGTGTG GAAAAACAGC GAACTCATAC GTCAGGTGAT GGGCAGTTGT
GTACTGAAAG AGTGCCAATA TCAAGCGATT CAAAGCGATA CGGTGGTATG GAGTCAGTGC
CAGCTTGAGC AGGTTGATTT TCGGCACCAG CCGCTGGAAA ACAGTAATTT CCACAAAAGC
ACACTAACAC AGTGCGGGTT CAGTGACGCT AATTTGGCCG CCGCGCTGTT CAGTGAAGCG
ACACTCGAGG GCTGTGACTT CAACATGGCG CAATTAGCGG CGGCGCAGTT TGTTGATGCC
ACATTGCGCG ATTGCAATTT TGACACCGCC GATTTGCAAA ACGCCTCTTT GCTACGGGCC
AATCTGACGC GATGTCACTT AACGCAGGTC AATTTAACCA AAGCTGATTT GCGCAGTTGC
ATGTTGAGCG AGTCGTCATT ACAGGCCAGT AAACTTAGCA AAACCCGTAT CCACGGCGCG
CAAATACCCA CGCTGGACAC ACCGCTACAA ATGCCGGATC CCTTGCTGAG CCAGATTGAT
AACTGGTATG GCCATCATCA ACCCGGTCCA AAAAATAACC CTAAATTCCC TTCGATACCT
TCAGGAGCCA GCCGCTATGT CTAG
 
Protein sequence
MTPSDTPIMP PDSPRRIIGR HFTQRQLQQL TLSEVYFIQC TFTDISFAEI AVRNIHFESC 
HFTHLRLDGG RVENCSFRFC KFNDVSAQGV SILNTSCIEV QWQGCYLTGC LIERWLLTSC
QLDDAQLQDI QLNYWTVQDT PMTHLTMSDS KMQDSSWHGC TIQQSVWKNS ELIRQVMGSC
VLKECQYQAI QSDTVVWSQC QLEQVDFRHQ PLENSNFHKS TLTQCGFSDA NLAAALFSEA
TLEGCDFNMA QLAAAQFVDA TLRDCNFDTA DLQNASLLRA NLTRCHLTQV NLTKADLRSC
MLSESSLQAS KLSKTRIHGA QIPTLDTPLQ MPDPLLSQID NWYGHHQPGP KNNPKFPSIP
SGASRYV
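
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation, and the GC content by counting bases. The sketch below is one way to do both in plain Python, under stated assumptions. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. It does not handle table 11's alternative start codons, which does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the input is assumed to contain only A, C, G, T and whitespace.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGACACCGT CCGACACCCC TATTATGCCT"))  # prints "MTPSDTPIMP"
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.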