Gene YpsIP31758_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1228 
SymbolpstC1 
ID5386969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1448001 
End bp1450232 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content54% 
IMG OID640864206 
Productphosphate ABC transporter, permease protein PstC 
Protein accessionYP_001400209 
Protein GI153949838 
COG category[R] General function prediction only 
COG ID[COG4590] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCAAA CAAGCATAGC GTCACCATCA GGGAGAGATA AGCGCCGGGC TTTTATTGAT 
CGTTGGGTGC GCATGACAGT CACCAGTGGT GGTTTGCTGG TATTACTGAC ACTGATGCTG
ATATTTGTCT ACTTATTGTA TGCCGTATTG CCATTATTTA AACCTGTTTC TATCCGTTTG
GATAATCAGT TCGTGGTGAA ACGCAATGCG CCGGCACTCG CTCTGGGAAT GGATATTCAA
GGGAAAATAG CTTATCGCAT TGATAACCAA GGTAAAGGTG TGTTTGTTCG CCTTGGTGGG
CAAGGGAGTG CACCTGCTGG CAGTGTCATC AATGAACAGG CGTTATCCGC GCCGCCCGTC
TCATGGAGCC GTGCCATCGG CGGGCAGCCT TTATATGGTG CCGGTCTGAG TAATGGCCGC
TTTACTCTGC TACAACCCGA TTTTTCTGCC ACCCCGCCTG GCTGGCAATT TCCATTGGGC
GATCAACCGC GTGCGCTCGA TATACAAGGC AAGCGCTTGC TGCATTTGGC GCTGGCAGAG
CCGCAACCAC AGCAATTTTC CGTTGCTGCC ATTACCGATG ATAGCCGCTT GTTGGTGGGG
CAATTCACCC CGCAGGGGCA GCAGATAACC ACTCTTGGCG CGGTGCCCGC CACTGTCGAT
CAACTGTTGC TCGCGCCTGA TGGCCGCTTA CTCTATGTGC TTTCGGGCAA TCAAGTGCAT
ATTTACCAAT TGGTCAATAG GCCATTAGAG AGTGAAAAAC TCGCGAGTGA CGCACCAGGT
TCCGGCTGGC AGTTACGCGA AGTGGTGTCT CTGGTGGGTG AAAAGGAAGC GCTCAGTGGG
CCTTTAACGC TCTCCTTATT GGCTGGAGGG AAATCGCTGT TGGTGCAATC ACCGGATGGT
GTGGTGACCC AATGGTTTGA TGTGCGAAAA GGGCCAAGCC CACAGTTCCA TTTGACGCGG
ATCCGCAGTT TTACACCAGC AAGTCAGGGG GTTCTCACCG CAGAGAACAC CCGCCGGGTC
TTTGCTTTGC TCTCGCCACA CGGTGAGTTA TCCCTTTTTT CCAGCATCCA GTCACCGCCG
TTACTGCACT CTACGTTGGT CGAAGGTATC AGCCATGCCG ATTTTTCGCC GTGGGGCGAT
AGCCTGTTAG TGGAAAACGC CTCCGGGTGG TCAGTTTATC GGCTAGACAA CCGTTATCCG
GACATCACCT GGCGTGGCCT GTGGCAGCCG TTATGGTATG AAAACTACCC AGAGCCAGCC
TATGTATGGC AATCGAGTTC TGCCGAAGAG AGCTATCAGG CCAAATTTAG CCTGATACCG
ATTATCTTCG GTACCTTAAA GGCGGCCGTT TATGCCATGT TATTCGCGGT GCCGTTGGCA
TTGGCGGGGG CCATTTACAC TGCGTATTTT ATGTCTGCGG GGTTGCGGCG AGTCGTTAAA
CCGACCATTG AGATGATGGG AGCATTCCCT ACGGTGGTTA TCGGTTTAAT TGCAGGGATC
TGGCTGGCAC CGGTGATTGA ACGTTATCTG GCGGGGATCT TGTTATTGCC GCTGCTGCTA
GCCCTAGCGA TTTTATTGTG TGGTTGGGGC AGCGCTCGCC TGTCGGCCAA GACACAATGG
CCATTACCTG CGGGCTGGGA TGTGATGGTA CTGCTGCCGG TGATCTTACT GACCGGATGG
CTGGCCCTGT GGTTGGGGCC ATCATTGGCG GTGTGGGCGT TGGGCCAACC GTTGCATGAG
TGGCTGGGGG ATGATTACGA TCAGCGCAAT GCGCTGGTGG TGGGGGTGGC AATGGGGTTC
GCCTTAATCC CTATTATTTT TTCGCTGGCA GAAGATGCGC TGTTCAGTGT GCCCCCGTCA
TTAAGCCAAG GCTCGTTGGC TCTGGGGGCA ACCCCCTGGC AAACATTGCT CCACGTGGTG
CTGCCTTCAG CCTACGCCGG TATTTTCTCT GCGGTGATGA TAGGTTTTGG CCGTGCGGTG
GGTGAGACCA TGATTGTGCT GATGGCAACC GGTAACTCGC CCATTATTGA TGGCAGCATT
TTCCAAGGGT TACGGGCGAT GGCGGCCAAT ATAGCCATTG AAATGCCGGA AGCGGTGGTC
GGCAGTGGGC ACTACCGGGT GCTATTCCTG ACGGCATTGG TCTTATTCTG CTTTACGTTT
TTGGTGAATA CGCTGGCAGA AGCGGTTCGC CTGCGGCTGC GTGAACGCTA TCAAATGGAG
CGGACAGCGT GA
 
Protein sequence
MRQTSIASPS GRDKRRAFID RWVRMTVTSG GLLVLLTLML IFVYLLYAVL PLFKPVSIRL 
DNQFVVKRNA PALALGMDIQ GKIAYRIDNQ GKGVFVRLGG QGSAPAGSVI NEQALSAPPV
SWSRAIGGQP LYGAGLSNGR FTLLQPDFSA TPPGWQFPLG DQPRALDIQG KRLLHLALAE
PQPQQFSVAA ITDDSRLLVG QFTPQGQQIT TLGAVPATVD QLLLAPDGRL LYVLSGNQVH
IYQLVNRPLE SEKLASDAPG SGWQLREVVS LVGEKEALSG PLTLSLLAGG KSLLVQSPDG
VVTQWFDVRK GPSPQFHLTR IRSFTPASQG VLTAENTRRV FALLSPHGEL SLFSSIQSPP
LLHSTLVEGI SHADFSPWGD SLLVENASGW SVYRLDNRYP DITWRGLWQP LWYENYPEPA
YVWQSSSAEE SYQAKFSLIP IIFGTLKAAV YAMLFAVPLA LAGAIYTAYF MSAGLRRVVK
PTIEMMGAFP TVVIGLIAGI WLAPVIERYL AGILLLPLLL ALAILLCGWG SARLSAKTQW
PLPAGWDVMV LLPVILLTGW LALWLGPSLA VWALGQPLHE WLGDDYDQRN ALVVGVAMGF
ALIPIIFSLA EDALFSVPPS LSQGSLALGA TPWQTLLHVV LPSAYAGIFS AVMIGFGRAV
GETMIVLMAT GNSPIIDGSI FQGLRAMAAN IAIEMPEAVV GSGHYRVLFL TALVLFCFTF
LVNTLAEAVR LRLRERYQME RTA