Gene YpsIP31758_3428 details


Gene Information

Locus tag: YpsIP31758_3428
Symbol:
ID: 5385947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3859812
End bp: 3862031
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 50%
IMG OID: 640866441
Product: pentapeptide repeat-containing protein
Protein accession: YP_001402383
Protein GI: 153950018
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
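
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, assuming that the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3859812
end_bp = 3862031

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which is not counted as an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.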


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCATCA TTCGCCCGCA ACAGCTCGTG GTCTTGAAAA GCAGTTACCA GATAGGCCAT 
GAAAGCCATA TGGGGATCAG CGTCGTTGCT GGCTGTTATC TCTCTAAACC TGAGCATATG
GTGACGGAGT CACAGATCTG GCAGGCATGG AAAGCGGCAC CGCTCTCTTT CCGCATGTTG
GACAGTGCTG AGCCAAAACC CTTTGCTGAG TTTTTGCTGG CCGGTCATGC CGGTATTGGT
GAAGAGGTTA CCTCGCTGAG TGCAGAGGTT AGTGTTGGCT CACTGACCCG GCGCTGGTGT
ATTGAGGGCG AAAGCAACAA AACGGGTCTG GTCATTAAAC CTTTCTTACG TATGTCGATG
GACCATACGC AAAGTTGGGG GGGCAAAGGC TGTAAGGAGA ATCCACTGGG ACGTGGGTAT
AACGATGAGC GCAAACCGAC GATTATGTCT TTAGGCCTTG ACGGCTCTGC TATCGTTCGC
TCACCGCTGG CGTCTCCGTC ACCTGTGCCT CATGACTTCC AACTGCGTAA AGTGCATATC
AATGAAGTCG CCTCGACCAT GACCGATCCT CAATATCTGG AAACATTTTA TCCCGGTTTG
CCACCGCAAA TTGATCGTCG CTATTTCCAA ATGGCTCCGC CAGGGCAGTG GCTGAAGAAA
AGTGCATGGC CTGATAGCGT ACCGTTCAAA CTCATCGGTT TTCGCCCGGA CAATGAGGAG
ATCAGCGGTG CGTTTCCTGC GGTCAGTGCA AGGGCGTTTG TTTGGGATAA CCCTTCAGCG
CCCCCCAGTG AGGTGACCTT ACTGCGGAAA ACACTGTGGC TGTTACCGGA TAACGATATG
GGGCTAATGG TGTTTACCGG CAGCGTGCCA CTGACTCACC TTTTTGATGA GCCTATCGAT
ACGTTGCTGG TGGGATTGGA TGACTCCCAT TCGCTACGTG AGTTGGAATA TTACCAACAG
GTCTATAAAA GTCGCAGCGT TGAAGGTGCT GCGAGTTTTG AATTCCTCAA AGATCCGGAA
CTGATGCCAG AGGGGATGCC GCTGAACGTC ATCCGGGATT TGGCGGATCA CCCAGACTCG
CTGCGTTATA GCGCTTCCGC CATGTCTGAA GCGGAGTCCG AACGTTTCTA TCAGGATGTT
CAGGATGCTA TCGATCGGCA GGAACAGCAG AAGAGTGAAG AACAAGAGAC GCTGGGTGAT
TTGAATGTCC CCGCAGCCGG TAAAGAGGAA GCGGGAACCC AATGGTTGGA AAGCAAAGAA
GATACGGCAA CCAACGTCAC ATTTTTAGGG ACTGACTTCT CTGGAATGAC CTTGGACAAC
AAGCAATTCC GCTATTGCAT GTTTACCGGT TGCCATTTTG ACAAGGCGAC ATTTAAAGAC
TGCACCTTCG AGCATTGCCA GTTTACGCAA AGTGATTTTG AAAACTCCCG TTGGAACAAT
GTGCATTTAA GTGGCTGTTT ATTCAAACAG GCAGAGTGGC AAAAAGCCGC CTTTACCCAC
TGTAAATGGG AGAAATCCAC CTTTGAGTAT GGGGTGTTTA AACACGCTCA GTTTACCGAC
AATGCGTTAG ATAACTGCCT GATTAACCAT AGTGATTTCA GCCTTGGCAC GTTTGATCAT
TGTACGCTGA ATGGCTGTTT CTTCTCCGAA ACACATTGTG ATCAAACACA ATTTAATCAG
GTCATCATCA CGTCGTGCAT ATTCGAAAAA TGCGACGGCC CGAAGGCTTG CTTTACCGAA
AGCACGATAG AGAAAACCTC GTTTATTAGC AGCAGTTGGG TGGGGGGGCG CTTGAGTCAT
TGCTATCTCA ATAGCTTGAC CACGGGCCTG AATACCAATC TCTCTGAGTC GCATTTTGAG
CAGTGCAGCC TGAATAAAAT GGGCTTCCTC AAGGTCAATT TACAATCCAG TACCTTTATT
AATTGCTCGA TGTTGGAGAG TTGCTGCGAT AAGGCTGATT TCTCTCAGGC GACGCTGATT
GCCTGTGATA TGACCGCGGT ACGGTTAAAA GATGCCAACT TAGTCCATAG CCACTGGCAG
AACACCAGCT TACAGCAAAG CATGTTTTAC AACGCTGACT TACGTGATGC CACTTTCCAG
CGTTGCAATC TGGCGGGCGC TAATCTGGCG ATGATCAGCC AAAACATGGA CACCCGATTT
GAACATTGTT TGACGGAAAA GACGCACTGG ATCCCGCGTC GTTACACCGT CCCGGCATAA
 
Protein sequence
MRIIRPQQLV VLKSSYQIGH ESHMGISVVA GCYLSKPEHM VTESQIWQAW KAAPLSFRML 
DSAEPKPFAE FLLAGHAGIG EEVTSLSAEV SVGSLTRRWC IEGESNKTGL VIKPFLRMSM
DHTQSWGGKG CKENPLGRGY NDERKPTIMS LGLDGSAIVR SPLASPSPVP HDFQLRKVHI
NEVASTMTDP QYLETFYPGL PPQIDRRYFQ MAPPGQWLKK SAWPDSVPFK LIGFRPDNEE
ISGAFPAVSA RAFVWDNPSA PPSEVTLLRK TLWLLPDNDM GLMVFTGSVP LTHLFDEPID
TLLVGLDDSH SLRELEYYQQ VYKSRSVEGA ASFEFLKDPE LMPEGMPLNV IRDLADHPDS
LRYSASAMSE AESERFYQDV QDAIDRQEQQ KSEEQETLGD LNVPAAGKEE AGTQWLESKE
DTATNVTFLG TDFSGMTLDN KQFRYCMFTG CHFDKATFKD CTFEHCQFTQ SDFENSRWNN
VHLSGCLFKQ AEWQKAAFTH CKWEKSTFEY GVFKHAQFTD NALDNCLINH SDFSLGTFDH
CTLNGCFFSE THCDQTQFNQ VIITSCIFEK CDGPKACFTE STIEKTSFIS SSWVGGRLSH
CYLNSLTTGL NTNLSESHFE QCSLNKMGFL KVNLQSSTFI NCSMLESCCD KADFSQATLI
ACDMTAVRLK DANLVHSHWQ NTSLQQSMFY NADLRDATFQ RCNLAGANLA MISQNMDTRF
EHCLTEKTHW IPRRYTVPA
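
As a quick consistency check between the two sequences, the first three 10-base blocks of the gene sequence can be translated and compared with the first block of the protein sequence. This is a minimal sketch, not a general translator: the codon table is deliberately partial and contains only the codons that occur in this 30-base prefix, and the function and variable names are hypothetical.

```python
# Partial codon table: only the codons present in the 30-base prefix below.
# These assignments are the same in the standard code and in translation table 11.
CODONS = {
    "ATG": "M", "CGC": "R", "ATC": "I", "ATT": "I", "CCG": "P",
    "CAA": "Q", "CAG": "Q", "CTC": "L", "GTG": "V",
}

def translate_prefix(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces between blocks."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

# First three blocks of the gene sequence, copied from the record.
gene_prefix = "ATGCGCATCA TTCGCCCGCA ACAGCTCGTG"
protein_prefix = translate_prefix(gene_prefix)
print(protein_prefix)
```

The translated prefix can then be compared by eye with the first block of the protein sequence; a full check would need a complete codon table, for example from a sequence-analysis library.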