Gene YPK_2666 details


Gene Information

Locus tag: YPK_2666
Symbol:
ID: 6087967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2933332
End bp: 2935623
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 47%
IMG OID: 641597735
Product: hypothetical protein
Protein accession: YP_001721396
Protein GI: 170024891
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
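
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the record above.
start_bp = 2933332
end_bp = 2935623

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 2292
print(remainder)       # 0
print(protein_length)  # 763
```

Under these assumptions the computed values agree with the Gene Length (2292 bp) and Protein Length (763 aa) fields.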


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTTTTA TCGCGTTATC AACAGATCGG GTCGCCGCTG CTGTCATTGT CGGGATCTTG 
CCTCTTATTT TTCTGCATCA ACTACCAGGG CCCACTATCA TTGGTTCTCT ATTAGCCCTA
AGTGCATTTT TGTGGTTAAG CCGCAATCGT TACTGCCAAT TTTTAGCGCT GATTGTGATT
AGCTTCCTGT GGGGGGTTTG GCATAGCAAT GGGATACTGA TGCAAACAGA AGCATTAACC
CAGGGGGATC AGCAGATAGT TGCGACAATA AACAGTTCAT CTCTTTCATG GGATGACGGC
CAGAAAGTTG TGATAAGTAT TCAGAAAATT AATGAAAAAC GGGTTTTCCC TCCAATAGCA
GTCAACGTTA AGTGGCCAGA ACGGTTGGAT CAATACTGTG CAGGGCAACG CTGGGCGTTT
AGGTTACGTA TGAGGGCAGT ACATAGCGTA CTGAATGAGG GCGGGTTTGA CAGTCAGCGC
TGGGCTATTG CTAACAGACG TCCACTACAA GGGCGCATCA TTGAGGCTAA ATTACTGGAT
GCAGAGTGTA ATTTTCGTCA GCAAATTATC AGTAGCATAG AGCAGCAACT GGTGGGATAT
GATCAGCGGC GGATCATGCT AGCACTGGCC TTTGGTGAAA GATCACAACT GAACAAAGAA
GAGTGGTCAC TCTTACGCTA CACCGGCACT GCGCACCTGA TGGCGATTTC CGGTTTACAT
ATTGCACTGG CGGCCTTATT TGGTGGGATG CTTGCCCGAT TAGTACAGCT CCTTTTTCCT
GTCAGTTGGA TTGGGCCTTT GCTACCGCTA CTGATTGGCT GGCTGATTGC TATGATTTAT
GTCTGGCTGG CAGGAGCAAA CTCACCAGCA ATCCGGGCGG CCATTGCGTT AACGCTGTGG
CTGCTGCTAC GTTTGTTCGG TATTTTATGT AGCCCATGGC AGGTGTGGAG GTGGGCTTTG
GGGCTAATTT TAGTCAGCGA CCCGCTAGCC GTATTATCAG ACAGCTTTTG GCTCTCCTGC
CTGGCGGTAT TTAGCCTGAT ATGTTGGTTT CACTTGGCCC CTGTTTCTTC TCGTTTCATT
ACTGGTTGGT ATGGCTTGGT TATCCGTTGG TTTCATTTGC AGTTTGGTAT GATGCTACTG
CTGATGCCGT TACAGATAGG GTTATTCCAC GGTATAAGTT TATTTTCTAT ACCCGCCAAT
CTGTGGGCGG TCCCAATAGT CTCATTGTTC ACCGTACCCT GCGTATTATT AGCATTGGCT
TTAGCATTGC TCCCTGCTGT TGCTGATATA TTTTGGTTCT TGGCTGATAT CTCGTTGACT
GTGGTGCTAT TCCCGCTCAA TCAGCTAAAA GAAGGCTGGT TACACACTGG CATGGCTTCT
GTTGCGATCG GTTATGGTGG TTGGCTGGCA TTATTTATCT GGCGCTTTCA ATGGTGGCGC
AGTCATCCTC TTGGTGTCAT TGTGCTGTGT ATGAACATGG TATTACTGAC TCAACGGCGT
GATGAGTATC ACTGGCGTGT GGATATGCTG GATATCGGGC ATGGTCTGGC TGTGGTGATT
GAGCGTGAGG GGAAAGCCAT TATCTATGAC ACTGGCAATC ATTGGCCTAC AGGTAATATG
GCCGCTATTG TCGTTTTACC GCTCCTTAAA TGGCGTGGCA TTACTGTTGA ACAGATTATC
CTTAGCCATG ACCATCAGGA CCATACTGGC GGTTTGGCTG TACTCTTGGA TGCTTTTCCG
CAGGCAACGG TACGTGCGCC TTTCTCTGTA AAAAACGTAG CCAATACTCT GCCTTGTAAA
CAGGGGGAGA GATGGCAGTG GCAAGGTTTA GATTTTGACG TGTTATGGCC GAAAGAACAG
GTGGTTAATG CTCAGAATAA TGACTCATGT GTCATTCGCA TTAATGATGG TAAACATAGT
GTATTGTTGA CCGGGGATCT TGAGTCCCAA GGAGAACGGC AGTTGGTGAA CGATATCCGG
GGAGAATTAA CATCAACGGT GCTGCAAGTG CCCCATCACG GCAGTAATAC TTCTTCAACC
GCGCCTTTTC TACGGGCAGT TAGCCCAGAA TTGGCCCTCG CTTCTGTTGC TCGTTATAAC
CAATGGCGAC TACCTGCGAA AAAAGTGATC AATCGCTATC AAAAAAATGG CATTATTTGG
CGTGATACAT CAGTATCAGG GCAATTAAGT GTATATTTTC ACTGCGATAC TTGGTTCGTT
AAAGGCTATC GGGAACAATT AAAGCCACGT TGGTATCACC AGCGGTTTGG CGTTAGAGGT
CATAATGAGT AG
 
Protein sequence
MVFIALSTDR VAAAVIVGIL PLIFLHQLPG PTIIGSLLAL SAFLWLSRNR YCQFLALIVI 
SFLWGVWHSN GILMQTEALT QGDQQIVATI NSSSLSWDDG QKVVISIQKI NEKRVFPPIA
VNVKWPERLD QYCAGQRWAF RLRMRAVHSV LNEGGFDSQR WAIANRRPLQ GRIIEAKLLD
AECNFRQQII SSIEQQLVGY DQRRIMLALA FGERSQLNKE EWSLLRYTGT AHLMAISGLH
IALAALFGGM LARLVQLLFP VSWIGPLLPL LIGWLIAMIY VWLAGANSPA IRAAIALTLW
LLLRLFGILC SPWQVWRWAL GLILVSDPLA VLSDSFWLSC LAVFSLICWF HLAPVSSRFI
TGWYGLVIRW FHLQFGMMLL LMPLQIGLFH GISLFSIPAN LWAVPIVSLF TVPCVLLALA
LALLPAVADI FWFLADISLT VVLFPLNQLK EGWLHTGMAS VAIGYGGWLA LFIWRFQWWR
SHPLGVIVLC MNMVLLTQRR DEYHWRVDML DIGHGLAVVI EREGKAIIYD TGNHWPTGNM
AAIVVLPLLK WRGITVEQII LSHDHQDHTG GLAVLLDAFP QATVRAPFSV KNVANTLPCK
QGERWQWQGL DFDVLWPKEQ VVNAQNNDSC VIRINDGKHS VLLTGDLESQ GERQLVNDIR
GELTSTVLQV PHHGSNTSST APFLRAVSPE LALASVARYN QWRLPAKKVI NRYQKNGIIW
RDTSVSGQLS VYFHCDTWFV KGYREQLKPR WYHQRFGVRG HNE
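
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted, which does not matter here because this gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGTTTTTA TCGCGTTATC AACAGATCGG GTCGCCGCTG CTGTCATTGT CGGGATCTTG"
last_line = "CATAATGAGT AG"

print(translate(clean(first_line)))  # MVFIALSTDRVAAAVIVGIL
print(translate(clean(last_line)))   # HNE
```

The two translated fragments match the first 20 and last 3 residues of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own in this way.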