Gene YPK_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1947 
Symbol 
ID6087505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2161677 
End bp2164010 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content47% 
IMG OID641597016 
Producthypothetical protein 
Protein accessionYP_001720689 
Protein GI170024184 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.231119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGATA TCCCAAGAGA AGCATGGGTT GCACTGAGTT TTGCTATTTT ATGCCTGTTT 
TTTGTTTGGA AAAATATCCG TTGGGCTAAG ATTGGTTCTG GACCGTCCAT CAATTCAGAC
GTACCGAAAG CCTCAGCGAT ACCCGATCTG GGCGGTTATC AGCTAGACTC TCTGGCCCTT
CATCCTGAAC TCGTCATTGA TGATGAGCTT ACCGATCACC TTGATCAATC TGCTAGTGCT
AAACCCTCTT CTGGTGATAA ACCCTCTGCG AGTACTAAAC AGTCAGAAAT TTCAAGAGGC
TCTCTGGCCC GTGCGTCCTG GCTTCGCCCC GGCGAATTAG CCGTGGTTGC CGGGATAACC
TTAGCCGATG GCATGATCTA CATTGGTAAT AAACATAAGA GCAATCGCAC CGGAGCAGAG
CCGGCCTTTA TCAATCCATC AATGCAGGTC GCCGAAGAAG AGGTAGACAT TTCATTGGCG
CTGATCGATA ACGCCCCCGA TTACTCGACA TGTTCCCCGG AGGCGAGAAA AGCGTATTTG
CAATGGTTAT CGGAAGGTCG TAAATCTCCA ATTGCTCATA TTAGTTATGT TTTCTTATTT
TTTTATGGAT TAGAGCACCG CGTACTGATT GATGCGAGTA TTGATCCAAT CGCAAAACGA
GAGCTGCCCA TTATTGAGGC GGAAATTAAC CGATTGCTGA ATATTTATGG AAAAAATAAC
ACTTTTAATC ATTATGCTCA GAATTTATTA GTTTATATCG CCAGAGTCGA TATAGAGGAG
ACGCTCTATT TATTGCCACC GCCATTAAGC CGAAATGCCT CAACCGAACT GCCACTTGCT
TTGTTGGTTG GCTTAGGGCA ATTGGCTGTG GACCAACGGC CATTACCTGC AGAGTGGGCC
TGTGCGTGGG GGGTTGCGGA TCCAGCCATT AACCAAAACA GATCCTTGAT GCATTGTCCT
GATGCCTTCT CCCAACTTTT CCAACAGCGC TATCGGGATA AATACGGCGA TGGCTTTATC
TTGCCCGTCA ATAAAACCAA ACTACATATT TTCTACCGTC CTGCTTCCGC AGGGTTAATG
GGGCAAGAGT TCATCCAGCA GATGGGTGAG TTACCCAACG TGGCAGTATG TAAAGCCAAT
AGAGATAAGC TGCGGTCATT AATTGAAGCA TGTCAGAACG ATCTCGAGGC TTATAACCGC
TTTGTGAGTT TGAATCCTAA GAAAAAAGCA GCATTAGAAG GGCTGTTACT TCTCCCGATG
CCGTTGTGGC CAGAGGCATT AAACGCTGAG CTGGAAAGTA TTAAAAGCCG AGTGGGCTAT
GGCTTACTGT TGATAACATT GGATGAACTT TTTGCCCAAC TCAGCCATGC CTGCGCATCA
AACCCACTAG AACGTTTATC CCGCAATGTG GTCATTGCGC TGACTCAGGC GTTGGCATCC
TTGCAGGTTG GTATAGAGCC TGATGTTCGG GCAGATAATC GTACACCAAC CTTGCAAGAT
ACAGTTGCTC TGTTTCCTAT TGAGTCCGCG ACGGCGGAGT CGACAACCTT TGCCGTCTAC
CGTATCACGA TGTTAGCCGT CGAGTTGGCT TGTGCCGCAG TGATGGTTGA TGGCCGAGTT
GGGGAACCCG AATCGATGAT TTTGACGCGG CATATTGATG CCTGGAGCTA CCTTAGCCCT
GGTCAACGTA TGCGTCTGAA AGCCTATCTT CAACCGGGGG TCAGGAAGAA TAATACGTTG
TTGGCCTTGA AAAATAACCT TGAGCCGCTA TCGCTTGAAC ATCGCCGTGC TATTGCCCGT
TTCCTGGCAC ACCTGATTCA GGTCGAAGGG ACCATTACCC CACAAGATGT GAAATTTCTT
GAGCGGGTAT ATAAGCGTTT TGCCCTAGAC AGCAAACTGG TTTATACCGA TCTTGATAGC
TGTGCGACCC CGATGACATT AAGTTTGCCG CCACCAGGTT CGGTAGCACT AAATAGTGTT
GATCGCATTA CGCTACTGAA AAATGAAAGT CAGGAATTGG TGGCGCTGTT GGAGAATTTA
TTTCTAACCA AAGTGGTGGC GACAGAGGAT AAAGAAGAGA TTGATCCACA ATCAATGGCG
GCAATAATGG CTGATATGCA GGAAGGAACC GTGACGCTCC TACGATTACT GATTTCACGC
AATGCATGGG AGCGTGATAA ATTAACCGAT ATCGCGCTGG ACATGGAAAT CAGACTGGAC
GATATACTGA TTGAGATTAA TCGCAAGATA TTGGCTGTTT TTGACGTACC GCTGATTACT
GGCGATGCGG TTATTTCAAT TAATCGTGAT GTCTCGGCTG CATTATTGGC CTGA
 
Protein sequence
MTDIPREAWV ALSFAILCLF FVWKNIRWAK IGSGPSINSD VPKASAIPDL GGYQLDSLAL 
HPELVIDDEL TDHLDQSASA KPSSGDKPSA STKQSEISRG SLARASWLRP GELAVVAGIT
LADGMIYIGN KHKSNRTGAE PAFINPSMQV AEEEVDISLA LIDNAPDYST CSPEARKAYL
QWLSEGRKSP IAHISYVFLF FYGLEHRVLI DASIDPIAKR ELPIIEAEIN RLLNIYGKNN
TFNHYAQNLL VYIARVDIEE TLYLLPPPLS RNASTELPLA LLVGLGQLAV DQRPLPAEWA
CAWGVADPAI NQNRSLMHCP DAFSQLFQQR YRDKYGDGFI LPVNKTKLHI FYRPASAGLM
GQEFIQQMGE LPNVAVCKAN RDKLRSLIEA CQNDLEAYNR FVSLNPKKKA ALEGLLLLPM
PLWPEALNAE LESIKSRVGY GLLLITLDEL FAQLSHACAS NPLERLSRNV VIALTQALAS
LQVGIEPDVR ADNRTPTLQD TVALFPIESA TAESTTFAVY RITMLAVELA CAAVMVDGRV
GEPESMILTR HIDAWSYLSP GQRMRLKAYL QPGVRKNNTL LALKNNLEPL SLEHRRAIAR
FLAHLIQVEG TITPQDVKFL ERVYKRFALD SKLVYTDLDS CATPMTLSLP PPGSVALNSV
DRITLLKNES QELVALLENL FLTKVVATED KEEIDPQSMA AIMADMQEGT VTLLRLLISR
NAWERDKLTD IALDMEIRLD DILIEINRKI LAVFDVPLIT GDAVISINRD VSAALLA