Gene YPK_2045 details

Gene Information

Locus tag: YPK_2045
Symbol:
ID: 6089010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2276500
End bp: 2277927
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 51%
IMG OID: 641597112
Product: bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase
Protein accession: YP_001720785
Protein GI: 170024280
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0134] Indole-3-glycerol phosphate synthase; [COG0135] Phosphoribosylanthranilate isomerase
TIGRFAM ID:
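
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but that convention reproduces the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2276500, 2277927)
print(length)                     # 1428
print(protein_length_aa(length))  # 475
```

Both results agree with the Gene Length and Protein Length fields.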


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGAAA CTGGGGGTTA CAAAACCGAG GGTTATAACG TTGGCAGTGA CAAAGTTGAC 
AGTGATAAAA CCAAAACCGT GCTCCACCAA ATCGTACACG ATAAAGAAAT TTGGGTTGCC
GCGCGGAAAC TGCAACAGCC TCTAACCCGC TTCCAAAACG AAATCACCCA GAGTCAGCGC
GATTTTTATC ACGCGCTACA AGGCGATAAA ACGGTCTTTA TTTTGGAATG CAAAAAAGCC
TCACCTTCTA AAGGGGTTAT CCGTGACAAC TTTAACCCGG CGGAGATTGC CGGTGTTTAT
AAGCACTATG CGTCGGCTAT CTCAGTATTA ACGGATGAGA AATATTTCCA GGGCAGTTTT
GATTTCTTGC CACAAGTCAG TGCCGCCGTC ACTCAGCCGG TATTGTGTAA AGATTTTATT
ATTGATGCTT ATCAGATTCA GCTAGCGCGG TTTTACCACG CTGACGCCAT TTTACTGATG
CTGTCGGTCT TGGACGATGA GGCTTACCGC CAATTGGCCG CCGTCGCACA CAGCCTGAAC
ATGGGGGTGT TGACCGAAGC CAGTAACGCC GAAGAATTGG AGCGTGCTAT TACCTTGGGT
GCCAAAGTTG TTGGCATCAA TAACCGCGAC CTGCGTGACC TGTCTATCGA TCTGAATCGC
ACCCGTGAAT TGGCACCACG CCTACCAGAA GGTGTCACAA TAATCAGTGA ATCTGGCATT
AGTCATTATC GTCAGGTCCG TGAATTGAGC CAATTTGCCA ACGGTTTCCT GATTGGCAGT
GCCCTGATGT CCGAACCCGA TCTCAACGCG GCCGTCCGCC GGGTGTTACT GGGCGAAAAT
AAAGTTTGCG GCCTGACACG CGCACAAGAT GCCGCCACGG CTTACCACGC AGGTGCGGTG
TACGGCGGGT TGATTTTTGT CGACAGTTCA CCGCGGTATG TGGATATCGC CAGCGCCCGT
ACGGTTATCA GTGGTGCGCC GCTAAAGTAT GTCGGTGTTT TTCGTCATGC TGAAATAGAA
ACTGTACGGC AAACGGCTGA ACAACTCTCA CTGGCAGCAG TGCAATTGCA TGGGCATGAA
GATCAACAGT ATATCAATCA ACTGCGCAAA GTATTACCTG CGGGTTGCCA GATTTGGAAG
GCACTGAGTG TCGGTGACAC GATGCCGGAA CGCAACTTAC AGCAAGTTGA ACGCTACGTA
CTGGATCACG GTACGGGTGG CACAGGGCAA CGTTTCGACT GGTCATTATT GGCAGATCAG
GCACTGGATA ATGTCTTGCT GGCGGGCGGT TTGGGGCCAG AGAACTGTGA TGTGGCGGCC
CAACTAGGCT GTGCGGGTCT GGATGTCAAT TCCGGCGTAG AAAGCGCCCC TGGCATCAAA
GACCCCCAAC GGATCGCCGC TGTATTCCAG GCATTACGCG TGTACTGA
 
Protein sequence
MQETGGYKTE GYNVGSDKVD SDKTKTVLHQ IVHDKEIWVA ARKLQQPLTR FQNEITQSQR 
DFYHALQGDK TVFILECKKA SPSKGVIRDN FNPAEIAGVY KHYASAISVL TDEKYFQGSF
DFLPQVSAAV TQPVLCKDFI IDAYQIQLAR FYHADAILLM LSVLDDEAYR QLAAVAHSLN
MGVLTEASNA EELERAITLG AKVVGINNRD LRDLSIDLNR TRELAPRLPE GVTIISESGI
SHYRQVRELS QFANGFLIGS ALMSEPDLNA AVRRVLLGEN KVCGLTRAQD AATAYHAGAV
YGGLIFVDSS PRYVDIASAR TVISGAPLKY VGVFRHAEIE TVRQTAEQLS LAAVQLHGHE
DQQYINQLRK VLPAGCQIWK ALSVGDTMPE RNLQQVERYV LDHGTGGTGQ RFDWSLLADQ
ALDNVLLAGG LGPENCDVAA QLGCAGLDVN SGVESAPGIK DPQRIAAVFQ ALRVY
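
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one possible way to strip the layout, compute GC content, and translate the gene sequence. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), and it assumes the sequence contains only A, C, G and T. All function names here are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCAGGAAA CTGGGGGTTA CAAAACCGAG")
print(translate(first_blocks))  # MQETGGYKTE
```

The output matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be checked the same way. Note that this sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits; the gene here starts with ATG, so that case does not arise.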