Gene YpsIP31758_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3658 
Symbol 
ID5387122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4123711 
End bp4125225 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content53% 
IMG OID640866678 
Producthypothetical protein 
Protein accessionYP_001402612 
Protein GI153947920 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000233957 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACC ATAGTAAGAA ACAAACCAGA GTCAGTTTAC CACACTCTGT TTTTTCTGCG 
GACTGGGTAC GTCAGAATGA AGCAGCGGCG GCGGCCCACT TCTCACTGAC CCTGTTTGAT
CTGATGGTTC GAGCCGGAAA CGCTGCCTTT GAACTGGCGC ATAAACAATA TCCTATAGCA
CGTCATTGGT TGATTCTGTG CGGGCATGGC AACAATGGCG GGGATGGCTA TATCGTTGCC
AACCGCGCGT TGGCTGCGGG TATCGATGTG ACTCTTATCG CCTGTCCAGG TAACCGCCCT
TTACCTGCGG AGGCGAAAGA AGCGCAATCT CAATGGCTGG CTATCGGTGG TGTTATTCAC
CAACCTGATA CCCAGTGGCC ACAAGATATT GATTTAATTA TTGATGGTTT GTTGGGAATT
GGCCTACGGG CTGCACCACA AGGCATCTAT GAGACTTTGA TTGACATGGC TAACCGCCAT
CGGGCGGCAA AAGTTGCCTT GGATATTCCC TCCGGGCTGT GTGCGGATAA TGGTGCGGCC
CTCGGTTCGG TGCTCCGAGC TGACCACACG TTGACTTTTG TTGCATTGAA GCCCGGTTTA
CTCACCGGTC AGGCCCGTGA TTGGGTTGGG CAGTTACATT ACGATGATCT AGGGTTGGCA
ACGTGGTTGG ATACTCAGTC AGCACAAATT GAACGAATAA CCGCCGATCA TCTTCCTCAG
TGGCTGAAAC CTCGCCGCCC GTGTGCCCAT AAAGGGGATC ATGGGCGTCT GCTGTTGGTG
GGCGGTGATA GAGGGTTCGG TGGTGCTATC CGTATGGCGG GGGAAGCCGC CCTACGAAGC
GGCGCTGGCT TAGTGCGAGT ACTTACTCAC TTTGAGCATG TGGCACCGAT TCTGGCTGCT
CGCCCAGAGT TGATGGTGCA GGCGCTAACC GCCGAAACCC TGGAGCAAAG CATGCAATGG
GCTGATGTAC TGGTCGTCGG GCCGGGGTTA GGTCAGTCTG ATTGGAGCAG GAATGCTCTG
AAACGACTGC AACAGAGTGA TAAACCCACC TTGTGGGATG CGGATGCACT TAACTTACTA
GCATTAAATC CTCATAGGCG TCAGAATTGG GTATTGACTC CTCATCCAGG AGAAGCTGCT
CGTCTCCTAG GTTGCCGCGT CGTTGACATT GAAAGTGACC GCTTACTTTC GGCCCGAAAC
ATCGTCAAGC AATATGGTGG TGTGGTGGTT TTGAAGGGCG CGGGTACCCT GATCGCCAAT
GAGCAAGGTG AGATGGCCAT TGCCGATGTG GGTAATGCTG GCATGGCCTC CGGCGGGATG
GGCGATATTC TTTCTGGTAT TATTGGGGGT TTGATAGCAC AAAAGCTGGC GCTGTATGAT
GCCGCCTGCG CGGGGTGTGT CGTGCATGGC GTTGCTGCAG ATAAGCTGGC TGAAGTTCAA
GGCACCCGGG GTTTACTGGC CACAGATTTA CTGCCTGTTA TTCCGAAGTA CATTAATCCT
GAGTTAGCAA AATAG
 
Protein sequence
MTDHSKKQTR VSLPHSVFSA DWVRQNEAAA AAHFSLTLFD LMVRAGNAAF ELAHKQYPIA 
RHWLILCGHG NNGGDGYIVA NRALAAGIDV TLIACPGNRP LPAEAKEAQS QWLAIGGVIH
QPDTQWPQDI DLIIDGLLGI GLRAAPQGIY ETLIDMANRH RAAKVALDIP SGLCADNGAA
LGSVLRADHT LTFVALKPGL LTGQARDWVG QLHYDDLGLA TWLDTQSAQI ERITADHLPQ
WLKPRRPCAH KGDHGRLLLV GGDRGFGGAI RMAGEAALRS GAGLVRVLTH FEHVAPILAA
RPELMVQALT AETLEQSMQW ADVLVVGPGL GQSDWSRNAL KRLQQSDKPT LWDADALNLL
ALNPHRRQNW VLTPHPGEAA RLLGCRVVDI ESDRLLSARN IVKQYGGVVV LKGAGTLIAN
EQGEMAIADV GNAGMASGGM GDILSGIIGG LIAQKLALYD AACAGCVVHG VAADKLAEVQ
GTRGLLATDL LPVIPKYINP ELAK