Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3658 |
Symbol | |
ID | 5387122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 4123711 |
End bp | 4125225 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640866678 |
Product | hypothetical protein |
Protein accession | YP_001402612 |
Protein GI | 153947920 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000000233957 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACC ATAGTAAGAA ACAAACCAGA GTCAGTTTAC CACACTCTGT TTTTTCTGCG GACTGGGTAC GTCAGAATGA AGCAGCGGCG GCGGCCCACT TCTCACTGAC CCTGTTTGAT CTGATGGTTC GAGCCGGAAA CGCTGCCTTT GAACTGGCGC ATAAACAATA TCCTATAGCA CGTCATTGGT TGATTCTGTG CGGGCATGGC AACAATGGCG GGGATGGCTA TATCGTTGCC AACCGCGCGT TGGCTGCGGG TATCGATGTG ACTCTTATCG CCTGTCCAGG TAACCGCCCT TTACCTGCGG AGGCGAAAGA AGCGCAATCT CAATGGCTGG CTATCGGTGG TGTTATTCAC CAACCTGATA CCCAGTGGCC ACAAGATATT GATTTAATTA TTGATGGTTT GTTGGGAATT GGCCTACGGG CTGCACCACA AGGCATCTAT GAGACTTTGA TTGACATGGC TAACCGCCAT CGGGCGGCAA AAGTTGCCTT GGATATTCCC TCCGGGCTGT GTGCGGATAA TGGTGCGGCC CTCGGTTCGG TGCTCCGAGC TGACCACACG TTGACTTTTG TTGCATTGAA GCCCGGTTTA CTCACCGGTC AGGCCCGTGA TTGGGTTGGG CAGTTACATT ACGATGATCT AGGGTTGGCA ACGTGGTTGG ATACTCAGTC AGCACAAATT GAACGAATAA CCGCCGATCA TCTTCCTCAG TGGCTGAAAC CTCGCCGCCC GTGTGCCCAT AAAGGGGATC ATGGGCGTCT GCTGTTGGTG GGCGGTGATA GAGGGTTCGG TGGTGCTATC CGTATGGCGG GGGAAGCCGC CCTACGAAGC GGCGCTGGCT TAGTGCGAGT ACTTACTCAC TTTGAGCATG TGGCACCGAT TCTGGCTGCT CGCCCAGAGT TGATGGTGCA GGCGCTAACC GCCGAAACCC TGGAGCAAAG CATGCAATGG GCTGATGTAC TGGTCGTCGG GCCGGGGTTA GGTCAGTCTG ATTGGAGCAG GAATGCTCTG AAACGACTGC AACAGAGTGA TAAACCCACC TTGTGGGATG CGGATGCACT TAACTTACTA GCATTAAATC CTCATAGGCG TCAGAATTGG GTATTGACTC CTCATCCAGG AGAAGCTGCT CGTCTCCTAG GTTGCCGCGT CGTTGACATT GAAAGTGACC GCTTACTTTC GGCCCGAAAC ATCGTCAAGC AATATGGTGG TGTGGTGGTT TTGAAGGGCG CGGGTACCCT GATCGCCAAT GAGCAAGGTG AGATGGCCAT TGCCGATGTG GGTAATGCTG GCATGGCCTC CGGCGGGATG GGCGATATTC TTTCTGGTAT TATTGGGGGT TTGATAGCAC AAAAGCTGGC GCTGTATGAT GCCGCCTGCG CGGGGTGTGT CGTGCATGGC GTTGCTGCAG ATAAGCTGGC TGAAGTTCAA GGCACCCGGG GTTTACTGGC CACAGATTTA CTGCCTGTTA TTCCGAAGTA CATTAATCCT GAGTTAGCAA AATAG
|
Protein sequence | MTDHSKKQTR VSLPHSVFSA DWVRQNEAAA AAHFSLTLFD LMVRAGNAAF ELAHKQYPIA RHWLILCGHG NNGGDGYIVA NRALAAGIDV TLIACPGNRP LPAEAKEAQS QWLAIGGVIH QPDTQWPQDI DLIIDGLLGI GLRAAPQGIY ETLIDMANRH RAAKVALDIP SGLCADNGAA LGSVLRADHT LTFVALKPGL LTGQARDWVG QLHYDDLGLA TWLDTQSAQI ERITADHLPQ WLKPRRPCAH KGDHGRLLLV GGDRGFGGAI RMAGEAALRS GAGLVRVLTH FEHVAPILAA RPELMVQALT AETLEQSMQW ADVLVVGPGL GQSDWSRNAL KRLQQSDKPT LWDADALNLL ALNPHRRQNW VLTPHPGEAA RLLGCRVVDI ESDRLLSARN IVKQYGGVVV LKGAGTLIAN EQGEMAIADV GNAGMASGGM GDILSGIIGG LIAQKLALYD AACAGCVVHG VAADKLAEVQ GTRGLLATDL LPVIPKYINP ELAK
|
| |