Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4569 |
Symbol | thiE |
ID | 6875349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4410292 |
End bp | 4410927 |
Gene Length | 636 bp |
Protein Length | 211 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642787477 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_002218079 |
Protein GI | 198245244 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.00615742 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTATCAGC CTGATTTCCC GACCGTGCCG TTCCGCCTCG GGCTTTACCC GGTGGTGGAC AGCGTTGCAT GGATTGAGCG TCTACTGGAG GCGGGCGTGC GCACGATCCA GCTGCGTATC AAAGATAAAC GCGATGAAGA GGTGGAAGCG GATGTTATCG CTGCCATCGC GCTGGGGCGT CGTTATAACG CCCGTCTGTT TATCAACGAC TACTGGCGTC TGGCAATTAA GCACCGCGCT TACGGCGTGC ATCTCGGCCA GGAAGACCTT GAAACCACCG ACCTGAAGGC CATTCAGGCG GCGGGGTTAC GCCTGGGCGT ATCGACTCAC GATGATATGG AGATTGACGT CGCACTCGCC GCTAAGCCTT CTTATATCGC GCTCGGCCAC GTCTTCCCCA CGCAAACCAA GCAGATGCCT TCCGCCCCAC AGGGGCTGGC GCAGTTGGCC AGTCATATTG AACGACTGGC GGATTACCCG ACCGTCGCGA TCGGCGGCAT CAGCCTTGAA CGCGCCACAG CGGTACTGGC GACCGGCGTC GGCAGTATTG CCGTGGTCAG CGCCATTACT CAGGCCGCCG ACTGGCGCGC CGCTACCGCG CAGTTACTGG ATATTGCGGG AGTTGGCGAT GAATGA
|
Protein sequence | MYQPDFPTVP FRLGLYPVVD SVAWIERLLE AGVRTIQLRI KDKRDEEVEA DVIAAIALGR RYNARLFIND YWRLAIKHRA YGVHLGQEDL ETTDLKAIQA AGLRLGVSTH DDMEIDVALA AKPSYIALGH VFPTQTKQMP SAPQGLAQLA SHIERLADYP TVAIGGISLE RATAVLATGV GSIAVVSAIT QAADWRAATA QLLDIAGVGD E
|
| |