Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2451 |
Symbol | thiE |
ID | 5136648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2606027 |
End bp | 2607349 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640533903 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001218351 |
Protein GI | 147674784 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACGTT TGGTTTTTCC ACGTCACCTA TCAGCCTTAA TCGGTCACGT GCAATACGCA CTGTTGCAAG CCAAGGAACA AGGGGTCGCT ATTCAGCATA TCCGTTTGGA TGTGGGCTCT GAGGCTCAGT TTATTTTAGA GAAGAGTGAA GAGTCTTTGC GAATTGGCAG TAGTTTGTGC TCTCAAAAGG AGGGTTTTGA GCCTTGCGAC TACTACCTAG ACTATGTGTC TGAAAACCGA GTGTTACCCG AGGCAATGAT GTGTAACGCT CGCTGCACGG TGACTGTAGG TCTTCATGAT GAATATGGTT TTACGCTGGA TAAGTGGCAG TATGGGCATG CTGCAGAGCA ACTTATTGTT TATCCATCAG AGAACCATCG ATTAAATAGT AAGGTAAATC AGCATCTTGC TTGGGTTTTA GCTACGTTGA CCTTAGATTT TTCGATTGGA GATGGTTTGT GCATTGCAAG AGCGGCAATC ACTCAAGGGG ATAGCGTTTC ACGTGAAACA TGGCCAACGC AATTCGAGCG TTTCCCTGCA GTGCAATCTA ACATTCGCTC TCTATCTACC CAAGTATTTC TAACCACCAG AGCATTTCCA ACGATTGATA AAGCAAAATT TAATCTCTAT CCCGTAGTGG ATGATGTGAA TTGGATTGAG CATCTTCTGA AGCTCGGTGT TAGAACGGTT CAGCTGCGAA TTAAAGATCC TAAGCAGGGT GATTTGGAAG CACAGATTAT CCGAGCCATC GCGTTAGGTC GTGAATTCAA CGCACAGGTT TTTATAAATG ATCATTGGCA ATTAGCCATT AAGCATCAGG CTTATGGTGT GCACCTTGGT CAAGAGGATT TGACGAGTGC GAATTTAACC GAGCTATTAG ATGCAGGTAT TCGATTGGGG CTTTCGACGC ATGGTTATTA TGAGTTACTG ATAGCCGCAG GAATACAGCC TAGCTACATC GCCCTTGGGC ACATTTTTCC CACGACCACT AAACAGATGC CATCAAAACC ACAAGGGCTA GTGCGATTGG CCGCCTATCA GCGGTTGGTT AATCAAATGC CTCACCAAGG ACAACATGGC ATTCCAACGG TGGCGATTGG TGGTATTGAT TGCAGAAACA TTCGCGATGT TCTCGATTGT GGCGTCACGG CGGTTGCAGT GGTGCGGGCT ATTACCGAGT CACCGGATCC TAGCCTTGCA GTACAAGCGT TGAGTTCTGC TTTTGCCGAT TTTGTAGATG CGGAGTACAA GTTGATGCCA GCCAGCGAGT CATGCGAGCC ACTCAGTTAC TTGGCTATGG AGGTAGCGGA TGCTCACAGA TAA
|
Protein sequence | MVRLVFPRHL SALIGHVQYA LLQAKEQGVA IQHIRLDVGS EAQFILEKSE ESLRIGSSLC SQKEGFEPCD YYLDYVSENR VLPEAMMCNA RCTVTVGLHD EYGFTLDKWQ YGHAAEQLIV YPSENHRLNS KVNQHLAWVL ATLTLDFSIG DGLCIARAAI TQGDSVSRET WPTQFERFPA VQSNIRSLST QVFLTTRAFP TIDKAKFNLY PVVDDVNWIE HLLKLGVRTV QLRIKDPKQG DLEAQIIRAI ALGREFNAQV FINDHWQLAI KHQAYGVHLG QEDLTSANLT ELLDAGIRLG LSTHGYYELL IAAGIQPSYI ALGHIFPTTT KQMPSKPQGL VRLAAYQRLV NQMPHQGQHG IPTVAIGGID CRNIRDVLDC GVTAVAVVRA ITESPDPSLA VQALSSAFAD FVDAEYKLMP ASESCEPLSY LAMEVADAHR
|
| |