Gene Information |
Locus tag | VC0395_A2448 |
Symbol | thiH |
ID | 5135994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2603179 |
End bp | 2604291 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640533900 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001218348 |
Protein GI | 147675280 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
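The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the reported gene length) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2603179, 2604291)
residues = protein_length(length_bp)
print(length_bp)  # 1113
print(residues)   # 370
```

Both results agree with the Gene Length and Protein Length fields of the record.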
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCA TTACCCACTT CCAACAATTA GGCTGGGATG ACAGTCGGCT ATCGATTTAT GGTAAGACAG CGCGGGATGT TGAACGCGCT TTATCTTCAC CTAAACGGAC GTTAGACGAT TTCAAAGCGC TGATTTCACC TTCAGCAGAG CCTTATCTGG AAACAATGGC GCAAATGGCT TATCAGGCCA CTCGTCAACG TTTTGGCAAT ACCATGTCGA TGTACATTCC GCTCTACCTC TCTAATCTCT GTTCGAATTC CTGTACCTAT TGCGGTTTTT CAATGGATAA CCGAATCAAG CGTAAAACAC TGAATGAAGT GGAGATTGAA CGAGAGATCG CGGCGATTAA ACGAATGGGT TTTGATAGCG TTTTGCTGGT GACCGGAGAG CATGAAACCA AAGTGGGTAT CGAATACTTC CGTCGGGCAC TGCCTATCAT CAAGCAAGCT TTTCATTATG TTGCGATGGA AGTTCAGCCG TTGCAGCAAG AAGAGTATGC CGAGTTAATT GGGCTCGGGT TGGATGCGGT GATGGTTTAC CAAGAAACGT ACCATCCTTC GACTTATGCG CAGCATCATT TGCGAGGCAA GAAAACCGAT TTTTGGTATC GACTCGAAAC ACCCGATAGG TTGGCTAGAG CGGGCATCGA TAAAATTGGT ATTGGCGCGC TGATTGGGCT TGAGGACTGG CGAACTGACA GCATTTTTGT TGCGGCGCAT CTGGATTATC TTGAGCGACA ATATTGGAAG ACGCGCTACT CAATTTCTTT TCCGCGTTTA CGCCCGTGTG AGGGCGCTTT ACAGCCGAAA TCAGTGATGA CGGATCGACA ACTTGTGCAG TTGATTTGTG CATTTCGTCT GTTCAATAGC GAGGTTGAGC TTTCTTTGTC TACCCGCGAG TCACCGATGT TTCGTGATCA AGTGGCTAAG CTCGGGATTA CCTCAATGTC TGCAGCGTCT AAAACTCAGC CGGGAGGGTA TTCAGACCCT AAGGTTGAGT TAGAGCAGTT TGCGGTAAGT GATGAGCGTT CTGCCGCAGA GGTGAGTAGC GCGTTAATGG AGCAAGGTTT ACAAGTGGTA TGGCATGACT GGCATCGAGC CTATTCAGGC TAA
|
Protein sequence | MSFITHFQQL GWDDSRLSIY GKTARDVERA LSSPKRTLDD FKALISPSAE PYLETMAQMA YQATRQRFGN TMSMYIPLYL SNLCSNSCTY CGFSMDNRIK RKTLNEVEIE REIAAIKRMG FDSVLLVTGE HETKVGIEYF RRALPIIKQA FHYVAMEVQP LQQEEYAELI GLGLDAVMVY QETYHPSTYA QHHLRGKKTD FWYRLETPDR LARAGIDKIG IGALIGLEDW RTDSIFVAAH LDYLERQYWK TRYSISFPRL RPCEGALQPK SVMTDRQLVQ LICAFRLFNS EVELSLSTRE SPMFRDQVAK LGITSMSAAS KTQPGGYSDP KVELEQFAVS DERSAAEVSS ALMEQGLQVV WHDWHRAYSG
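The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a rough sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and that the amino acid assignments of translation table 11 match the standard genetic code. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Unrecognised codons (for example, ones containing N) become 'X'.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAGCTTCA TTACCCACTT CCAACAATTA"
print(translate(prefix))  # MSFITHFQQL
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.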