Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VEA_002077 |
Symbol | |
ID | 8555726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio sp. Ex25 |
Kingdom | Bacteria |
Replicon accession | NC_013456 |
Strand | + |
Start bp | 489903 |
End bp | 491015 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 646405090 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003284705 |
Protein GI | 262392851 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTG TTGAGCAATT TAAGCAGCTT AATTGGGATG ACATTTCGAT GTCGATCTAT GCGAAAACAG CACAAGATGT CGAGCGAGCA TTGAATAAAC CCAAGCGTGA TTTGGAAGAC TTTAAAGCGC TAATTTCACC CGCGGCTGAA GCATACTTAG AGCAAATGGC GCAGTTGTCA TACTCGGCAA CTCGCAAGCG GTTTGGCAAT ACCATGTCGC TTTACATTCC ACTGTACCTT TCTAACTTGT GCGCCAATGC TTGTACTTAT TGTGGCTTCT CGATGGAGAA CAGAATCAAG CGTCGTACCT TGAATAGGGA CGAAGTGGCT GCAGAAGTTG AAGCCATTAA ACGCATGAAG TTCGATAGCG TATTGCTGGT GACTGGCGAA CACGAAACCA AAGTGGGCAT GAAATACTTC CGCGAAATGG TGCCTATGAT TAAGCAACGC TTTAATTATT TGGCGATGGA AGTGCAGCCG CTAGATCAAG ACGAATACGC AGAGCTCAAG ACATTGGGTT TGGATGCGGT TATGGTTTAT CAAGAAACGT ATCATCCTTC GACTTATGCT GAGCACCATT TACGTGGCAA TAAGATGGAT TTCGAATACC GATTGGATAC ACCCGATCGT CTTGCAAAAG CGGGCATCGA TAAGATCGGT ATTGGCGCTT TGATAGGATT GGAAGAGTGG CGTACAGATT GTTTTTATGT GGCAGCGCAC TTGGACTATC TTGAGCGCAC GTATTGGCAG ACTCGTTACT CAATTTCTTT CCCGCGTTTG CGTCCTTGTG AGGGCGCGCT CCAACCAAAA TCAGTCATGA CGGATAAGCA ACTTGTTCAG TTGATTTGCG CTTATCGTTT GTTGAATCCA GAAGTGGAGT TGTCGTTATC GACGCGTGAG TCACCGAAGT TTAGAGACAA CGCATTGCCG TTAGGCATCA CCAGTATGTC TGCTGCATCG AAAACTCAGC CGGGTGGTTA TGCGATGGAT GATGTTGAAC TCGAGCAGTT TGAGATCAGT GATGAGCGAA GTGCTGGTTC TGTGGAAGAT ATGATTCGAG CCAAAGGCTT TGACCCAGTA TGGCGAGACT GGCACTCGGC GTATTCTGGT TAA
|
Protein sequence | MSFVEQFKQL NWDDISMSIY AKTAQDVERA LNKPKRDLED FKALISPAAE AYLEQMAQLS YSATRKRFGN TMSLYIPLYL SNLCANACTY CGFSMENRIK RRTLNRDEVA AEVEAIKRMK FDSVLLVTGE HETKVGMKYF REMVPMIKQR FNYLAMEVQP LDQDEYAELK TLGLDAVMVY QETYHPSTYA EHHLRGNKMD FEYRLDTPDR LAKAGIDKIG IGALIGLEEW RTDCFYVAAH LDYLERTYWQ TRYSISFPRL RPCEGALQPK SVMTDKQLVQ LICAYRLLNP EVELSLSTRE SPKFRDNALP LGITSMSAAS KTQPGGYAMD DVELEQFEIS DERSAGSVED MIRAKGFDPV WRDWHSAYSG
|
| |