Gene VEA_002077 details

Gene Information

Locus tag: VEA_002077
Symbol:
ID: 8555726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio sp. Ex25
Kingdom: Bacteria
Replicon accession: NC_013456
Strand:
Start bp: 489903
End bp: 491015
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 46%
IMG OID: 646405090
Product: thiazole biosynthesis protein ThiH
Protein accession: YP_003284705
Protein GI: 262392851
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
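
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(489903, 491015)
print(length)                  # 1113
print(protein_length(length))  # 370
```

Under these assumptions both results agree with the Gene Length and Protein Length values listed in the record.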


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTTG TTGAGCAATT TAAGCAGCTT AATTGGGATG ACATTTCGAT GTCGATCTAT 
GCGAAAACAG CACAAGATGT CGAGCGAGCA TTGAATAAAC CCAAGCGTGA TTTGGAAGAC
TTTAAAGCGC TAATTTCACC CGCGGCTGAA GCATACTTAG AGCAAATGGC GCAGTTGTCA
TACTCGGCAA CTCGCAAGCG GTTTGGCAAT ACCATGTCGC TTTACATTCC ACTGTACCTT
TCTAACTTGT GCGCCAATGC TTGTACTTAT TGTGGCTTCT CGATGGAGAA CAGAATCAAG
CGTCGTACCT TGAATAGGGA CGAAGTGGCT GCAGAAGTTG AAGCCATTAA ACGCATGAAG
TTCGATAGCG TATTGCTGGT GACTGGCGAA CACGAAACCA AAGTGGGCAT GAAATACTTC
CGCGAAATGG TGCCTATGAT TAAGCAACGC TTTAATTATT TGGCGATGGA AGTGCAGCCG
CTAGATCAAG ACGAATACGC AGAGCTCAAG ACATTGGGTT TGGATGCGGT TATGGTTTAT
CAAGAAACGT ATCATCCTTC GACTTATGCT GAGCACCATT TACGTGGCAA TAAGATGGAT
TTCGAATACC GATTGGATAC ACCCGATCGT CTTGCAAAAG CGGGCATCGA TAAGATCGGT
ATTGGCGCTT TGATAGGATT GGAAGAGTGG CGTACAGATT GTTTTTATGT GGCAGCGCAC
TTGGACTATC TTGAGCGCAC GTATTGGCAG ACTCGTTACT CAATTTCTTT CCCGCGTTTG
CGTCCTTGTG AGGGCGCGCT CCAACCAAAA TCAGTCATGA CGGATAAGCA ACTTGTTCAG
TTGATTTGCG CTTATCGTTT GTTGAATCCA GAAGTGGAGT TGTCGTTATC GACGCGTGAG
TCACCGAAGT TTAGAGACAA CGCATTGCCG TTAGGCATCA CCAGTATGTC TGCTGCATCG
AAAACTCAGC CGGGTGGTTA TGCGATGGAT GATGTTGAAC TCGAGCAGTT TGAGATCAGT
GATGAGCGAA GTGCTGGTTC TGTGGAAGAT ATGATTCGAG CCAAAGGCTT TGACCCAGTA
TGGCGAGACT GGCACTCGGC GTATTCTGGT TAA
 
Protein sequence
MSFVEQFKQL NWDDISMSIY AKTAQDVERA LNKPKRDLED FKALISPAAE AYLEQMAQLS 
YSATRKRFGN TMSLYIPLYL SNLCANACTY CGFSMENRIK RRTLNRDEVA AEVEAIKRMK
FDSVLLVTGE HETKVGMKYF REMVPMIKQR FNYLAMEVQP LDQDEYAELK TLGLDAVMVY
QETYHPSTYA EHHLRGNKMD FEYRLDTPDR LAKAGIDKIG IGALIGLEEW RTDCFYVAAH
LDYLERTYWQ TRYSISFPRL RPCEGALQPK SVMTDKQLVQ LICAYRLLNP EVELSLSTRE
SPKFRDNALP LGITSMSAAS KTQPGGYAMD DVELEQFEIS DERSAGSVED MIRAKGFDPV
WRDWHSAYSG
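
As a rough consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the matching ends of the protein sequence. This is a minimal sketch, not the pipeline that produced the record: it builds a codon table from the standard amino-acid assignments, which translation table 11 shares for sense and stop codons, and it does not handle the alternative start codons that table 11 permits (the gene here starts with ATG, so that does not matter for this example). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# are the same for translation table 11 and the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored so the 10-base groups used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listed above. The last line
# starts at base 1081, so it is still in reading frame 1.
first_line = "ATGAGCTTTG TTGAGCAATT TAAGCAGCTT AATTGGGATG ACATTTCGAT GTCGATCTAT"
last_line = "TGGCGAGACT GGCACTCGGC GTATTCTGGT TAA"

print(translate(first_line))  # MSFVEQFKQLNWDDISMSIY
print(translate(last_line))   # WRDWHSAYSG
```

The two outputs match the first 20 and last 10 residues of the protein sequence, and the final TAA codon is the stop that is not counted in the 370 aa protein length.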