Gene VIBHAR_00361 details

Gene Information

Locus tag: VIBHAR_00361
Symbol: thiH
ID: 5556205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio harveyi ATCC BAA-1116
Kingdom: Bacteria
Replicon accession: NC_009783
Strand:
Start bp: 348244
End bp: 349371
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 46%
IMG OID: 640905855
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001443609
Protein GI: 156972702
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
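
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 348244
end_bp = 349371
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values: the span is 1128 bp, which is 376 codons, one of which is the stop codon.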


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTCG TTGAACAATT TAAGCAGCTT AATTGGGATG AAATCTCGAT GTCGATCTAC 
GCGAAAACGG CACAAGATGT TGAGCGAGCA TTGAATAAAC CCAAGCGTGA TTTGGAAGAC
TTTAAAGCGC TAATTTCACC AGCGGCTGAG GCGTACTTAG AGCAAATGGC GCAGTTGTCA
TACTCGGCAA CTCGCAAGCG TTTTGGTAAT ACCATGTCGC TTTATATTCC ATTGTACCTT
TCTAATTTGT GCGCCAATGC TTGTACTTAT TGTGGCTTCT CGATGGAGAA CAGAATCAAG
CGTCGTACCT TGAATAGGCA CGAAGTGGCT GCAGAAGTTG AAGCTATTAA ATGCATGAAG
TTCGATAGCG TATTGCTGGT GACTGGCGAA CACGAAACCA AAGTGGGCAT GAAATACTTC
CGCGAAATGG TGCTGATGAT TAAGCAACGC TTTAACTATT TAGCGATGGA AGTACAGCCA
CTTGATCAAG ACGAATACGC TGAGCTCAAG ACATTGGGTT TGGATGCGGT GATGGTCTAT
CAAGAAACGT ATCATCCTTC GACTTATGCC GAGCACCATT TGCGTGGCAA TAAGATGGAT
TTCGAATACC GATTGGATAC ACCCGATCGT CTTGCAAAAG CGGGCATCGA TAAGATCGGT
ATTGGCGCAT TGATAGGGTT GGAAGAGTGG CGTACCGATT GCTTTTATGT TGCAGCGCAC
TTGGACTATC TTGAGCGCAC GTATTGGCAG ACTCGTTACT CAATTTCTTT TCCGCGTTTA
CGTCCTTGTG AGGGGGCCAG TTCTTTAAAC GGAAAGCAGC CAAAATCAGT CATGACGGAT
AAGCAACTTG TTCAGCTGAT TTGCGCTTAT CGTTTGTTGA ATCCAGAAGT GGAGTTGTCA
TTGTCGACTC GTGAGTCACC GAAGTTTAGA GATAACGCAT TGCCTTTAGG CATTACCAGT
ATGTCTGCAG CATCGAAAAC TCAGCCGGGT GGTTATGCGA TGGATGATGT TGAACTCGAG
CAGTTTGAGA TCAGCGATGA GCGAAGCGCG GCTTCTGTGG AATATATGAT TCGAGCCAAA
GGGTTTGACC CAGTATGGCG AGATTGGCAC TCGGCGTATT CTGGTTAA
 
Protein sequence
MSFVEQFKQL NWDEISMSIY AKTAQDVERA LNKPKRDLED FKALISPAAE AYLEQMAQLS 
YSATRKRFGN TMSLYIPLYL SNLCANACTY CGFSMENRIK RRTLNRHEVA AEVEAIKCMK
FDSVLLVTGE HETKVGMKYF REMVLMIKQR FNYLAMEVQP LDQDEYAELK TLGLDAVMVY
QETYHPSTYA EHHLRGNKMD FEYRLDTPDR LAKAGIDKIG IGALIGLEEW RTDCFYVAAH
LDYLERTYWQ TRYSISFPRL RPCEGASSLN GKQPKSVMTD KQLVQLICAY RLLNPEVELS
LSTRESPKFR DNALPLGITS MSAASKTQPG GYAMDDVELE QFEISDERSA ASVEYMIRAK
GFDPVWRDWH SAYSG