Gene Spea_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_2378 
SymbolthiH 
ID5662771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp2900354 
End bp2901460 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content47% 
IMG OID641237000 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001502233 
Protein GI157962199 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTG TTGATGTGTT TAAAAAGCTG TCTCGTTCAG AGCTTAAACT CAGATTGTAT 
TCAAGCACAG CTGCGGATGT TGAGTGCGCA ATGCAGAAAC CATCGGGAGA CGTAGATAGT
TTACTTGCGC TGTTGTCACC GGCGGCTGAG CCATTTCTTG AGCAGATGGC TCAGCAAGCT
GTCGCGTTAA CTCGTCAGCG ATTTGGCGCG AGTATCGGCA TGTATATTCC GCTCTATCTT
TCAAATCTCT GTGCCAATGA GTGTGATTAT TGTGGCTTCA CCATGAGTAA CAAAATCAAG
CGTAAGACTT TAACTGACAG TGAAATCAAA GACGAGATGC GATCGATTAA GAGCATGGGC
TACGATTCAA TTTTACTGGT ATCGGGTGAG CATGAGTCGA AAGTGGGCGT GCCATACTTC
AAGCAGGTGT TACCACTGAT CACAGAGCAG TTTAGCCATG TGGCCATGGA GGTGCAGCCG
TTGGAAGAGC AAGACTATAG AGAGCTAGTC GCAGAAGGGC TCGATGCGGT GATGCTATAC
CAAGAAACCT ATAATCCTGT GACGTATAGC GAGCACCATA CCCGAGGTAA GAAGAAGGAT
TTTGGTTATC GGCTCGAGTC TCCCGATAGG GTAGCGAGAG CTGGTGTCGA TAAAATAGGT
CTTGGGGTAT TACTCGGCTT AGATGATTGG CGACTCGACG CACTATTAAT GGGGCATCAT
TTAGATTACA TGGAGAAAAC TTATTGGCGC AGTCGTTATA GCATTTCGCT TCCAAGGTTG
AGGCCTTGTA CTGGTGGGGT AACGCCAAAG GTTGAGCTGA CTGATAAAGG CTTAGTGCAG
ATGATCTGTG CCTTTAGATT GTTTAATCAA CAACTAGAGA TTAGTCTATC GACACGCGAA
ACGCCTAAGC TACGGGATAA CCTATTTACA CTAGGGGTAA CTAATGTTAG TGCAGGAAGC
TCGACGCAAC CCGGCGGCTA TGTTGAACCT AACACCGAGC TGGATCAGTT TGAGATCAGC
GATGAGCGCT CACCGCAAGT GGTTGCTAAC GCAATGCTTG AGCGTGGATT AAACCCAGTA
TGGAAAGACT GGGAAAGCGG CTGGTGA
 
Protein sequence
MSFVDVFKKL SRSELKLRLY SSTAADVECA MQKPSGDVDS LLALLSPAAE PFLEQMAQQA 
VALTRQRFGA SIGMYIPLYL SNLCANECDY CGFTMSNKIK RKTLTDSEIK DEMRSIKSMG
YDSILLVSGE HESKVGVPYF KQVLPLITEQ FSHVAMEVQP LEEQDYRELV AEGLDAVMLY
QETYNPVTYS EHHTRGKKKD FGYRLESPDR VARAGVDKIG LGVLLGLDDW RLDALLMGHH
LDYMEKTYWR SRYSISLPRL RPCTGGVTPK VELTDKGLVQ MICAFRLFNQ QLEISLSTRE
TPKLRDNLFT LGVTNVSAGS STQPGGYVEP NTELDQFEIS DERSPQVVAN AMLERGLNPV
WKDWESGW