Gene Swoo_2587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwoo_2587 
SymbolthiH 
ID6116884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella woodyi ATCC 51908 
KingdomBacteria 
Replicon accessionNC_010506 
Strand
Start bp3164124 
End bp3165233 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content45% 
IMG OID641634117 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001760959 
Protein GI170726933 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.914294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0149307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC TTAGTTATTT ACAAGGGCTA TCACGTGAAA AGTTATCTCT AAAACTTTAC 
TCATGCACGG GGAAAGATGT TGAGCAGGCG TTGCAGCATC CGGAAGGTTC CTTAGAGAGC
CTACTGGCGC TTTTATCTCC AGCTGCAGAG CCTTATCTGG AACAGATGGC GCAAACCTCC
GCACGTCTCA CTCGCCAACG TTTTGGGGCG AATATTGGTA TGTATTTGCC GCTATACCTC
TCTAACCTGT GTGCGAATGA GTGTGATTAT TGTGGCTTTA GCATGAGCAA TCGCATTAAG
CGTAAGACCT TGAGTCTAGA TGAGTTAAAT GCTGAGATGC GAGTGGTAAA AGCGATGGAT
TATGACTCAA TATTGCTGGT TTCAGGTGAA CATGAAACTA AAGTTGGCAT TGATTACTTT
AGCTCAGTTC TGCCAAGCGT TAAAAGTCAG TTCAGTTATG TGGCGATGGA GGTTCAACCA
CTCAAAGAGG CTGAATATTC CCAGTTGGCA GGACTTGGGC TCGATGCCGT GATGATCTAT
CAGGAAACTT ACAACCCGCA AACCTATGCC GAGCACCACA CACGAGGAAA TAAACAAAAT
TTTGAGTATC GTCTCGAGAC TCCTGAAAGG GTGGCTAGAG CAGGTGTCGA TAAAATTGGT
TTAGGTGTAC TGCTTGGTTT AGATGACTGG CGTTTGGATG CACTGTTATT GGGCCACCAT
TTAACTTATC TTGAGTCTCA TTTTTGGCGA AGCCGCTACA GTGTATCGTT ACCGAGATTA
CGCCCCTGCA CAGGGGGGAT ATCACCTAAA GTAGAGCTAA CTGATAGAGG GTTGGTACAA
CTAATTTGCG CATTTCGACT CTTTAATCAT CAGCTTGAGA TCAGTCTATC GACTAGAGAG
TCGGCTGAGT TACGTAATAA TTTGTTCGGT TTAGGGGTGA CTCAGTTAAG CGCAGGCAGT
TCAACACAGC CCGGTGGCTA CTTATTGCCT GATACTCAGC TTGATCAGTT TGAGATAAGT
GATGAGCGTA CACCTGTTGA AGTTTGCGTG GCAATGAGAG ATAGGGGGTT TAATCCTGTT
TGGAAAGACT GGGAATCAGC TTGGGTATAG
 
Protein sequence
MSFLSYLQGL SREKLSLKLY SCTGKDVEQA LQHPEGSLES LLALLSPAAE PYLEQMAQTS 
ARLTRQRFGA NIGMYLPLYL SNLCANECDY CGFSMSNRIK RKTLSLDELN AEMRVVKAMD
YDSILLVSGE HETKVGIDYF SSVLPSVKSQ FSYVAMEVQP LKEAEYSQLA GLGLDAVMIY
QETYNPQTYA EHHTRGNKQN FEYRLETPER VARAGVDKIG LGVLLGLDDW RLDALLLGHH
LTYLESHFWR SRYSVSLPRL RPCTGGISPK VELTDRGLVQ LICAFRLFNH QLEISLSTRE
SAELRNNLFG LGVTQLSAGS STQPGGYLLP DTQLDQFEIS DERTPVEVCV AMRDRGFNPV
WKDWESAWV