Gene Sama_1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1855 
SymbolthiH 
ID4604105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2259305 
End bp2260420 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content56% 
IMG OID639781231 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_927730 
Protein GI119774990 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG GGTATTTCTC TGAGGCATTT GCGCGCCTGA ACCCCGATAG CTTGCGGCTT 
AAGCTCTACT CGGCCACAGC ACAGGATGTG GAGGCGGCAC TGCGGGCTCC GGCGGGTAAC
CTGAATGCGC TGCTGGCACT CTTGTCGCCT GCCGCCGAGC CTTATCTTGA GCAGATGGCA
CAAAAAAGCA TGCAGCTCAC CCGTAAGCGC TTTGGTGCCA GTATCGGTAT GTACCTGCCG
CTGTATCTAT CCAATCTGTG TGCAAACGAG TGTGATTACT GCGGCTTTTC AATGAGTAAC
CGCATTAAGC GCAAGACCCT TGATGAAACC GAGCTTACCC GCGAAATGGC CGCCATCAAG
GCCATGGGCT ATGACTCAGT GCTGCTGGTC TCCGGCGAGC ATGAAACCAA GGTCGGCATG
GGCTATTTTC GTAAGGTGTT ACCCGAGGTA AAGCGTGCGT TTTCCTACGT GGCGATGGAA
GTGCAGCCCC TAGCCGAGCC GGAGTACCGC GAGCTGGTAA CCCTCGGGCT TGATGCTGTG
ATGATTTATC AGGAAACCTA TCAGCGAGCC ACCTACGCTG AGCATCACAC CCGCGGCAAA
AAAATGGACT TTATCTGGCG CCTGGATACG CCCGACAGAC TGGCGCTGGC CGGTGTAGAC
AAGATTGGCC TTGGGGTGCT GCTGGGGCTT GATGACTGGC GCCTGGATGC GCTGTTGATG
GGGTATCACC TTGATTATCT TGAGCGAAAG TACTGGCGCA GCCGCTACAG TATTTCACTG
CCAAGACTCA GGCCCTGCAC CGGCGGCGTG GCACCGAAAA CCGAGATAAG CGATAAAGGA
TTGGTGCAAC TTATCTGTGC GTTCCGGTTG TTCAACGAAG CGCTGGATAT CAGCCTCTCG
ACACGGGAGC GGCCGGACTT TCGCGACAAT CTGTTTGCCC TTGGGATCAC CCAAACCAGC
GCTGGCAGTG CTACATCACC GGGAGGATAC TCTGAGCCGG ATACCCATTT GGATCAGTTT
GAAATCAGCG ATGACAGAAG TGCAGCCGAC ATCGCAGCCG TGCTGAAGGC GAGGGGACTT
AACCCCATCT GGAAAGACTG GGAATCCCAG TGGTAA
 
Protein sequence
MSQGYFSEAF ARLNPDSLRL KLYSATAQDV EAALRAPAGN LNALLALLSP AAEPYLEQMA 
QKSMQLTRKR FGASIGMYLP LYLSNLCANE CDYCGFSMSN RIKRKTLDET ELTREMAAIK
AMGYDSVLLV SGEHETKVGM GYFRKVLPEV KRAFSYVAME VQPLAEPEYR ELVTLGLDAV
MIYQETYQRA TYAEHHTRGK KMDFIWRLDT PDRLALAGVD KIGLGVLLGL DDWRLDALLM
GYHLDYLERK YWRSRYSISL PRLRPCTGGV APKTEISDKG LVQLICAFRL FNEALDISLS
TRERPDFRDN LFALGITQTS AGSATSPGGY SEPDTHLDQF EISDDRSAAD IAAVLKARGL
NPIWKDWESQ W