Gene Shewmr4_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1886 
SymbolthiH 
ID4252460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2245438 
End bp2246553 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content49% 
IMG OID638118497 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_734017 
Protein GI113970224 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG TCGACCAATT TGCCCGCATT GAACGGGATA AGTTATTGCT GGCGTTATAT 
TCCTGCACGG CAGCGGAGGT TGAGCGCGCC CTGATGCAAC CCGAGGGTAA TCTAGAGAGT
TTACTCGCCT TGTTGTCTCC AGCAGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG
GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATTGGAC TCTATTTGCC GTTATATCTG
TCAAATCTGT GTGCCAACGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGTTAAAG
CGTAAAGTGC TCAATGAGCA GGAAATTGCG GCCGAAATGG CGATTATTAA ATCCCGTGGT
TTTGACTCCA TCTTACTGGT GTCGGGTGAG CATGAAACCA AAGTGGGGAT AGATTACTTT
AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATT TGGCTATGGA GGTTCAGCCG
CTTGAAGAGA TTGATTATCG CCAGCTTGTC GAGCTAGGGC TTGATGCTGT GATGGTGTAT
CAGGAAACCT ATCAAGCGGC GACCTATGCT AAGCATCACA CCCGAGGCAA TAAGCAGGAC
TTTGCGTATC GGCTCGCAAC GCCCGACCGC GTTGCCAGCG CAGGTGTCGA TAAGATTGGC
CTAGGCGTGT TATTGGGTTT GGATGACTGG CGACTCGATG CCTTACTGAT GGGGCATCAT
TTGGACTATT TAGAACGGCA TTATTGGCGG ACTCGCTTTA GTATTTCGTT ACCTCGTTTG
CGGCCTTGTA CCGGCGGCAT AACACCAAAA GTGCATTTAA CCGATCTTGG ACTGGTACAA
TTGATCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG
GCGCCATCAC TTCGGGATAA TTTGCTGCCA CTTGGGATAA CACAAATGAG TGCGGGGAGT
TCAACGCAAC CTGGTGGTTA TCAGGCGCCA GAGAGCCAAT TAGATCAGTT TGAGATAAGC
GATGAACGTA CCGTTGAGCA AGTCATGACT CAGATGCGCC TTCGGGGATT TAATCCGGTT
TTTAAGGATT GGGAATCGGC TTGGATTGCG GGTTAG
 
Protein sequence
MSFVDQFARI ERDKLLLALY SCTAAEVERA LMQPEGNLES LLALLSPAAE PYIEEMAQRS 
AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKSRG
FDSILLVSGE HETKVGIDYF KRVLPIVKQQ FSYLAMEVQP LEEIDYRQLV ELGLDAVMVY
QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH
LDYLERHYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ LICAFRLFNQ QLDISLSTRE
APSLRDNLLP LGITQMSAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMT QMRLRGFNPV
FKDWESAWIA G