Gene Shewmr7_2092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr7_2092 
SymbolthiH 
ID4258359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-7 
KingdomBacteria 
Replicon accessionNC_008322 
Strand
Start bp2470376 
End bp2471491 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content48% 
IMG OID638122760 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_738137 
Protein GI114047587 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.62908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG TTGACCAATT TGCCCGTATT GAACGGGATA AGTTATTGCT GGCGCTATAT 
TCCTGCACGG CAGTGGAGGT TGAGCGGGCC CTGATGCAAC CCGAGGGTAA TCTAGAAAGT
TTACTCGCCT TGTTGTCTCC AGCCGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG
GCAGCGCTTA CTCGGCAACG CTTTGGGGCT AATATCGGAC TCTATTTGCC GTTATATCTG
TCAAATCTGT GTGCCAACGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGCTAAAG
CGTAAAGTGC TCAATGAGCA GGAAATTGCG GCTGAAATGG CGATAATCAA ATCACGTGGG
TTTGACTCCA TCTTACTGGT GTCGGGCGAG CATGAAACTA AAGTGGGGAT GGATTACTTT
AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATT TAGCAATGGA GGTACAGCCG
CTTGATGAGA TTGATTATCG CCAGCTTGTC GAGCTAGGGC TTGATGCTGT GATGGTGTAT
CAAGAAACCT ATCAAGCGGC GACCTATGCT AAGCATCACA CTCGAGGCAA TAAGCAGGAC
TTTGCGTATC GGCTGGCAAC GCCTGACCGC GTTGCCAGCG CAGGTGTCGA TAAGATTGGC
CTAGGCGTGT TATTGGGTTT GGATGACTGG CGACTCGATG CTTTACTGAT GGGTCATCAT
TTGGACTATT TAGAACGGCA TTACTGGCGT ACTCGCTTTA GTATTTCGTT ACCGCGTTTA
CGACCTTGTA CCGGCGGTAT AACGCCAAAA GTGCATTTAA CCGATCTCGG ACTGGTACAA
TTGACCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG
GCGCCATCAC TTCGGGATAA TTTGCTGCCA CTTGGGATAA CACAAATAAG TGCGGGGAGT
TCAACGCAAC CTGGTGGTTA TCAGGCGCCA GAGAGTCAAT TAGATCAGTT TGAGATAAGC
GATGAACGTA CCGTTGAGCA AGTCATGGCT CAGATGCGCC TTAGGGGATT CAATCCGGTT
TTTAAGGATT GGGAATCGGC TTGGATTGCG GGTTAA
 
Protein sequence
MSFVDQFARI ERDKLLLALY SCTAVEVERA LMQPEGNLES LLALLSPAAE PYIEEMAQRS 
AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKSRG
FDSILLVSGE HETKVGMDYF KRVLPIVKQQ FSYLAMEVQP LDEIDYRQLV ELGLDAVMVY
QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH
LDYLERHYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ LTCAFRLFNQ QLDISLSTRE
APSLRDNLLP LGITQISAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMA QMRLRGFNPV
FKDWESAWIA G