Gene Sfri_2196 details


Gene Information

Locus tag: Sfri_2196
Symbol: thiH
ID: 4279108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella frigidimarina NCIMB 400
Kingdom: Bacteria
Replicon accession: NC_008345
Strand:
Start bp: 2626048
End bp: 2627163
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 43%
IMG OID: 638134991
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_750880
Protein GI: 114563367
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
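
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2626048
END_BP = 2627163


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1116 371"
```

Under those assumptions the result agrees with the listed Gene Length (1116 bp) and Protein Length (371 aa).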


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.119873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTATT CAAGCGTATT TGCGCAATTA GCCCCCGAAG CATTATCGAT GAAGTTGTAT 
TCAACAACCG CCAAAGACGT TGAGTTGGCT TTGAAAAATC CAAGCGGTAA TCTTGATAGC
TTGTTGGCAT TATTATCTCC TGCCGCAATG CCATACATTG AACCGATGGC GAAGCAAGCA
GCTCAGCTGA CTCGGCAGCG ATTTGGTGCA AATATCGGCC TTTTCTTACC CTTATATTTA
TCAAACTTGT GCGCCAATGA GTGCGATTAT TGTGGATTTA GTATGAGCAA TAAGGTCAAG
CGCAAAACTC TTACTCGTGA TGAACTAGCA GCGGAAATGG CGATTATTAA ACAACGTGGC
TTTGATTCGA TATTATTGGT GTCTGGCGAA CACGAAACTA AAGTTGGAAT GGAGTATTTT
GAGTCGGTGC TACCGTTAGT TAGCAGGGCT TTTAATTACG TGGCAATGGA AGTGCAACCA
TTGGAGACTG ATCAGTATCA GCGTCTTGGC AAACTAGGTG TTGATGCCGT TATGGTATAC
CAAGAAACAT ATCGTGCTCA TACTTATGCT CAGCATCATA CTCGCGGTAA AAAACAAGAT
TTTATCTATC GCTTAGATAC CCCTGATCGA GTGGCAAAAT CGGGTATCGA TAAAATTGGC
CTCGGGGTAT TACTTGGGTT AGATGATTGG CGCTTAGATG CATTACTTAT GGGCTTTCAT
CTAGATTACT TGGAAAATAC CTACTGGCGC AGTCGCTACA GTATTTCACT ACCTCGTTTA
CGTCCATGTA CTGGCGGCAT TACACCGAAA GTTGAATTAA CCGATGCAGG ATTAGTACAA
ATGATCTGTG CATTTAGGTT GTTTAACCCC CAGCTTGAAA TCAGTTTATC CACGCGAGAG
TTACCATCAT TAAGGGATAA TTTACTCCCT TTAGGAATTA CCCACATGAG TGCAGGCAGT
TCAACTCAAC CAGGCGGTTA TATGGCGCCA GACAGCCAAC TTGATCAATT TGAAATCAGT
GATAACCGCC CAGTAGAGCA AGTTGTTGAA CAAATGAAGC GACGAGGAAT TAATCCAGTA
TGGAAAGATT GGGAGATGGG TTGGGTTAAC AGCTAA
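
The gene sequence is printed in blocks of 10 bases, so the grouping whitespace has to be removed before the sequence is measured. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, only the first line of the sequence is used as sample input, and how the database rounds its 43% figure is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as a short sample.
first_line = "ATGACGTATT CAAGCGTATT TGCGCAATTA GCCCCCGAAG CATTATCGAT GAAGTTGTAT"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))
```

Applying the same two functions to all lines of the gene sequence would give a value to compare with the listed GC content.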
 
Protein sequence
MTYSSVFAQL APEALSMKLY STTAKDVELA LKNPSGNLDS LLALLSPAAM PYIEPMAKQA 
AQLTRQRFGA NIGLFLPLYL SNLCANECDY CGFSMSNKVK RKTLTRDELA AEMAIIKQRG
FDSILLVSGE HETKVGMEYF ESVLPLVSRA FNYVAMEVQP LETDQYQRLG KLGVDAVMVY
QETYRAHTYA QHHTRGKKQD FIYRLDTPDR VAKSGIDKIG LGVLLGLDDW RLDALLMGFH
LDYLENTYWR SRYSISLPRL RPCTGGITPK VELTDAGLVQ MICAFRLFNP QLEISLSTRE
LPSLRDNLLP LGITHMSAGS STQPGGYMAP DSQLDQFEIS DNRPVEQVVE QMKRRGINPV
WKDWEMGWVN S