Gene Shew_2091 details

Gene Information

Locus tag: Shew_2091
Symbol: thiH
ID: 4923627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella loihica PV-4
Kingdom: Bacteria
Replicon accession: NC_009092
Strand:
Start bp: 2418226
End bp: 2419344
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 57%
IMG OID: 640163673
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001094216
Protein GI: 127513019
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
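
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS includes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2418226, 2419344)
print(length)                  # 1119
print(protein_length(length))  # 372
```

Both values agree with the Gene Length and Protein Length fields listed above.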


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCT TCGATACCTT GAGTGGACTC TCTCGCGAGC AGCTGCGAAT GGCGCTCTAT 
TCCACCACGC CGGCCCAGGT AGAGACGGCG ATCGAGGGTG AGCAGGGCAA CTTGGGTCAT
CTGTTAGCCC TGTTGTCGCC GGCCGCCGAG GAGTACCTAG AGCCGATGGC GCAGCGTGCT
GCCGCCTTAA CCCGGCAACG ATTTGGTCAT AACATCGGCC TCTATCTGCC GCTCTATCTC
TCAAATCTCT GCGCTAACGA GTGTGACTAC TGTGGTTTTA CCATGAGCAA CAAGCTTAAG
CGCAAGGTGC TCAGCCATGA TGAACTGGCG GCCGAGATGG CGGTGATCAA ACCTCAGGGA
TTTGATTCCA TCTTGTTGGT CTCCGGCGAG CATGAAACCA AGGTCGGCAT AGAGTACTTT
GCCGATATCT TACCCTTGGT GAAGGCGGAG TTTAGCCATG TGGCGATGGA GGTACAGCCA
CTCAGCCGTG AACACTATGA GATTTTGGTG GAGAAGGGGC TGGATGCCGT GATGCTCTAT
CAGGAAACCT ACGATCCTGA GACCTATCGC AGACATCACC TGAGGGGCAA CAAGCAGGAC
TATGGTTATC GTCTCGCATC GCCGGAGCGG ATCGCCCAGG CGGGCGTGGA TAAGATAGGC
CTAGGTGTGT TACTGGGTCT CGATGATTGG CGCATGGACG CGCTCTTGAT GGGCTATCAC
CTGGATTATC TCGAGCGCCG CTTCTGGCGC AGTCGTTACA GCATATCCTT GCCCAGACTC
AGACCCTGCG TCGGTGGTAT CACGCCTAAG GTGCAGTTGA CCGACAAGGG CCTAGTGCAA
CTCATCTGTG CCTTTCGCCT CTTCAACGAG CAGCTTGAGA TCAGCCTGTC GACCCGGGAG
ACGCCTAGCT TGAGAGATAA CCTGTTAGGG CTCGGCATCA CCCAGATGAG CGCCGGTAGT
CGCACCGAAC CGGGTGGCTA TGTCAATCCG GCGGCTCAGC TAGATCAGTT TGAGATCAGT
GATGAGCGCA GCGCCGCCGA GGTTGCCAGT GTCCTTCGAA GCCGTGGCTT TACCCCTGTG
TGGAAAGACT GGGAGGCTGG CTGGATAGGC GCAGGCTAA
 
Protein sequence
MSFFDTLSGL SREQLRMALY STTPAQVETA IEGEQGNLGH LLALLSPAAE EYLEPMAQRA 
AALTRQRFGH NIGLYLPLYL SNLCANECDY CGFTMSNKLK RKVLSHDELA AEMAVIKPQG
FDSILLVSGE HETKVGIEYF ADILPLVKAE FSHVAMEVQP LSREHYEILV EKGLDAVMLY
QETYDPETYR RHHLRGNKQD YGYRLASPER IAQAGVDKIG LGVLLGLDDW RMDALLMGYH
LDYLERRFWR SRYSISLPRL RPCVGGITPK VQLTDKGLVQ LICAFRLFNE QLEISLSTRE
TPSLRDNLLG LGITQMSAGS RTEPGGYVNP AAQLDQFEIS DERSAAEVAS VLRSRGFTPV
WKDWEAGWIG AG
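
The protein sequence and GC content can be checked against the gene sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to internal codons as the standard code) and ignores alternative start codons, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# NCBI ordering: bases cycle T, C, A, G with the third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGTTTCT TCGATACCTT GAGTGGACTC"))  # MSFFDTLSGL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_content` on the same input can be compared with the reported 57%.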