Gene Information |
Locus tag | Shew_2091 |
Symbol | thiH |
ID | 4923627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | - |
Start bp | 2418226 |
End bp | 2419344 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640163673 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001094216 |
Protein GI | 127513019 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
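The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive positions and that the reported gene length includes the terminating stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2418226, 2419344)
print(length, protein_length_aa(length))  # 1119 372
```

Under these assumptions the result agrees with the Gene Length (1119 bp) and Protein Length (372 aa) fields of this record.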
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTCT TCGATACCTT GAGTGGACTC TCTCGCGAGC AGCTGCGAAT GGCGCTCTAT TCCACCACGC CGGCCCAGGT AGAGACGGCG ATCGAGGGTG AGCAGGGCAA CTTGGGTCAT CTGTTAGCCC TGTTGTCGCC GGCCGCCGAG GAGTACCTAG AGCCGATGGC GCAGCGTGCT GCCGCCTTAA CCCGGCAACG ATTTGGTCAT AACATCGGCC TCTATCTGCC GCTCTATCTC TCAAATCTCT GCGCTAACGA GTGTGACTAC TGTGGTTTTA CCATGAGCAA CAAGCTTAAG CGCAAGGTGC TCAGCCATGA TGAACTGGCG GCCGAGATGG CGGTGATCAA ACCTCAGGGA TTTGATTCCA TCTTGTTGGT CTCCGGCGAG CATGAAACCA AGGTCGGCAT AGAGTACTTT GCCGATATCT TACCCTTGGT GAAGGCGGAG TTTAGCCATG TGGCGATGGA GGTACAGCCA CTCAGCCGTG AACACTATGA GATTTTGGTG GAGAAGGGGC TGGATGCCGT GATGCTCTAT CAGGAAACCT ACGATCCTGA GACCTATCGC AGACATCACC TGAGGGGCAA CAAGCAGGAC TATGGTTATC GTCTCGCATC GCCGGAGCGG ATCGCCCAGG CGGGCGTGGA TAAGATAGGC CTAGGTGTGT TACTGGGTCT CGATGATTGG CGCATGGACG CGCTCTTGAT GGGCTATCAC CTGGATTATC TCGAGCGCCG CTTCTGGCGC AGTCGTTACA GCATATCCTT GCCCAGACTC AGACCCTGCG TCGGTGGTAT CACGCCTAAG GTGCAGTTGA CCGACAAGGG CCTAGTGCAA CTCATCTGTG CCTTTCGCCT CTTCAACGAG CAGCTTGAGA TCAGCCTGTC GACCCGGGAG ACGCCTAGCT TGAGAGATAA CCTGTTAGGG CTCGGCATCA CCCAGATGAG CGCCGGTAGT CGCACCGAAC CGGGTGGCTA TGTCAATCCG GCGGCTCAGC TAGATCAGTT TGAGATCAGT GATGAGCGCA GCGCCGCCGA GGTTGCCAGT GTCCTTCGAA GCCGTGGCTT TACCCCTGTG TGGAAAGACT GGGAGGCTGG CTGGATAGGC GCAGGCTAA
|
Protein sequence | MSFFDTLSGL SREQLRMALY STTPAQVETA IEGEQGNLGH LLALLSPAAE EYLEPMAQRA AALTRQRFGH NIGLYLPLYL SNLCANECDY CGFTMSNKLK RKVLSHDELA AEMAVIKPQG FDSILLVSGE HETKVGIEYF ADILPLVKAE FSHVAMEVQP LSREHYEILV EKGLDAVMLY QETYDPETYR RHHLRGNKQD YGYRLASPER IAQAGVDKIG LGVLLGLDDW RMDALLMGYH LDYLERRFWR SRYSISLPRL RPCVGGITPK VQLTDKGLVQ LICAFRLFNE QLEISLSTRE TPSLRDNLLG LGITQMSAGS RTEPGGYVNP AAQLDQFEIS DERSAAEVAS VLRSRGFTPV WKDWEAGWIG AG
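The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to join such blocks and run basic sanity checks on a coding sequence. It rests on assumptions: the helper names are hypothetical, only ATG, GTG and TTG are accepted as start codons (translation table 11 permits further alternatives), and TAA, TAG and TGA are treated as the stop codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons under translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: most common bacterial starts only


def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first/last codons are plausible."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = join_blocks("ATGAGTTTCT TCGATACCTT")
print(len(first_blocks))  # 20
```

Applying join_blocks to the full Gene sequence field and passing the result to looks_like_cds and gc_fraction allows a comparison with the Gene Length and GC content fields.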