Gene Information |
Locus tag | Shewmr4_3253 |
Symbol | thiH |
ID | 4253821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 3886019 |
End bp | 3887458 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638119893 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_735378 |
Protein GI | 113971585 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
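
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3886019
END_BP = 3887458


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1440, matching "Gene Length"
print(protein_length(length_bp))  # 479, matching "Protein Length"
```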
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000173942 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACAC ACGAGCATCA TTCCATTACC GTCTCTGACT ATAATCCCAA CGTCAGCTTT ATTGACGATC AGGCGATTTG GCAGGCCATT GAAGACGCCA GTCATCCGAG TCGCGAACAA ATCCAAGCCA TTCTCCAAAA GGCGCGCCAA TGCGAAGGCT TAAGCATTCG CGAAACCGCT CTCCTGCTAC AAAATCAAGA TAAAGCCCTG GATGAAGCAC TCTTTGCCGT CGCCCGTGAA ATAAAAAACA CCATCTACGG CAATCGTATA GTGATGTTTG CGCCGCTCTA TGTGTCCAAC CATTGTGCCA ACAGTTGTAG TTACTGCGGC TTTAACGCCG ATAACCATGA ACTAAAACGC AAGACCTTAA AACAGGATGA GATCCGCCAA GAGGTCACCA TCCTCGAAGA AATGGGCCAC AAACGGATCT TGGCCGTTTA TGGCGAGCAT CCACGCAACA ATGTGCAAGC CATTGTTGAC AGTATTCAAA CCATGTACAG CGTTAAGCAG GGCAAGGGTG GAGAAATTCG CCGTATCAAT GTCAACTGCG CGCCAATGAG TGTGGAAGAC TTTAAACAGC TTAAAACGGC GGCGATAGGC ACTTATCAAT GTTTCCAAGA AACCTATCAT CAAGACACTT ACAGTAAAGT GCACCTAAAA GGTAAAAAAA CCGACTTTTT ATACCGACTC TACGCCATGC ACAGGGCGAT GGAAGCAGGA ATCGACGATG TCGGTATCGG TGCGCTCTTT GGCCTGTATG ACCATAGATT TGAGCTGCTC GCCATGCTCA CCCATGTTCA GCAACTCGAA AAAGACTGTG GCGTTGGCCC ACATACCATC TCCTTCCCGC GGATTGAACC CGCCCATGGC TCTGCCCTTA GTGAAAAGCC GCCCTATGAG GTTGATGATG AGTGCTTCAA GCGTATCGTT GCTATCACTC GCCTAGCCGT ACCTTATACC GGGCTGATTA TGAGCACACG GGAGAGCGCT GCGCTGCGTA AAGAATTGTT AGAGCTCGGT GTTTCACAGA TCAGTGCGGG CTCACGCACT GCGCCGGGTG GCTATCAAGA CAGCAAACAA AATCAACACG ATGCCGAACA ATTTAGCCTT GGTGATCATC GCGCTATGGA TGAGATCATC TATGAATTAG TTACAGATTC GGATGCCATC CCCTCCTTCT GCACGGGCTG TTACCGTAAA GGGCGCACAG GCGATCACTT TATGGGATTA GCCAAACAGC AGTTTATTGG CAAATTCTGC CAGCCCAATG CCTTGATCAC CTTTAGGGAA TATCTGAACG ACTACGCCAG CGATAAAACC CGTGAGGCAG GTAACGCCCT GATAGAGCGA GAGCTCGCCA AAATGAGTCC ATCACGGGAA CGTAACGTAC GCGTCTGCCT GAAAAAAACC GATGCGGGTG AACGGGATAT CTATCTATAA
|
Protein sequence | MSTHEHHSIT VSDYNPNVSF IDDQAIWQAI EDASHPSREQ IQAILQKARQ CEGLSIRETA LLLQNQDKAL DEALFAVARE IKNTIYGNRI VMFAPLYVSN HCANSCSYCG FNADNHELKR KTLKQDEIRQ EVTILEEMGH KRILAVYGEH PRNNVQAIVD SIQTMYSVKQ GKGGEIRRIN VNCAPMSVED FKQLKTAAIG TYQCFQETYH QDTYSKVHLK GKKTDFLYRL YAMHRAMEAG IDDVGIGALF GLYDHRFELL AMLTHVQQLE KDCGVGPHTI SFPRIEPAHG SALSEKPPYE VDDECFKRIV AITRLAVPYT GLIMSTRESA ALRKELLELG VSQISAGSRT APGGYQDSKQ NQHDAEQFSL GDHRAMDEII YELVTDSDAI PSFCTGCYRK GRTGDHFMGL AKQQFIGKFC QPNALITFRE YLNDYASDKT REAGNALIER ELAKMSPSRE RNVRVCLKKT DAGERDIYL
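
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it assumes the space-separated 10-base groups are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs mainly in the start codons it permits, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence in the record.
print(translate("ATGAGCACAC ACGAGCATCA TTCCATTACC"))  # MSTHEHHSIT
print(translate("GATATCTATC TATAA"))                  # DIYL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the Protein sequence and GC content fields.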