Gene Information |
Locus tag | Shewana3_0708 |
Symbol | thiH |
ID | 4476918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | - |
Start bp | 822821 |
End bp | 824260 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639725243 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_868352 |
Protein GI | 117919160 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
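
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record (Start bp, End bp).
start_bp = 822821
end_bp = 824260

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1440 479
```

Under these assumptions the computed values agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields.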
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.519768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000306942 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAGCACAC ACGAGCATCA TTCCATTACC GTCTCTGACT ATAATCCCAA CGTCAGCTTT ATTGACGATC TGGCGATTTG GCAGGCCATT GAAGAGGCCA GCAATCCGAG TCGTGAACAA ATCCAAGCCA TTCTCGAAAA GGCGCGCCAA TGCGAAGGCT TAAGCATTCG CGAAACCGCT CTCCTGCTAC AAAATCAAGA TAAAGCGCTG GATGAAGCAC TCTTTGCCGT CGCCCGTGAG ATTAAAAACA CCATCTACGG CAATCGTATA GTGATGTTTG CGCCACTCTA TGTCTCCAAC CATTGTGCCA ACAGTTGTAG TTACTGCGGC TTTAATGCCG ATAACCATGA GCTGAAACGC AAGACCTTAA AACAGGATGA GATCCGCCAA GAGGTCACCA TCCTCGAAGA AATGGGCCAC AAACGGATCT TGGCCGTTTA TGGCGAGCAT CCACGCAACA ATGTGCAAGC CATTATTGAA AGTATTCAAA CCATGTACAG CGTGAAGCAG GGCAAGGGCG GCGAAATTCG CCGTATCAAC GTCAACTGTG CGCCAATGAG TGTGGAGGAC TTTAAACAGC TCAAAACGGC GGCGATAGGC ACTTACCAAT GCTTCCAAGA AACCTATCAT CAAGACACTT ACAGTGAAGT GCACCTAAAA GGTAAAAAAA CTGACTTTTT ATACCGCCTC TACGCCATGC ACAGAGCCAT GGAAGCGGGA ATCGACGATG TCGGTATCGG CGCCCTCTTT GGCCTGTATG ACCATAGATT TGAGCTGCTC GCTATGCTCA CTCATGTTCA ACAACTCGAA AAAGACTGTG GCGTTGGCCC GCATACTATC TCCTTTCCGC GGATAGAACC CGCCCATGGC TCTGCCCTTA GTGAAAAGCC GCCCTATGAG GTTGATGATG AGTGCTTTAA GCGTATCGTT GCCATCACTC GCCTCGCCGT ACCTTATACA GGCTTGATTA TGAGCACGCG GGAGAGCGCC GCAATGCGCA AAGAATTGTT AGAGCTTGGC GTTTCACAGA TCAGTGCAGG CTCACGCACT GCACCCGGTG GTTATCAAGA CAGCAAACAA AATCAACACG ATGCCGAACA ATTTAGCCTT GGCGATCATC GCGCCATGGA TGAAATCATC TATGAATTAG TCACAGATTC GGATGCCATC CCCTCCTTCT GTACGGGCTG TTACCGTAAA GGGCGCACAG GCGACCACTT TATGGGATTA GCCAAGCAGC AGTTTATTGG CAAGTTCTGC CAGCCCAATG CCTTAATCAC CTTTAGGGAA TATCTGAACG ACTACGCCAG CGATAAAACC CGTGAAGCAG GTAACGCCCT GATAGAGCGA GAACTCGCCA AAATGAGTCC ATCACGGGAA CGTAATGTGC GCGTTTGCCT GAAAAAAACC GATGCGGGTG AACGGGATAT CTATTTGTAA
Protein sequence | MSTHEHHSIT VSDYNPNVSF IDDLAIWQAI EEASNPSREQ IQAILEKARQ CEGLSIRETA LLLQNQDKAL DEALFAVARE IKNTIYGNRI VMFAPLYVSN HCANSCSYCG FNADNHELKR KTLKQDEIRQ EVTILEEMGH KRILAVYGEH PRNNVQAIIE SIQTMYSVKQ GKGGEIRRIN VNCAPMSVED FKQLKTAAIG TYQCFQETYH QDTYSEVHLK GKKTDFLYRL YAMHRAMEAG IDDVGIGALF GLYDHRFELL AMLTHVQQLE KDCGVGPHTI SFPRIEPAHG SALSEKPPYE VDDECFKRIV AITRLAVPYT GLIMSTRESA AMRKELLELG VSQISAGSRT APGGYQDSKQ NQHDAEQFSL GDHRAMDEII YELVTDSDAI PSFCTGCYRK GRTGDHFMGL AKQQFIGKFC QPNALITFRE YLNDYASDKT REAGNALIER ELAKMSPSRE RNVRVCLKKT DAGERDIYL
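
Both sequences are printed in space-separated groups of ten characters, so the grouping has to be removed before any analysis. The sketch below is one hedged way to do that and to check the start of the gene against the start of the protein. It uses a hand-built standard codon table, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `ungroup`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 nucleotides are used for brevity.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the ungrouped sequence."""
    dna = ungroup(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = "ATGAGCACAC ACGAGCATCA TTCCATTACC"
protein_prefix = "MSTHEHHSIT"

print(translate(gene_prefix))                    # MSTHEHHSIT
print(translate(gene_prefix) == protein_prefix)  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the Protein sequence and GC content fields.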