Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcar_1631 |
Symbol | thiH |
ID | 3724331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter carbinolicus DSM 2380 |
Kingdom | Bacteria |
Replicon accession | NC_007498 |
Strand | - |
Start bp | 1905681 |
End bp | 1907105 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637751226 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_357045 |
Protein GI | 77919230 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0000108196 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGCTT TACCATCCAT GGAACTCAGT AAAAACGCCG TCGATTTTAT CGATGAGAAC CACCTCAATG CGCTTTTGGC CGGCAAGAAA CCGGATGCCA CTCGGATTCG CGAAATTATT GCCAAAAGTC TGGCTAAAGA AGCGCTTTCC GTTGAGGAAA CGGCTGAACT TGTTCTGACC GACGATCCTG CGCTGATTGA GGAAATATTT GCCGCAGCCC GGGAACTTAA AAAAACCGTT TACGGTAATC GTATCGTTCT GTTTGCGCCT TTATATATCG GCAATGACTG TATTAATGAT TGCACCTATT GTGCATTTAA GCGGTCGAAT TTTGATGCGA TACGGCGCAC TTTGACTCCC GAGGAAATAG GTCAGCAGGT CGTTGCCCTT GAGGATAAGG GACACAAACG TCTGATACTG GTATTCGGAG AACATCCCAA ATACGATGCC GATTTTATTG CGGATACGGT AAAAAATGTT TATTCCGTCA AATCCGGAAA CGGGGAGATT CGCCGCGTAA ATATCAATGC TGCGCCTCTC GATATCGAAG GCTATAAAAA GGTCAAAGAG GCAGGGATCG GCACGTACCA GATTTTCATG GAAACCTATC ATCACGATAC CTATTCCATG ATGCATCCCG GCAATACCCG AAAAGGAAAT TACCTTTATC GACTTGACGG TCTGAGTCGT GCATTTGAAG CCGGTTGCGA CGACGTCGGG CTCGGTGTTC TTTTCGGTTT ATATGACTGG CGTTTCGAAG TGCTTAGCAT GGTTCGTCAT GCATTGTATC TTCAGGAGCG GTACAATGTC GGCCCTCACA CATTGAGTTT CCCCAGGCTC CGTCCTGCTC AAGGGGTTGA CTTCAACGAA GAGTATTTCG TCGACGACGA GGACTTCAAG CGTATTATAG CTATCTTGCG ACTTGCGGTA CCCTATACGG GGCTGATTCT CACCGCCCGC GAAAAACCTG AACTGCGCAG AGAGCTGATG TCCTTCGGTG TTTCTCAAAT CGATGCCGGC AGCCGTATCG AACTCGGCGG ATACACCGAA GCGGGAGATG CCCAGGTTAT GGAACGGGAA CAGTTCAGCC TTGGCGATAT TCGTTCCCTG GATGAGGTTA TGTGCGAGTT GATTAGCGAT GGTTATGTTC CCAGTTTCTG TACCTCCTGT TATCGCAGCG GTCGCACAGG CGAACATTTT ATGGAGTTCA GTATCCCCGG TTTCATCAAG CGTTACTGTA CCCCCAATGC GTTGTTGACC CTGGAAGAGT ACCTGGTCGA TTATGCCTCC GAGGAAACAC GGGCTGTCGG TGAAAAACTT ATTGCCGAAG AACTTGCCAA AATGGAAGAT GGCGAGATGA AAAACCGTAC TCTTAAACAA CTGGAAGAAA TCAAGGACCG CAACGTTCGC GATATCTATT TTTGA
|
Protein sequence | MCALPSMELS KNAVDFIDEN HLNALLAGKK PDATRIREII AKSLAKEALS VEETAELVLT DDPALIEEIF AAARELKKTV YGNRIVLFAP LYIGNDCIND CTYCAFKRSN FDAIRRTLTP EEIGQQVVAL EDKGHKRLIL VFGEHPKYDA DFIADTVKNV YSVKSGNGEI RRVNINAAPL DIEGYKKVKE AGIGTYQIFM ETYHHDTYSM MHPGNTRKGN YLYRLDGLSR AFEAGCDDVG LGVLFGLYDW RFEVLSMVRH ALYLQERYNV GPHTLSFPRL RPAQGVDFNE EYFVDDEDFK RIIAILRLAV PYTGLILTAR EKPELRRELM SFGVSQIDAG SRIELGGYTE AGDAQVMERE QFSLGDIRSL DEVMCELISD GYVPSFCTSC YRSGRTGEHF MEFSIPGFIK RYCTPNALLT LEEYLVDYAS EETRAVGEKL IAEELAKMED GEMKNRTLKQ LEEIKDRNVR DIYF
|
| |