Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2471 |
Symbol | |
ID | 8253578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2865213 |
End bp | 2866331 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644936121 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003092737 |
Protein GI | 255532365 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGTT TCAATGATAT TTTTAAAGCC TGCAGCTGGG AAGAAACCAG TAACAGCATT TACGCAAAGA CCAGTGCGGA TGTGGAACGG GCCCTGGCTT GGGGTACAGC TAAAAGAACG CTGGAAGATT TTAAAGCACT GCTCTCTCCT GCTGCTGCAC CTTACCTGGA ACAGATGGCC GAAATAAGCC GGCAGCTTAC CTTAAAACGC TTTGGGCGGG TGCTGCAAAT GTATGTGCCC CTGTACCTTT CCAATGAATG CAACAACATC TGCACGTATT GCGGCTTTAG CTACGACAAT AAAGTGAGGC GCAAGACCCT CTCTCCTATA GAGATCATGC AGGAAGTGGC GGCTATTAAA GAAATGGGTT TCGATCATGT ATTGCTGGTT ACGGGCGAGG CCAGCCAGTC GGTACATACC GCTTATTTTA AACAGGTGCT GGAACTGATC CGTCCGCATT TTGCGCAGAT CTCTATGGAA GTTCAGCCTT TAGACCTGGC CGATTACGAA GAGCTACGAC CCTATGGTTT AAATACCGTG CTGGTATATC AGGAAACCTA TCACCAGGAA GATTATAAAA AGCATCACCC CAGGGGTAAG AAATCCAATT TCCTGTACCG GCTGGAAACG CCCGACCGGC TGGGCCAGGC AGGCATACAT AAAATAGGCC TGGGGGTGTT GATTGGCCTG GAGGACTGGC GTACGGATTC ATTTTTTACG GCTTTGCACC TGGATTACCT GGAAAAAACC TACTGGCAAA GCAAATACAG CATTTCATTT CCGAGGTTGC GGCCTTTTAG CGGGGGGCTG GAGCCTAAGG TGGCGATGAG CGACAGGGAG CTGGTGCAGC TGATCTGCGC TTACCGCTTG TTTAATGAGG AGGTTGAGCT GTCCATCTCG ACCAGGGAAT CGCAGGTATT CAGGGACAAT ATCATTAAGC TGGGCATTAC TGCCATGAGT GCAGGTTCTA AGACCAATCC CGGTGGCTAT GTGGTAGAAC CGGCCTCGCT GGAGCAGTTT GAGATATCGG ACGAGCGCAG TGCAAAAGAA ATTGCGGCCA TGCTTGCACA GCAGGGCTAT GAAGCCGTTT GGAAAGATTG GGACAACAGT TTGGTTTAA
|
Protein sequence | MGSFNDIFKA CSWEETSNSI YAKTSADVER ALAWGTAKRT LEDFKALLSP AAAPYLEQMA EISRQLTLKR FGRVLQMYVP LYLSNECNNI CTYCGFSYDN KVRRKTLSPI EIMQEVAAIK EMGFDHVLLV TGEASQSVHT AYFKQVLELI RPHFAQISME VQPLDLADYE ELRPYGLNTV LVYQETYHQE DYKKHHPRGK KSNFLYRLET PDRLGQAGIH KIGLGVLIGL EDWRTDSFFT ALHLDYLEKT YWQSKYSISF PRLRPFSGGL EPKVAMSDRE LVQLICAYRL FNEEVELSIS TRESQVFRDN IIKLGITAMS AGSKTNPGGY VVEPASLEQF EISDERSAKE IAAMLAQQGY EAVWKDWDNS LV
|
| |