Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_0273 |
Symbol | thiH |
ID | 8397047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 304728 |
End bp | 306146 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644994633 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003152045 |
Protein GI | 257065789 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGTAT ATAATCCAAA ATCTAGCAAG GCTAGTGAAT TTATTAACCA TGAAGAAATT TTAGAGACTC TTGATTATGG TAGAGAAAAC GCCCATAATA GAGAGCTTAT CATAGATATT TTAAATAAGG CAAAAAAGGC TAAGGGCCTA AGCCACAGGG AAGCCTTTGT GCTTTTATCT TGTAAGGAAG AAGACCTAAA CGCAGAGATC TTTAACCTTG CCAAAGAGCT TAAACACAAG TTTTACGCAA ATAGAATAGT GCTTTTCGCT CCTTTGTATT TGTCTAACTA TTGTGTGAAT GGTTGCTCTT ATTGTCCTTA CCACGGACAA AACAGAACTA TTCCAAGAAG AAAGCTAAGT CAAGAAGAAA TTCGTGAGCA AGTTATAGCT CTACAAGACT TAGGACACAA GAGACTTGCC CTAGAAGCTG GAGAAGATCC AGTAAATAAC CCTCTAGAAT ACATACTAGA GTCGATAGAT ACAATCTATA ATATTAAACA CAAAAACGGA GCTATAAGAA GGGTAAACGT AAATATCGCA GCTACTACTG TTGAAAATTA CAGAAAGCTT CACGAGGCAG AAATCGGAAC TTACATCCTC TTCCAAGAAA CTTACAACAA GGAAAATTAC GAGTCTCTCC ACAAGTTTGG TCCAAAATCT AACTACGAAT ATCACACTGA AGCAATGGAT AGGGCCTTCG AGGGAGGAAT CGACGATTTA GGACTTGGAG TTTTGTATGG ACTCGATTCT TATGAATATG AATTTGTTGG CCAATTAATG CACGCAGAAC ACCTTGAGGC GAGATTTAAT GTCGGCCCTC ATACAATCTC TGTGCCTAGA ATCCAACCTG CTGACGATAT TGATCCAAAC GACTTCGACA ATTCCCTATC AGATGAAATG TTTGAGAAAA TCGTAGCCTG CATCAGAGTT GCAGCACCAT ATACTGGTAT GATCGTATCT ACTAGGGAGA GCGAAAAGGT CCGTGCTAAG CTCTTAGACC TTGGTATTTC CCAAATCTCC GGAGGTTCCA AGACTTCTGT AGGAGGATAT ACAGAAAACG AAACCAAGGG AAGTGATCAA TTCGAACTAT CAGATAACAG AACTCTAGAT GAAGTAATCG ACTGGCTAAT ATCAAAAGAC CACGTACCAA GTTTCTGTAC AGCTTGCTAC AGGATGGGAA GAACTGGCGA GACCTTTATG GGCATGGTCA AAAAACACGG AATCGGTAAT ATTTGCCATC CAAACGCCCT CACAACTCTC GAAGAATATG TAATAGATTA CGCAAGTGAT AAGACAGCCA AGGATGCCGA AGCCCTTATC CAAAGAGAGC TAGCTAATAT TCCAAATCCA GAAGTCCGTG AATCAACTAG GGAAAATCTA GAAAAAATCA AACAAGGACA AAGAGACCTT TATATATAA
|
Protein sequence | MVVYNPKSSK ASEFINHEEI LETLDYGREN AHNRELIIDI LNKAKKAKGL SHREAFVLLS CKEEDLNAEI FNLAKELKHK FYANRIVLFA PLYLSNYCVN GCSYCPYHGQ NRTIPRRKLS QEEIREQVIA LQDLGHKRLA LEAGEDPVNN PLEYILESID TIYNIKHKNG AIRRVNVNIA ATTVENYRKL HEAEIGTYIL FQETYNKENY ESLHKFGPKS NYEYHTEAMD RAFEGGIDDL GLGVLYGLDS YEYEFVGQLM HAEHLEARFN VGPHTISVPR IQPADDIDPN DFDNSLSDEM FEKIVACIRV AAPYTGMIVS TRESEKVRAK LLDLGISQIS GGSKTSVGGY TENETKGSDQ FELSDNRTLD EVIDWLISKD HVPSFCTACY RMGRTGETFM GMVKKHGIGN ICHPNALTTL EEYVIDYASD KTAKDAEALI QRELANIPNP EVRESTRENL EKIKQGQRDL YI
|
| |