Gene Apre_0273 details


Gene Information

Locus tag: Apre_0273
Symbol: thiH
ID: 8397047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 304728
End bp: 306146
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 40%
IMG OID: 644994633
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_003152045
Protein GI: 257065789
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
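
The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the record states neither, and the variable names are illustrative.

```python
# Values copied from the gene record above.
start_bp = 304728
end_bp = 306146
gene_length_bp = 1419
protein_length_aa = 472

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the number of amino acids is (number of codons) - 1.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0 and coding_codons == protein_length_aa)
```

Under these assumptions both checks hold: the span is 1419 bp, which is 473 codons, leaving 472 amino acids once the stop codon is set aside.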


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGTAT ATAATCCAAA ATCTAGCAAG GCTAGTGAAT TTATTAACCA TGAAGAAATT 
TTAGAGACTC TTGATTATGG TAGAGAAAAC GCCCATAATA GAGAGCTTAT CATAGATATT
TTAAATAAGG CAAAAAAGGC TAAGGGCCTA AGCCACAGGG AAGCCTTTGT GCTTTTATCT
TGTAAGGAAG AAGACCTAAA CGCAGAGATC TTTAACCTTG CCAAAGAGCT TAAACACAAG
TTTTACGCAA ATAGAATAGT GCTTTTCGCT CCTTTGTATT TGTCTAACTA TTGTGTGAAT
GGTTGCTCTT ATTGTCCTTA CCACGGACAA AACAGAACTA TTCCAAGAAG AAAGCTAAGT
CAAGAAGAAA TTCGTGAGCA AGTTATAGCT CTACAAGACT TAGGACACAA GAGACTTGCC
CTAGAAGCTG GAGAAGATCC AGTAAATAAC CCTCTAGAAT ACATACTAGA GTCGATAGAT
ACAATCTATA ATATTAAACA CAAAAACGGA GCTATAAGAA GGGTAAACGT AAATATCGCA
GCTACTACTG TTGAAAATTA CAGAAAGCTT CACGAGGCAG AAATCGGAAC TTACATCCTC
TTCCAAGAAA CTTACAACAA GGAAAATTAC GAGTCTCTCC ACAAGTTTGG TCCAAAATCT
AACTACGAAT ATCACACTGA AGCAATGGAT AGGGCCTTCG AGGGAGGAAT CGACGATTTA
GGACTTGGAG TTTTGTATGG ACTCGATTCT TATGAATATG AATTTGTTGG CCAATTAATG
CACGCAGAAC ACCTTGAGGC GAGATTTAAT GTCGGCCCTC ATACAATCTC TGTGCCTAGA
ATCCAACCTG CTGACGATAT TGATCCAAAC GACTTCGACA ATTCCCTATC AGATGAAATG
TTTGAGAAAA TCGTAGCCTG CATCAGAGTT GCAGCACCAT ATACTGGTAT GATCGTATCT
ACTAGGGAGA GCGAAAAGGT CCGTGCTAAG CTCTTAGACC TTGGTATTTC CCAAATCTCC
GGAGGTTCCA AGACTTCTGT AGGAGGATAT ACAGAAAACG AAACCAAGGG AAGTGATCAA
TTCGAACTAT CAGATAACAG AACTCTAGAT GAAGTAATCG ACTGGCTAAT ATCAAAAGAC
CACGTACCAA GTTTCTGTAC AGCTTGCTAC AGGATGGGAA GAACTGGCGA GACCTTTATG
GGCATGGTCA AAAAACACGG AATCGGTAAT ATTTGCCATC CAAACGCCCT CACAACTCTC
GAAGAATATG TAATAGATTA CGCAAGTGAT AAGACAGCCA AGGATGCCGA AGCCCTTATC
CAAAGAGAGC TAGCTAATAT TCCAAATCCA GAAGTCCGTG AATCAACTAG GGAAAATCTA
GAAAAAATCA AACAAGGACA AAGAGACCTT TATATATAA
 
Protein sequence
MVVYNPKSSK ASEFINHEEI LETLDYGREN AHNRELIIDI LNKAKKAKGL SHREAFVLLS 
CKEEDLNAEI FNLAKELKHK FYANRIVLFA PLYLSNYCVN GCSYCPYHGQ NRTIPRRKLS
QEEIREQVIA LQDLGHKRLA LEAGEDPVNN PLEYILESID TIYNIKHKNG AIRRVNVNIA
ATTVENYRKL HEAEIGTYIL FQETYNKENY ESLHKFGPKS NYEYHTEAMD RAFEGGIDDL
GLGVLYGLDS YEYEFVGQLM HAEHLEARFN VGPHTISVPR IQPADDIDPN DFDNSLSDEM
FEKIVACIRV AAPYTGMIVS TRESEKVRAK LLDLGISQIS GGSKTSVGGY TENETKGSDQ
FELSDNRTLD EVIDWLISKD HVPSFCTACY RMGRTGETFM GMVKKHGIGN ICHPNALTTL
EEYVIDYASD KTAKDAEALI QRELANIPNP EVRESTRENL EKIKQGQRDL YI
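
The sequences above are printed in blocks of ten residues, so they need cleaning before any length or composition check. The following is a minimal Python sketch under that assumption; the helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database. It is demonstrated on the first line of the gene sequence only.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above (six blocks of ten bases).
first_line = "ATGGTAGTAT ATAATCCAAA ATCTAGCAAG GCTAGTGAAT TTATTAACCA TGAAGAAATT"
dna = clean_sequence(first_line)

print(len(dna))          # number of bases after removing the spacing
print(dna[:3])           # first codon
print(gc_fraction(dna))  # GC fraction of this line only, not of the whole gene
```

Applying the same two helpers to the full gene sequence gives values that can be compared with the reported gene length (1419 bp) and GC content (40%).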