Gene Apre_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1631 
SymbolthiH 
ID8398443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1775534 
End bp1776691 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content40% 
IMG OID644995995 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_003153373 
Protein GI257067117 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTA AAAGCGTAAA TGATTATTTC CCAGAAATGG ATATAATCGA CTCAGATATA 
AAAGAAAGAA TTGAAAAAGC TTACGATAGA GTAAAAGATA CAGATGTAAG TGAGGCTGAT
GTCTTGGCAA GTCTTAATAA GAAAAATCTC TCAGAAAGAG ACTTCTACAA TCTCATAAGC
GACAAGGCAG AAGATCACTT AGAAGAGATG GCAGAGCTTG CCAAAGATGC TAGGATTAGA
TATTTTGGAA ACAATGTATG CCTATTTTCT CCGATCTATA TAGCTAATTA CTGCGAAAAT
TCCTGCAGAT ATTGTGGTTT TAGGGCAAAA AGCGATATCA AAAGAGCTAA GCTTAACCTA
GAAGAGATCG AAGAAGAGAT GAAGGCTTTG GCAGAAACTG GGATTGAAGA TGTCCTAATC
CTTACTGGTG AAAGCGAGAG ATTTTCTTCT GTAGATTATA TAGGAGAAGC TTGTAGAATT
GCCAGCAAAT ATTTTAAGGT AGTAGGAATA GAAGTATATC CCGCAAATGT TTCTTCTTAC
GAGAAATTAA GAGAGGCTGG GGCGGATTTC GTTACAGTCT TCCAGGAATC CTACAACAAG
AAAGCTTTCG ACTACTATCA TCCCGCAGGG CATAAGAGAA GCTTTAACTA TAGAATCGAC
ACCCAAGAGC GAGCTCTTAT GGCAGGCTTT AGGGGAGTGG GTTTTGGGGC CCTCTTTGGA
CTTTCTGATC CTATAGAAGA TGCTTTTAAG CTTGCCATCC ACGCCAAGGA AGTTCAAAGG
AAATATCCTC AGGCAGAAAT TGCAATCTCT CTTCCAAGGA TTAGGCCAAC CCACGGGGCG
GATGATACTC TAGACTTTAA TATCGTAGAT GATAAGAAAT TCTTCCAAAT CATGCTTGCA
ATTAGAATGT TTCTGCCTTT TGCCTCTATT ACCCTTTCAA CACGTGAGTC AAAGGACTTT
AGGGACTTGG CTGTGAAATA TGCTGCGACT AAAATCTCTG CATCAGTAGA TACTGCTATA
GGACACAGGT CAAAGAAAAG TGCTGATGAG GGAGATGAGC AGTTTGAGAT TGACGATTCT
CGTTCTACCG AACAAGCCTT TGAGGACCTC AAGAAAATCG GCATGACACC AGTCTTTACT
GATTATATAA ATTTATAG
 
Protein sequence
MNIKSVNDYF PEMDIIDSDI KERIEKAYDR VKDTDVSEAD VLASLNKKNL SERDFYNLIS 
DKAEDHLEEM AELAKDARIR YFGNNVCLFS PIYIANYCEN SCRYCGFRAK SDIKRAKLNL
EEIEEEMKAL AETGIEDVLI LTGESERFSS VDYIGEACRI ASKYFKVVGI EVYPANVSSY
EKLREAGADF VTVFQESYNK KAFDYYHPAG HKRSFNYRID TQERALMAGF RGVGFGALFG
LSDPIEDAFK LAIHAKEVQR KYPQAEIAIS LPRIRPTHGA DDTLDFNIVD DKKFFQIMLA
IRMFLPFASI TLSTRESKDF RDLAVKYAAT KISASVDTAI GHRSKKSADE GDEQFEIDDS
RSTEQAFEDL KKIGMTPVFT DYINL