Gene Pcar_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcar_1631 
SymbolthiH 
ID3724331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter carbinolicus DSM 2380 
KingdomBacteria 
Replicon accessionNC_007498 
Strand
Start bp1905681 
End bp1907105 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content48% 
IMG OID637751226 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_357045 
Protein GI77919230 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0000108196 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTGCTT TACCATCCAT GGAACTCAGT AAAAACGCCG TCGATTTTAT CGATGAGAAC 
CACCTCAATG CGCTTTTGGC CGGCAAGAAA CCGGATGCCA CTCGGATTCG CGAAATTATT
GCCAAAAGTC TGGCTAAAGA AGCGCTTTCC GTTGAGGAAA CGGCTGAACT TGTTCTGACC
GACGATCCTG CGCTGATTGA GGAAATATTT GCCGCAGCCC GGGAACTTAA AAAAACCGTT
TACGGTAATC GTATCGTTCT GTTTGCGCCT TTATATATCG GCAATGACTG TATTAATGAT
TGCACCTATT GTGCATTTAA GCGGTCGAAT TTTGATGCGA TACGGCGCAC TTTGACTCCC
GAGGAAATAG GTCAGCAGGT CGTTGCCCTT GAGGATAAGG GACACAAACG TCTGATACTG
GTATTCGGAG AACATCCCAA ATACGATGCC GATTTTATTG CGGATACGGT AAAAAATGTT
TATTCCGTCA AATCCGGAAA CGGGGAGATT CGCCGCGTAA ATATCAATGC TGCGCCTCTC
GATATCGAAG GCTATAAAAA GGTCAAAGAG GCAGGGATCG GCACGTACCA GATTTTCATG
GAAACCTATC ATCACGATAC CTATTCCATG ATGCATCCCG GCAATACCCG AAAAGGAAAT
TACCTTTATC GACTTGACGG TCTGAGTCGT GCATTTGAAG CCGGTTGCGA CGACGTCGGG
CTCGGTGTTC TTTTCGGTTT ATATGACTGG CGTTTCGAAG TGCTTAGCAT GGTTCGTCAT
GCATTGTATC TTCAGGAGCG GTACAATGTC GGCCCTCACA CATTGAGTTT CCCCAGGCTC
CGTCCTGCTC AAGGGGTTGA CTTCAACGAA GAGTATTTCG TCGACGACGA GGACTTCAAG
CGTATTATAG CTATCTTGCG ACTTGCGGTA CCCTATACGG GGCTGATTCT CACCGCCCGC
GAAAAACCTG AACTGCGCAG AGAGCTGATG TCCTTCGGTG TTTCTCAAAT CGATGCCGGC
AGCCGTATCG AACTCGGCGG ATACACCGAA GCGGGAGATG CCCAGGTTAT GGAACGGGAA
CAGTTCAGCC TTGGCGATAT TCGTTCCCTG GATGAGGTTA TGTGCGAGTT GATTAGCGAT
GGTTATGTTC CCAGTTTCTG TACCTCCTGT TATCGCAGCG GTCGCACAGG CGAACATTTT
ATGGAGTTCA GTATCCCCGG TTTCATCAAG CGTTACTGTA CCCCCAATGC GTTGTTGACC
CTGGAAGAGT ACCTGGTCGA TTATGCCTCC GAGGAAACAC GGGCTGTCGG TGAAAAACTT
ATTGCCGAAG AACTTGCCAA AATGGAAGAT GGCGAGATGA AAAACCGTAC TCTTAAACAA
CTGGAAGAAA TCAAGGACCG CAACGTTCGC GATATCTATT TTTGA
 
Protein sequence
MCALPSMELS KNAVDFIDEN HLNALLAGKK PDATRIREII AKSLAKEALS VEETAELVLT 
DDPALIEEIF AAARELKKTV YGNRIVLFAP LYIGNDCIND CTYCAFKRSN FDAIRRTLTP
EEIGQQVVAL EDKGHKRLIL VFGEHPKYDA DFIADTVKNV YSVKSGNGEI RRVNINAAPL
DIEGYKKVKE AGIGTYQIFM ETYHHDTYSM MHPGNTRKGN YLYRLDGLSR AFEAGCDDVG
LGVLFGLYDW RFEVLSMVRH ALYLQERYNV GPHTLSFPRL RPAQGVDFNE EYFVDDEDFK
RIIAILRLAV PYTGLILTAR EKPELRRELM SFGVSQIDAG SRIELGGYTE AGDAQVMERE
QFSLGDIRSL DEVMCELISD GYVPSFCTSC YRSGRTGEHF MEFSIPGFIK RYCTPNALLT
LEEYLVDYAS EETRAVGEKL IAEELAKMED GEMKNRTLKQ LEEIKDRNVR DIYF