Gene Ppha_2004 details


Gene Information

Locus tag: Ppha_2004
Symbol: thiH
ID: 6462958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2091942
End bp: 2093042
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 57%
IMG OID: 642728203
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_002018833
Protein GI: 194337039
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
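
The coordinate and length fields above agree with each other, which a short calculation can confirm. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2091942
end_bp = 2093042

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Every codon is 3 bp; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1101 366
```

Both results match the Gene Length (1101 bp) and Protein Length (366 aa) fields.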


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGAGA TACCCGACTG GCTCACGGAC AATCGGGAGA CGATGGCGCT TGCCGCCATG 
CTTGCGCCCC CCTATGCATC GGATTCCCTC GAAGCGCTTG CTGCCGAATC GAGAGCGATC
ACCCTGCGTC GTTTTGGACG CACCATGACG CTCTACGCCC CGCTCTACCT TTCGAACTAC
TGTTCCAGCG GCTGCGTCTA CTGCGGTTTT GCCTCCGACC GAAAAACTCT GCGCCACCGC
CTTGAACCCG ATGAGATCGT CCGGGAGCTG AAGGCCATGA AAAAGCTCGG CATCAGCGAC
ATCCTTCTCC TCACCGGCGA ACGGACAGCC GCTGCCGACT TCGACTACCT CCGAAAAAGT
GTTGAAATCG CTGCACAGGA GATGCAACGG GTCTCGGTTG AGGCCTTTCC TATGAGCGTC
AGTGAATACC GCGCACTTGC CGACTGCGGC TGCACCGGCA TCACCATATA CCAGGAGACC
TACAACCGGG AGCGTTACGA GGCGCTGCAC CGCTGGGGGC CAAAAAAAGA TTATATCGAC
CGGCTTGAAA CCCCGGCAAG AGCCCTCGAA GGGGGCATAA AAAACGTCGG ACTCGGAGTG
CTGTTCGGCC TCTCCGATCC GGTTGAAGAT GCTCTGGCCC TCTACCGGCA CCTTCGATAT
CTTGGCAGAA CCTGGTGGCG TGCAGGCATG TCACTCTCCT TTCCCCGCAT GAGACCCCAG
ACCGGCGGTT ATGAGCCCCC GTTCCCCGTT GATGACCACC TTCTCGCCCG CATGATCTTT
GCCTTCCGCA TAGCGCTGCC GGATACGGAG CTGGTTCTCT CAACAAGGGA GAGCGCAGCT
TACCGCGACG GCATGGCAGG ACTGGGCATT ACCCGAATGA GCATTGAAAG CCGCACCACC
GTTGGCGGCT ACGATAACCC GGAGAACAAG GAAGAGGGAC AGTTTGAAAT TTTTGACGAC
CGCACCGCCA AAGAATTTTG CACCGCGCTG CGCAAAAAAA ATATAGAACC TGTCTTTAAA
AACTGGGAAC CAGCCTATAA TGGTCCGTCT GATGGCAACA AGACAACAAT GCATCATGGA
GAAGCGGAAA CATGCCATTG A
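
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal Python sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical. The sequence is printed in 10-base blocks, so whitespace is stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G/C bases, ignoring spaces and newlines."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 5 G, 0 C, 4 A, 1 T.
print(gc_content("ATGAGGGAGA"))  # 0.5
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported figure.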
 
Protein sequence
MREIPDWLTD NRETMALAAM LAPPYASDSL EALAAESRAI TLRRFGRTMT LYAPLYLSNY 
CSSGCVYCGF ASDRKTLRHR LEPDEIVREL KAMKKLGISD ILLLTGERTA AADFDYLRKS
VEIAAQEMQR VSVEAFPMSV SEYRALADCG CTGITIYQET YNRERYEALH RWGPKKDYID
RLETPARALE GGIKNVGLGV LFGLSDPVED ALALYRHLRY LGRTWWRAGM SLSFPRMRPQ
TGGYEPPFPV DDHLLARMIF AFRIALPDTE LVLSTRESAA YRDGMAGLGI TRMSIESRTT
VGGYDNPENK EEGQFEIFDD RTAKEFCTAL RKKNIEPVFK NWEPAYNGPS DGNKTTMHHG
EAETCH
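
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal, dependency-free Python sketch of that translation follows. It assumes the sequence contains only the unambiguous bases A, C, G and T, and the names `CODON_TABLE` and `translate` are illustrative. Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard amino acid string is used here.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired
# with the amino acid string shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAGGGAGA TACCCGACTG GCTCACGGAC"))  # MREIPDWLTD
print(translate("GAAGCGGAAA CATGCCATTG A"))           # EAETCH
```

The two fragments reproduce the first ten and last six residues of the protein sequence.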