Gene Phep_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2471 
Symbol 
ID8253578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2865213 
End bp2866331 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content50% 
IMG OID644936121 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_003092737 
Protein GI255532365 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGTT TCAATGATAT TTTTAAAGCC TGCAGCTGGG AAGAAACCAG TAACAGCATT 
TACGCAAAGA CCAGTGCGGA TGTGGAACGG GCCCTGGCTT GGGGTACAGC TAAAAGAACG
CTGGAAGATT TTAAAGCACT GCTCTCTCCT GCTGCTGCAC CTTACCTGGA ACAGATGGCC
GAAATAAGCC GGCAGCTTAC CTTAAAACGC TTTGGGCGGG TGCTGCAAAT GTATGTGCCC
CTGTACCTTT CCAATGAATG CAACAACATC TGCACGTATT GCGGCTTTAG CTACGACAAT
AAAGTGAGGC GCAAGACCCT CTCTCCTATA GAGATCATGC AGGAAGTGGC GGCTATTAAA
GAAATGGGTT TCGATCATGT ATTGCTGGTT ACGGGCGAGG CCAGCCAGTC GGTACATACC
GCTTATTTTA AACAGGTGCT GGAACTGATC CGTCCGCATT TTGCGCAGAT CTCTATGGAA
GTTCAGCCTT TAGACCTGGC CGATTACGAA GAGCTACGAC CCTATGGTTT AAATACCGTG
CTGGTATATC AGGAAACCTA TCACCAGGAA GATTATAAAA AGCATCACCC CAGGGGTAAG
AAATCCAATT TCCTGTACCG GCTGGAAACG CCCGACCGGC TGGGCCAGGC AGGCATACAT
AAAATAGGCC TGGGGGTGTT GATTGGCCTG GAGGACTGGC GTACGGATTC ATTTTTTACG
GCTTTGCACC TGGATTACCT GGAAAAAACC TACTGGCAAA GCAAATACAG CATTTCATTT
CCGAGGTTGC GGCCTTTTAG CGGGGGGCTG GAGCCTAAGG TGGCGATGAG CGACAGGGAG
CTGGTGCAGC TGATCTGCGC TTACCGCTTG TTTAATGAGG AGGTTGAGCT GTCCATCTCG
ACCAGGGAAT CGCAGGTATT CAGGGACAAT ATCATTAAGC TGGGCATTAC TGCCATGAGT
GCAGGTTCTA AGACCAATCC CGGTGGCTAT GTGGTAGAAC CGGCCTCGCT GGAGCAGTTT
GAGATATCGG ACGAGCGCAG TGCAAAAGAA ATTGCGGCCA TGCTTGCACA GCAGGGCTAT
GAAGCCGTTT GGAAAGATTG GGACAACAGT TTGGTTTAA
 
Protein sequence
MGSFNDIFKA CSWEETSNSI YAKTSADVER ALAWGTAKRT LEDFKALLSP AAAPYLEQMA 
EISRQLTLKR FGRVLQMYVP LYLSNECNNI CTYCGFSYDN KVRRKTLSPI EIMQEVAAIK
EMGFDHVLLV TGEASQSVHT AYFKQVLELI RPHFAQISME VQPLDLADYE ELRPYGLNTV
LVYQETYHQE DYKKHHPRGK KSNFLYRLET PDRLGQAGIH KIGLGVLIGL EDWRTDSFFT
ALHLDYLEKT YWQSKYSISF PRLRPFSGGL EPKVAMSDRE LVQLICAYRL FNEEVELSIS
TRESQVFRDN IIKLGITAMS AGSKTNPGGY VVEPASLEQF EISDERSAKE IAAMLAQQGY
EAVWKDWDNS LV