Gene Phep_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1588 
Symbol 
ID8252690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1877335 
End bp1878459 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content48% 
IMG OID644935242 
ProductRadical SAM domain protein 
Protein accessionYP_003091863 
Protein GI255531491 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0606442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTG CCGAATTATT ACAAAGGGCT TTGCACTTCG ACTTTCTAAC CAAAGAAGAG 
GGTGTGTTTT TATATCATCA TGCAGCAACA GCTGAACTGG CTTATGTGGC CAATGAATTG
AGAAAAAAGC AGGTACCAAG TGGAAAAGTG ACCTGGCAGA TAGACAGGAA CGTCAATACC
ACCAATGTAT GTATTGCCAA CTGTAAATTC TGTAATTTCT TCAGGCGCCC GGGCCACGAG
GAAAGTTATA TTACCGATAT TGAAACCTAT AAGCAAAAAA TAGAAGAGAC ATTCAGGCTG
GGCGGCGACC AGCTGCTCTT ACAGGGTGGC CACCACCCTG AGCTGGGCCT GAAATTCTAT
GCAGACCTGT TTAAACAGCT GAAAGAATTG TATCCCGACC TGAAGCTGCA TGCGCTTGGT
CCGCCCGAAA TTGCGCATGT AGCCAAGCTG GAAGGGCTTT CACATACTGA AGTTTTAACT
GCCCTAAAAG CAGCTGGCAT GGATTCTTTG CCCGGGGCCG GTGCAGAAAT TTTGAACGAC
AGGGTGAGAA GGCTGATCTC TAAAGGAAAA TGCGGGGGTC AGGAATGGCT GGATGTCATG
CGGGCAGCGC ACCAGCTGCA CATTACCACT TCGGCAACCA TGATGTTTGG TCATGTAGAA
ACCATAGAAG AGCGTTTTGA GCACCTGGTA TGGATCAGGG AGGTACAAAG TGAAAAACCG
GCTGATGCCA AGGGTTTTCT GGCTTTTATT CCATGGCCTT TCCAGGATGA TGGTACCTTG
CTGAAACGTT TAAGGGGTAT CAGCAACAAT GTTTCGGGCG ATGAATACAT CAGGATGCTG
GCTTTAAGCC GCATCATGCT GCCCAACATC AAAAATATAC AGGCATCCTG GCTTACAGTG
GGCAAAAATG TGGCAGAACT TTGTCTGCAT GCCGGGGCAA ATGATTTTGG TTCCATTATG
ATTGAAGAAA ATGTGGTATC TGCTGCCGGT GCCCCGCATC GTTTTACAGC AAAAGGCATA
CAGGATGCGA TCAGGGAAGC CGGATTTGAG CCACAGTTAA GGGGGCAGCA GTACAACTAC
CGCGACCTGC CCGATCACCT GGAAGAGCAG GTGATCAATT ACTAA
 
Protein sequence
MNTAELLQRA LHFDFLTKEE GVFLYHHAAT AELAYVANEL RKKQVPSGKV TWQIDRNVNT 
TNVCIANCKF CNFFRRPGHE ESYITDIETY KQKIEETFRL GGDQLLLQGG HHPELGLKFY
ADLFKQLKEL YPDLKLHALG PPEIAHVAKL EGLSHTEVLT ALKAAGMDSL PGAGAEILND
RVRRLISKGK CGGQEWLDVM RAAHQLHITT SATMMFGHVE TIEERFEHLV WIREVQSEKP
ADAKGFLAFI PWPFQDDGTL LKRLRGISNN VSGDEYIRML ALSRIMLPNI KNIQASWLTV
GKNVAELCLH AGANDFGSIM IEENVVSAAG APHRFTAKGI QDAIREAGFE PQLRGQQYNY
RDLPDHLEEQ VINY