Gene Phep_4112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4112 
Symbol 
ID8255246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4966158 
End bp4969298 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content41% 
IMG OID644937776 
ProductTetratricopeptide domain protein 
Protein accessionYP_003094365 
Protein GI255533993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGACA CCAAGATTAA AGAAAAAGCG ATCATCAGAA AGGAACTTAT TGTACTTAAA 
ACGTATCCCT ATTCGGATCC GAGTCCCATT CCTGAGTTTG GACGCCTGTA TCCTTATAAC
CGTTTTGATG GATACACCAA TAAAAGCATA GAACAGACCT GGGAAATGAT TGTTATGGAA
AACAATCACA TCAAAATCTG GATCAATCCA GCTGTAGGGG GAAAAATATG GGGTGCCATA
GAAAAATCGA CAACAAGGGA ATTTATATAT TTTAACCATG CAGCAAAATT CAGAGATGTA
GCCATGAGAG GTCCCTGGAC TTCCGGTGGC ATGGAAATCA ATATGGGCAT CATCGGGCAT
ACTCCCTCCT GTTCGGCTCC GGTTGACTAC AAAACAATTG AGAATGAAGA TGGCAGTGTA
AGCTGCTTTA TCGGGGCCAC GGACTGGCCC TCCCGTACAG AATGGCGTGT AGAAATAAAA
CTGGGTAAAG ATGCTGCACA TTTTAGCACT AAAAGTTGGT GGCACAACAA TAGTTGTATG
TCCCAGTCCT ACTATCAATG GAACAATGTG GGTATTAAAA CCTCCGGAAA CCTGGAATAC
ATCTACCCGG GGCAGCATCG TTTAGGACAC GATGGAGTAC CACAGAGCTG GCCTGAGGAT
GACGAAAAAA GAAAAATTTC TTTCTACGAC CAGAATAATT ATGGTGAATA TAAATCCTAT
CATGTTTTTG GCGCCTATAC CGATTTTTGG GGGTGTTACT GGCATAATGA TCAGTTTGGC
ATGGGACATT CTGCTCCTTA TGACGAAAAA CCAGGAAAAA AGATATGGAT TTGGGGACTT
TCGCGTTATG GTATGATCTG GGAAGATCTT TTAACAGACG AGGATGGCCA GTACACAGAG
GTTCAAAGTG GTCGTTTATT TAATCAAAGT ATTGCTGCCA GCTCTAAAAC CCCGTTTAAA
CACAGGTCAT TTTTACCCTA TACTTACGAT ACCTGGGAAG AACATTGGTT TCCGGTTAAA
AACACTGGAG GATTAACCTA TGGCAATCAA CAACTTTCAT TTTATATAAC TGTACAAGAA
GGAAACCAAT TGATTAATAT CTGTGCCAAT GAAAGCCTGG ATGATCATTT TAAAGTTTTT
CACCACACTA AGGAAATCCT TTCAACACAG CTTAGAATGA AAGCAATGCA GAACTGCAGC
TTCGGACTTC CATACCCTGT AAAAGCAGCC GAATTACTGG TAACGTTAAA TGATTCGATA
ATATACAATG GCCCTGAACA GCATACAGTA TTAAAAAGAC CCACAAAACT GAGTAAGGAC
TACAATTTCG ATTCGGTACA AGCGCATTGC ATTCAGGCAA AGGAATGGGA AAGACAACGT
TTTTTTGACC GAGCAATAAC GCATTATCAG ATTTGCCTGG ACCTGGACCC CTTTTATACA
GAAGCGCTTA GCGGCCTGGC AGGTCTTTAT TTTAAACAAC TTAAATTTAC TGAAGCACTG
AACCTTTTAA GTATTGTATT ATCTGTTGAT ACGTATGACC CGGAGGCCAA TTATTTATAT
GGACTGGTAA ATGAAAGGTT ACGAAATACA GCAGACGCAA AGGATGGTTT TTCTATTGCC
AGTCAATCTG TCGAGTATAG GGTTGCATCA TTTATAGCTC TGGCAAAAAT GTTACTGCGT
GAAGGTCAGA TTGAAAAGGC ATATGCTTAT GTAAAAAAAG CAAGATTGTA TAGCCCGGAC
AATCTCCAAT CTTTATATTT AAGCATCATT ATAAACAATC TTAAAGGAAA TAAAAAACAG
TCGATGCTAC TGATCAAACA GCTTTTACAT ACAGACCCCA TTAATTACCT GGCAAAATTT
GAATTAAACA AAGCAGAAGG CATCCCGCTG GACAAAATTA CCGTAAGTGA ACTTCCTTAT
GAAACTTATA CAGAACTGGC TGCCTTCTAT TACAATGTAA ACCTGTATAG TGAAGCGCTT
AAACTATTGG AAGCAGCCCC TGATTACGCA ATGGTTTATC TTTGGAAAGC TTATCTATAT
TCGCTTTCCG ATTCGAAAAC TACTATTGCC AGTGCACTGG AAAAGGCAGT GGCAATGAGG
CCTGATTTTG TTTTTCCGCA TCGTGAAGAG GACATTACAG TATTGAACTG GGCAATTAGC
CAGCACAATG CCTGGCAGTT TAAATATTAT CTTGCACTTG CCCATATCCA GAACCTGCGC
AGAGAAGAAG CATTGAGTCT TTTAAACAGC TGTCAGCAGC TGCCCGACTT CTATCCTTTT
TATATTGTAC GGGCAAACTT AAAAGAGGAA TTGCAGAAGG ACGGCTGCCT TTCTGACCTA
AAAAAAGCCT TTCAGCTTGC ACCCGATGAA TGGAGGACAG TTCTAAACCT TTCCAATTAC
TACGCTGCAC ATGAGGACTG GGTGCAGGCA CTCAACATCA CCAGTAAAGG ATATAAAATG
TATCCCGACA ATTATTATTT AGGGCTGAAA CTTGCAAAAT GCTTTATGCA TACGCATAAA
TTTGAACAAG GGATTGCCCT GATGACCAAT ATGACCGTAT TGCCAAATGA GGGGGCCTCT
GAAGGCAGGA ACAGCTGGCG GGAAACTCAC CTGCTCTGTG CGTTTAACGC GCTTGAAGAT
AAGAACCAGG AAAAAGCGAC ACATCATATC AATATGGCTA GGACATGGCC TGAAAACATG
GGCATTGGCA AACCTCATCA TGTTGATGAG CGCCTCGAAG ACTACATGCA GCTTATCTGT
ATAGATCATG AAAATAAAGA AGAACGAAAG GCACTAACGG ATAAAATAAC AAGTTACAGA
ATGCACCATA AACTCAGTCC TTATGGTATA CTTGACTTCA TCAGCATTTT TCTTATGCAG
GAAATGGGTG ATGTAAAAGG TGCAGAAAGA ATCCTGGACA ACTGGTTAAA ACAAGATCCT
GATGCTTTAC CGCTAAAATG GAGCATCGCA TTTTTAAAAG GCAACCAACA GGAATTGGCT
GAACTTTCTC AACAGAAGGT TCCGGTTAAA GAGGTTCTGC CTTATGAAGT GCCATTTGAA
GACCGGTCCT TTCCTTTCGT AAAAAAATTA CACAGTATTG GATTATTTAA CAAGCATACA
CATTTAGCTA CAATGAATTA A
 
Protein sequence
MEDTKIKEKA IIRKELIVLK TYPYSDPSPI PEFGRLYPYN RFDGYTNKSI EQTWEMIVME 
NNHIKIWINP AVGGKIWGAI EKSTTREFIY FNHAAKFRDV AMRGPWTSGG MEINMGIIGH
TPSCSAPVDY KTIENEDGSV SCFIGATDWP SRTEWRVEIK LGKDAAHFST KSWWHNNSCM
SQSYYQWNNV GIKTSGNLEY IYPGQHRLGH DGVPQSWPED DEKRKISFYD QNNYGEYKSY
HVFGAYTDFW GCYWHNDQFG MGHSAPYDEK PGKKIWIWGL SRYGMIWEDL LTDEDGQYTE
VQSGRLFNQS IAASSKTPFK HRSFLPYTYD TWEEHWFPVK NTGGLTYGNQ QLSFYITVQE
GNQLINICAN ESLDDHFKVF HHTKEILSTQ LRMKAMQNCS FGLPYPVKAA ELLVTLNDSI
IYNGPEQHTV LKRPTKLSKD YNFDSVQAHC IQAKEWERQR FFDRAITHYQ ICLDLDPFYT
EALSGLAGLY FKQLKFTEAL NLLSIVLSVD TYDPEANYLY GLVNERLRNT ADAKDGFSIA
SQSVEYRVAS FIALAKMLLR EGQIEKAYAY VKKARLYSPD NLQSLYLSII INNLKGNKKQ
SMLLIKQLLH TDPINYLAKF ELNKAEGIPL DKITVSELPY ETYTELAAFY YNVNLYSEAL
KLLEAAPDYA MVYLWKAYLY SLSDSKTTIA SALEKAVAMR PDFVFPHREE DITVLNWAIS
QHNAWQFKYY LALAHIQNLR REEALSLLNS CQQLPDFYPF YIVRANLKEE LQKDGCLSDL
KKAFQLAPDE WRTVLNLSNY YAAHEDWVQA LNITSKGYKM YPDNYYLGLK LAKCFMHTHK
FEQGIALMTN MTVLPNEGAS EGRNSWRETH LLCAFNALED KNQEKATHHI NMARTWPENM
GIGKPHHVDE RLEDYMQLIC IDHENKEERK ALTDKITSYR MHHKLSPYGI LDFISIFLMQ
EMGDVKGAER ILDNWLKQDP DALPLKWSIA FLKGNQQELA ELSQQKVPVK EVLPYEVPFE
DRSFPFVKKL HSIGLFNKHT HLATMN