Gene Information |
Locus tag | Phep_4112 |
Symbol | |
ID | 8255246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4966158 |
End bp | 4969298 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644937776 |
Product | Tetratricopeptide domain protein |
Protein accession | YP_003094365 |
Protein GI | 255533993 |
COG category | |
COG ID | |
TIGRFAM ID | |
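The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4966158
end_bp = 4969298
gene_length_bp = 3141
protein_length_aa = 1046


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under those assumptions, the 3141 bp span corresponds to 1047 codons, one of which is the terminal stop, which agrees with the listed protein length of 1046 aa.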
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGACA CCAAGATTAA AGAAAAAGCG ATCATCAGAA AGGAACTTAT TGTACTTAAA ACGTATCCCT ATTCGGATCC GAGTCCCATT CCTGAGTTTG GACGCCTGTA TCCTTATAAC CGTTTTGATG GATACACCAA TAAAAGCATA GAACAGACCT GGGAAATGAT TGTTATGGAA AACAATCACA TCAAAATCTG GATCAATCCA GCTGTAGGGG GAAAAATATG GGGTGCCATA GAAAAATCGA CAACAAGGGA ATTTATATAT TTTAACCATG CAGCAAAATT CAGAGATGTA GCCATGAGAG GTCCCTGGAC TTCCGGTGGC ATGGAAATCA ATATGGGCAT CATCGGGCAT ACTCCCTCCT GTTCGGCTCC GGTTGACTAC AAAACAATTG AGAATGAAGA TGGCAGTGTA AGCTGCTTTA TCGGGGCCAC GGACTGGCCC TCCCGTACAG AATGGCGTGT AGAAATAAAA CTGGGTAAAG ATGCTGCACA TTTTAGCACT AAAAGTTGGT GGCACAACAA TAGTTGTATG TCCCAGTCCT ACTATCAATG GAACAATGTG GGTATTAAAA CCTCCGGAAA CCTGGAATAC ATCTACCCGG GGCAGCATCG TTTAGGACAC GATGGAGTAC CACAGAGCTG GCCTGAGGAT GACGAAAAAA GAAAAATTTC TTTCTACGAC CAGAATAATT ATGGTGAATA TAAATCCTAT CATGTTTTTG GCGCCTATAC CGATTTTTGG GGGTGTTACT GGCATAATGA TCAGTTTGGC ATGGGACATT CTGCTCCTTA TGACGAAAAA CCAGGAAAAA AGATATGGAT TTGGGGACTT TCGCGTTATG GTATGATCTG GGAAGATCTT TTAACAGACG AGGATGGCCA GTACACAGAG GTTCAAAGTG GTCGTTTATT TAATCAAAGT ATTGCTGCCA GCTCTAAAAC CCCGTTTAAA CACAGGTCAT TTTTACCCTA TACTTACGAT ACCTGGGAAG AACATTGGTT TCCGGTTAAA AACACTGGAG GATTAACCTA TGGCAATCAA CAACTTTCAT TTTATATAAC TGTACAAGAA GGAAACCAAT TGATTAATAT CTGTGCCAAT GAAAGCCTGG ATGATCATTT TAAAGTTTTT CACCACACTA AGGAAATCCT TTCAACACAG CTTAGAATGA AAGCAATGCA GAACTGCAGC TTCGGACTTC CATACCCTGT AAAAGCAGCC GAATTACTGG TAACGTTAAA TGATTCGATA ATATACAATG GCCCTGAACA GCATACAGTA TTAAAAAGAC CCACAAAACT GAGTAAGGAC TACAATTTCG ATTCGGTACA AGCGCATTGC ATTCAGGCAA AGGAATGGGA AAGACAACGT TTTTTTGACC GAGCAATAAC GCATTATCAG ATTTGCCTGG ACCTGGACCC CTTTTATACA GAAGCGCTTA GCGGCCTGGC AGGTCTTTAT TTTAAACAAC TTAAATTTAC TGAAGCACTG AACCTTTTAA GTATTGTATT ATCTGTTGAT ACGTATGACC CGGAGGCCAA TTATTTATAT GGACTGGTAA ATGAAAGGTT ACGAAATACA GCAGACGCAA AGGATGGTTT TTCTATTGCC AGTCAATCTG TCGAGTATAG GGTTGCATCA TTTATAGCTC TGGCAAAAAT GTTACTGCGT GAAGGTCAGA TTGAAAAGGC ATATGCTTAT GTAAAAAAAG CAAGATTGTA TAGCCCGGAC AATCTCCAAT CTTTATATTT AAGCATCATT ATAAACAATC TTAAAGGAAA TAAAAAACAG 
TCGATGCTAC TGATCAAACA GCTTTTACAT ACAGACCCCA TTAATTACCT GGCAAAATTT GAATTAAACA AAGCAGAAGG CATCCCGCTG GACAAAATTA CCGTAAGTGA ACTTCCTTAT GAAACTTATA CAGAACTGGC TGCCTTCTAT TACAATGTAA ACCTGTATAG TGAAGCGCTT AAACTATTGG AAGCAGCCCC TGATTACGCA ATGGTTTATC TTTGGAAAGC TTATCTATAT TCGCTTTCCG ATTCGAAAAC TACTATTGCC AGTGCACTGG AAAAGGCAGT GGCAATGAGG CCTGATTTTG TTTTTCCGCA TCGTGAAGAG GACATTACAG TATTGAACTG GGCAATTAGC CAGCACAATG CCTGGCAGTT TAAATATTAT CTTGCACTTG CCCATATCCA GAACCTGCGC AGAGAAGAAG CATTGAGTCT TTTAAACAGC TGTCAGCAGC TGCCCGACTT CTATCCTTTT TATATTGTAC GGGCAAACTT AAAAGAGGAA TTGCAGAAGG ACGGCTGCCT TTCTGACCTA AAAAAAGCCT TTCAGCTTGC ACCCGATGAA TGGAGGACAG TTCTAAACCT TTCCAATTAC TACGCTGCAC ATGAGGACTG GGTGCAGGCA CTCAACATCA CCAGTAAAGG ATATAAAATG TATCCCGACA ATTATTATTT AGGGCTGAAA CTTGCAAAAT GCTTTATGCA TACGCATAAA TTTGAACAAG GGATTGCCCT GATGACCAAT ATGACCGTAT TGCCAAATGA GGGGGCCTCT GAAGGCAGGA ACAGCTGGCG GGAAACTCAC CTGCTCTGTG CGTTTAACGC GCTTGAAGAT AAGAACCAGG AAAAAGCGAC ACATCATATC AATATGGCTA GGACATGGCC TGAAAACATG GGCATTGGCA AACCTCATCA TGTTGATGAG CGCCTCGAAG ACTACATGCA GCTTATCTGT ATAGATCATG AAAATAAAGA AGAACGAAAG GCACTAACGG ATAAAATAAC AAGTTACAGA ATGCACCATA AACTCAGTCC TTATGGTATA CTTGACTTCA TCAGCATTTT TCTTATGCAG GAAATGGGTG ATGTAAAAGG TGCAGAAAGA ATCCTGGACA ACTGGTTAAA ACAAGATCCT GATGCTTTAC CGCTAAAATG GAGCATCGCA TTTTTAAAAG GCAACCAACA GGAATTGGCT GAACTTTCTC AACAGAAGGT TCCGGTTAAA GAGGTTCTGC CTTATGAAGT GCCATTTGAA GACCGGTCCT TTCCTTTCGT AAAAAAATTA CACAGTATTG GATTATTTAA CAAGCATACA CATTTAGCTA CAATGAATTA A
|
Protein sequence | MEDTKIKEKA IIRKELIVLK TYPYSDPSPI PEFGRLYPYN RFDGYTNKSI EQTWEMIVME NNHIKIWINP AVGGKIWGAI EKSTTREFIY FNHAAKFRDV AMRGPWTSGG MEINMGIIGH TPSCSAPVDY KTIENEDGSV SCFIGATDWP SRTEWRVEIK LGKDAAHFST KSWWHNNSCM SQSYYQWNNV GIKTSGNLEY IYPGQHRLGH DGVPQSWPED DEKRKISFYD QNNYGEYKSY HVFGAYTDFW GCYWHNDQFG MGHSAPYDEK PGKKIWIWGL SRYGMIWEDL LTDEDGQYTE VQSGRLFNQS IAASSKTPFK HRSFLPYTYD TWEEHWFPVK NTGGLTYGNQ QLSFYITVQE GNQLINICAN ESLDDHFKVF HHTKEILSTQ LRMKAMQNCS FGLPYPVKAA ELLVTLNDSI IYNGPEQHTV LKRPTKLSKD YNFDSVQAHC IQAKEWERQR FFDRAITHYQ ICLDLDPFYT EALSGLAGLY FKQLKFTEAL NLLSIVLSVD TYDPEANYLY GLVNERLRNT ADAKDGFSIA SQSVEYRVAS FIALAKMLLR EGQIEKAYAY VKKARLYSPD NLQSLYLSII INNLKGNKKQ SMLLIKQLLH TDPINYLAKF ELNKAEGIPL DKITVSELPY ETYTELAAFY YNVNLYSEAL KLLEAAPDYA MVYLWKAYLY SLSDSKTTIA SALEKAVAMR PDFVFPHREE DITVLNWAIS QHNAWQFKYY LALAHIQNLR REEALSLLNS CQQLPDFYPF YIVRANLKEE LQKDGCLSDL KKAFQLAPDE WRTVLNLSNY YAAHEDWVQA LNITSKGYKM YPDNYYLGLK LAKCFMHTHK FEQGIALMTN MTVLPNEGAS EGRNSWRETH LLCAFNALED KNQEKATHHI NMARTWPENM GIGKPHHVDE RLEDYMQLIC IDHENKEERK ALTDKITSYR MHHKLSPYGI LDFISIFLMQ EMGDVKGAER ILDNWLKQDP DALPLKWSIA FLKGNQQELA ELSQQKVPVK EVLPYEVPFE DRSFPFVKKL HSIGLFNKHT HLATMN