Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2034 |
Symbol | |
ID | 8253138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2347192 |
End bp | 2353722 |
Gene Length | 6531 bp |
Protein Length | 2176 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644935682 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003092301 |
Protein GI | 255531929 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00320121 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTAATA AAGGAACAAT AACGGATTCA TTTTCAGGGC AAGCCAAAAT TTTCCCTGAA AATATTGCGC TGGATTTTAA TAACAATCAG CTTACCTACA AAGAACTAGA CGAAAGGTCG AATCAGCTGG CCCATTATTT AAGGGCGCAG GGCGTAAAAC AAGAAACACT TGTTCCCATC TGCATTAACC GCTCTATAGA GATGATTGTA GGCATACTTG GCATTATTAA AGCTGGCGGT GCTTATGTTC CCATTGACCC TCAATATCCA GCTGCAAGAA TTGGGTACAT CCTGGACGAA ATTAAAACTG AATTCACTGT CAGTAATCGT GAAAGTTGCA AAAATTTGCC GGGAAAGACA ATTTTGCTAG ACGAAGAATG GGATTTAATC AGCAAAGAGC CGTTTGTTGC CACTACCGTA GCTACCAAAG ACACGGATTT GATATATGTA ATCTATACGT CCGGATCAAC GGGTAAGCCC AAAGGTGTTA TGGTTGAACA CCGGTCATTA TCTACTTACA TCCGCACCCA GACAACTTAT TTTCAAATCA ACCCCGCAGA TAGGATACTA CAGTTTTCGA GCTATTGCTT TGATGCCTCA GTTGAGCAAA TATTCCTGGC CTTATTAAAT GGGGCTACAC TGATATTGGT TGAAGGGGCA TTGCTCAATG ATATGGATGC GTTTTCTTTA TTCCTTAAAA ACAAAAAAGT AACGCACTTA CACAGCACAC CGGGTTTTCT GGAAAATCTG AACCCGGATG GTTATCCAAA TTTAAAAAGA GTGGTAGCCG GTGGTGACCT GTGTAAAAAG TCATTATTTA AAAAATGGAT AGGAAAGGTT GACTTTTATA ACAAGTACGG CCCGACAGAA TGTACAATAA CCGCAGCTGA ATACCACTGT ACATCAGATG ATGCAGAACG AGTTAAGGTC CTTCCCATAG GACGTCCCCT CGAAAATGTG GATATCTACA TCCTTGATGA GTTCATGAAT CCTGTTCCTG AATCGACCGG AGAAATACAT ATTGGTGGCA TACAGGTATC CAGGGGGTAT TTTAACCGCC CGGAACTAAC TAAAAAACAG TTTATTGACA GTCCGTTTAA AGCCGGAGAA CGATTGTACA AGACAGGTGA TCTTGGCAGA TGGTTGCCTG ACGGAAATAT TGAGTTTCAG GGGAGAGCTG ATGATCAGGT AAAAATAAGA GGTTACAGAA TTGAACTTGC TGAAGTGGAA AATGCACTTA GCGAATGTGG TGACGTAAAA CAATGTGTTG TAATTACCAG AACGGATCAA CAAAATGGCA AAAGTTTAAT TGCCTATGTC ATACCTCATG GCGATTTCAA CAGAACAGCG ATTGCGGATT ACATGAAAAA GAAACTGCCG GAATATATGA TCCCTCAGTT GATTGTTTCG TTGGATACCA TTCCACTTAC CCCGAACGGA AAGGTGGATA AAAAAGCGCT GCCAGATGTA GATGCAGGAG CCTTTTTGAG TAGTATATAT GTTGCGCCGA AACACCCTAC TGAACAAAAG ATTGCAGCAG TATGGCAATC TCTGTTAAAT GTAAAACGGG TAGGCATTAA AGATAATTTT TTTGAATTGG GAGGAAATTC CCTGCTGGCT CAAAAATTAG TGGCACTTTT GAAAGAGCTG GAACTGTTGT TGCCTGTAAC CAGGCTGTAC CAACACCCTA CGGTTGAAGG TATTGCAGCT TTTATAGACG GACGCAGTTT CAAAACGGCC CCTAAAAAAG AATTGAAAGC TATAAGTGGA AATGATATTG CGGTAGTCGG GATGGCCGGG AGGTTTCCAG GAGCAGATAC CGTTGATGAA TTCTGGCAGA ATTTAAAGGA AGGAAAGGAA ACTACACATT TCTTTACAGA TGAAGAACTG GACCCATCTA TACCGGATGT GCTGAAAAAA AGCCCCAATT ATGTTAAGGC AAGAGGAATA ATAAATGATC CCGCTGGTTT TGATGCTGTT TTTTTTGGTA TTAATCCTAA ACTTGCAGAA TTGATGGATC CTCAACATCG TCTCTTTCTA GAAATTTCCT GGGAGGCATT GGAACACAGT GGCCATGTAC CTCAGAAATA TGATGGCACT ATTGGTGTTT TTGCAGGATG CAGGTTTAAC ACCTATTATT CGAACAATGT AATTTCCAAC ACAGCACTTA TTGAAAATGC AGGAGCTTTT CAAGTAAGCA CTGTAAGTGA TAAGGATTAC ATTGCAAGCC GTACCGCTTA TGCACTGGAT TTAAAAGGAC CAGCTGTAAA TGTTCAGTCA GCCTGTTCCA CTTCGCTGCT GGCGATAGCA CAGGCAGTAG AAAGTATCAG AAACGGGCAC TGCGATGTAG CTCTGGCGGG TGGCTCTTCT ATGCTCGTAC CTGTAAATAG CGGACACCTT TATGAAGATG GTGCAATGTT GAGTAGCGAT GGACATTGCC GGGCTTTCGA TGCAGATGCA AAGGGTACCG TATTTAGCGA TGGTGCAGGT GTAATTGTGT TAAAGAACAA GGCTAAAGCC GAACTGGATG GTGATACCAT TTATGCAGTA ATTAAAGGCA TTGGCTTAAG TAACGACGGC GGCGGAAAAG GTAGTTTCAC AGCACCCAGT GCAGAAGGCC AGGCTGCTGC GATCAGTATG GCTATAGCAG ATGCCGGGGT TTCTGCAGCA GATATCTCTT ACATTGAAGC GCATGGCACA GCCACACCTT TGGGTGATCC AATTGAGATT GAGGGCCTGA ATATGGCTTT TGCTGAGCAG GAAAAAAAAC AATTTTGCGC CATTGGCTCT GTTAAAAGCA ATTTCGGCCA TTTAACCGGT GCGTCGGGTG TTGCAGGTAT GATCAAGACG GTGTTTTCAT TGTACTACAA ACAACTCCCT CCTTCAATCA ATTACAAAAC GCCCAATCCT CATATCGATT TCGCAAACAG TCCTTTTTTT GTCAATGATA TACTACGCGA TTGGAATCCA GGTCGTAAAC GGATTGCTGG CGTTAGTTCT TTTGGCGTTG GCGGCACGAA TGTACATATA GTATTGGAAG AGGCGGAAAA TCCGGTTAAA CCAGACAATA AAAGCATCCG TCCTGTCCAA TTGCTGTGCT GGTCGGCAAA GGCTGAAAAT AGCATCCATG GTTATGGTTT AAAATTAGCC GACTACCTCA ATAAACAGGG CTTAAACCTG GCAGATGTCG CATATACCCT ACATAGTTCA AGATCAGATT TTAATCACAG AAGATTTGTC GTAGCTTCAT CTGCAGCAGA CTTTGCAGAA AAAATAGCCA ATGAGCCGCT ACTCGCTGCA AACACCAGGA TTCTTAAAGA ATATCCACAA GAGCTTGTAT TCATGTTTCC GGGACAGGGA GCCCAGTTTC TGAATATGGG TAAGGACCTG TATGTTGCTG AACCCGTATT CAGACAAGCT ATGGATGAAT GTGCTGCGCT GCTTGCTGAA GTGATGCAGG AGAACATTTT GGATGTAATC TATCCTGAAA ATGTAAATGA AGCCGCAGAA AACAGACTTA AAAATACCCG ATATAGTCAG CCCGCCCTAT TTACTATTGG ATATGCCTTA GGCAAACTCT GGATGAGCTG GGGTATATTC CCCACTGCGT TTATCGGGCA TAGTATAGGC GAATTCGTAG CTGCCTATTT TTCTGGGATA CTGTCTTTAC CAGATGCCTT GAAACTAATT GCCTCCCGTG GCAAGATGAT GAGTGGTTTA CCGGAAGGCA GTATGTTGTC TATACGTTCT GATGTGGAAA CAGTAAAAAC ACTACTTTCT GATGAGATAG CCCTGGCTGC AGTAAACAGT CCAAATTTAA GTGTTGTTGC AGGCACTACC ACAGCAATTG CGGACCTGGT TACTCAACTG GATGAAATGG GGATACTTAA TCGCTTATTG CCAACCAGCC ATGCGTTTCA TTCACACATG ATGGATCCCG TTATTGAACC GTTCAATGCG CTTGTGGAAA CCATAACGCT AAATGAACCG CTTATTCCAA TTGTTTCAAC CGTAACAGGT GAATGGCTAA CTAATGAAGA GGCCACCAAC CCGACTTACT GGGCTAAGCA TTTAAGATCA ACCGTTAATT TCGGAGCAGC TGCGCAAAAA CTTTTAACTG AGGGCTATAA TTTATTTGTG GAAATTGGCC CTGGCAATTC AGCTGCTACC TTAACCCGTC AGCAGGCTGC CGGTAAGCCG ATTGCAGTTA TTGCCTCGCT GGAACAAGGA GAAAATGAGC AGTCTTCCTA TAATTCAGTA TTAAAGGCAT TAGGTCAGCT CTGGCTGAAC GGTGTAGAGC CAAATTGGAC CTCGTTTTAT AAAAACGAAG ACAGAACAGT GATAAATGGA CTGCCAACCT ATTTTTTCAA TAAAAAACAA TACTGGGTAA ACCCGGTTTT ACCTGTTCAA TTTACAAACC TATTGTTACC TGCTGAACCG ATTACTGTCC AGCAGGATAT TAAATCATCC CCAATAATGA GAAAGCAAGT TTTAATTGAC AAAATTAAAG AGCTATTGGA AAATGCATCA GGCATAGACA TGAGCAACAT CACCCCTGAA ATGACCTTTA TTGAGATGGG GTTAGATTCC TTGTTGCTCA CACAGGTTGC CCTGAATTTA AAAAAGCAAT TTGCATTGCC GATCACGTTC AGACAACTGA ATGAGGAATA CAGCACTACG GGACTTTTAG CAGACTATTT GGCAACCAAA TTACCTGCTG AGGCAATGCC GGCGAATCCT GGCCCGCCAA CTACGGCACA GCCGGTTTAC TTAAATGGCA GTCAAGCGGG TCCGGTCAAC CATACGGCAT TAGACCTGAT CAGTCAGCAA TTGCAGCTAT TGGCAAAGCA GGTTTCAATT TTGCAAGGCA GCCAGATGCC ATCGGTAAAT GAACAGTATA ACAACCATCA GGTACAGCAG CCAATAGTAA AGTTTAACGC AAATACTCAG CTAACCGCTG AAGAAAGCCT GGAAATTAAA AAACCATTTG GAGCGGCTGC AAAAATTGAA AGACAATCAG CAGCGCTGAA TGAAGTGCAG CAAAATTACC TGAACGATTT AATTGTACGT TACAACAATA AAACAAAAGG CAGCAAAGCC TATACACAAC AGCACCGGTC GTACATGGCC GATCCGAGAG TGGTTTCAGG ATTCAGGCCA GCAACAAAAG ACCTGGTCTA TTCAATTGTA ATCAACAAAT CCAAAGGAAG TCGTTTATGG GATATTGATG ATAACGAATA TATCGATGCC TTGAACGGTT TTGGCTCAAA TATGCTTGGT TATCAGCCAG ACATCATTAA ACATGCTTTA ATTGATCAGA TTGAAAAGGG TTATGAAATC GGACCTCAGC ATGAATTATC CGGAGAGGTA TCCAAACTCA TCTGTGAATT CACCAGCTTT GATCGGGCAG CACTGTGCAA CACTGGTTCA GAAGCTGTAT TAGGTGCAAT GCGAATAGCT CGGACCGTTA CAGGGCGATC AATAATCGTT GCTTTTACGG GCTCTTATCA TGGCATTGCA GATGAGGTGA TCGTTAGGGG AAGTAAAAAG TTAAAAACTT TTCCTGCTGC TCCGGGCATC ATGCCTGAAG CGGTTCAAAA TATGCTTATC CTGGATTATG GTACAGAAGA ATCTTTGCAG ATCATTGAAG AACGTGCACA TGAAATTGCT GCAGTATTGG TTGAACCGAT ACAAAGCAGA CGAGCTGATT TTCAACCAAT TGAATTTCTT AATAAATTGA GAAAAATTAC TGAGGATGCT GAAACCGTCT TAATTTTTGA TGAGGTTATT TCTGGCTTTC GCTTTCATCC CGGAGGAGTT CAGGCACTCT TTGGCATTAA AGCTGATATT GGCACCTACG GTAAGGTAGC GGGTGGCGGC ATTTCCATAG GAATCATTGC AGGTAAGAAA AAATATATGG ATGCATTGGA TGGGGGGTTC TGGCAATATG GTGATGACTC AATTCCAGAG GTTGGCGTTA CGTATTTTGC AGGAACATTT GTAAGGCATC CATTGGCATT GGCTACTGCC AAAGCCTCGC TAACTTACCT GAAAGAAGCA GGTCCGGCGT TGCAGGAAAA CCTCAATGCA AACGGCTTAT ATATTGCCGA TGCAATTAAT AAAATATGCA GGAAATTAAA TGTTCCGATG TACATCGCAC AGTATGGTTC ATTATGGAGG ATTAAGTTTA TTGAGGAATA TCCTTATAAT GAACTGTTCT TTACATTAAT GCGTTACAAA GGGATTCATA TACTCGAGGG ATTTGCCTGC TTCCTTACCA CCGCACATAC TGCAGAAGAT ATCCAAACGA TCATCAGGTG CTTCGAAGAA AGTTTAATGG AACTTAAAGC CGTAGGTTTA ATTCCCGAAT ATCAGCACCA GATAGCTGAA ACCGACAATG AAATAGCCAA TTATAATAAC TTAAATATCC CCCCCTTCCC AGGTGCAAAA TTGGGTAAAG ATAAAGATGG CAATCCTGCC TGGTTTATCA AGGATGAGAA AAATCCAGGA GAATTTCTTC AGGTGATATA A
|
Protein sequence | MLNKGTITDS FSGQAKIFPE NIALDFNNNQ LTYKELDERS NQLAHYLRAQ GVKQETLVPI CINRSIEMIV GILGIIKAGG AYVPIDPQYP AARIGYILDE IKTEFTVSNR ESCKNLPGKT ILLDEEWDLI SKEPFVATTV ATKDTDLIYV IYTSGSTGKP KGVMVEHRSL STYIRTQTTY FQINPADRIL QFSSYCFDAS VEQIFLALLN GATLILVEGA LLNDMDAFSL FLKNKKVTHL HSTPGFLENL NPDGYPNLKR VVAGGDLCKK SLFKKWIGKV DFYNKYGPTE CTITAAEYHC TSDDAERVKV LPIGRPLENV DIYILDEFMN PVPESTGEIH IGGIQVSRGY FNRPELTKKQ FIDSPFKAGE RLYKTGDLGR WLPDGNIEFQ GRADDQVKIR GYRIELAEVE NALSECGDVK QCVVITRTDQ QNGKSLIAYV IPHGDFNRTA IADYMKKKLP EYMIPQLIVS LDTIPLTPNG KVDKKALPDV DAGAFLSSIY VAPKHPTEQK IAAVWQSLLN VKRVGIKDNF FELGGNSLLA QKLVALLKEL ELLLPVTRLY QHPTVEGIAA FIDGRSFKTA PKKELKAISG NDIAVVGMAG RFPGADTVDE FWQNLKEGKE TTHFFTDEEL DPSIPDVLKK SPNYVKARGI INDPAGFDAV FFGINPKLAE LMDPQHRLFL EISWEALEHS GHVPQKYDGT IGVFAGCRFN TYYSNNVISN TALIENAGAF QVSTVSDKDY IASRTAYALD LKGPAVNVQS ACSTSLLAIA QAVESIRNGH CDVALAGGSS MLVPVNSGHL YEDGAMLSSD GHCRAFDADA KGTVFSDGAG VIVLKNKAKA ELDGDTIYAV IKGIGLSNDG GGKGSFTAPS AEGQAAAISM AIADAGVSAA DISYIEAHGT ATPLGDPIEI EGLNMAFAEQ EKKQFCAIGS VKSNFGHLTG ASGVAGMIKT VFSLYYKQLP PSINYKTPNP HIDFANSPFF VNDILRDWNP GRKRIAGVSS FGVGGTNVHI VLEEAENPVK PDNKSIRPVQ LLCWSAKAEN SIHGYGLKLA DYLNKQGLNL ADVAYTLHSS RSDFNHRRFV VASSAADFAE KIANEPLLAA NTRILKEYPQ ELVFMFPGQG AQFLNMGKDL YVAEPVFRQA MDECAALLAE VMQENILDVI YPENVNEAAE NRLKNTRYSQ PALFTIGYAL GKLWMSWGIF PTAFIGHSIG EFVAAYFSGI LSLPDALKLI ASRGKMMSGL PEGSMLSIRS DVETVKTLLS DEIALAAVNS PNLSVVAGTT TAIADLVTQL DEMGILNRLL PTSHAFHSHM MDPVIEPFNA LVETITLNEP LIPIVSTVTG EWLTNEEATN PTYWAKHLRS TVNFGAAAQK LLTEGYNLFV EIGPGNSAAT LTRQQAAGKP IAVIASLEQG ENEQSSYNSV LKALGQLWLN GVEPNWTSFY KNEDRTVING LPTYFFNKKQ YWVNPVLPVQ FTNLLLPAEP ITVQQDIKSS PIMRKQVLID KIKELLENAS GIDMSNITPE MTFIEMGLDS LLLTQVALNL KKQFALPITF RQLNEEYSTT GLLADYLATK LPAEAMPANP GPPTTAQPVY LNGSQAGPVN HTALDLISQQ LQLLAKQVSI LQGSQMPSVN EQYNNHQVQQ PIVKFNANTQ LTAEESLEIK KPFGAAAKIE RQSAALNEVQ QNYLNDLIVR YNNKTKGSKA YTQQHRSYMA DPRVVSGFRP ATKDLVYSIV INKSKGSRLW DIDDNEYIDA LNGFGSNMLG YQPDIIKHAL IDQIEKGYEI GPQHELSGEV SKLICEFTSF DRAALCNTGS EAVLGAMRIA RTVTGRSIIV AFTGSYHGIA DEVIVRGSKK LKTFPAAPGI MPEAVQNMLI LDYGTEESLQ IIEERAHEIA AVLVEPIQSR RADFQPIEFL NKLRKITEDA ETVLIFDEVI SGFRFHPGGV QALFGIKADI GTYGKVAGGG ISIGIIAGKK KYMDALDGGF WQYGDDSIPE VGVTYFAGTF VRHPLALATA KASLTYLKEA GPALQENLNA NGLYIADAIN KICRKLNVPM YIAQYGSLWR IKFIEEYPYN ELFFTLMRYK GIHILEGFAC FLTTAHTAED IQTIIRCFEE SLMELKAVGL IPEYQHQIAE TDNEIANYNN LNIPPFPGAK LGKDKDGNPA WFIKDEKNPG EFLQVI
|
| |