Gene Information |
Locus tag | Phep_1702 |
Symbol | |
ID | 8252804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2014770 |
End bp | 2017403 |
Gene Length | 2634 bp |
Protein Length | 877 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644935354 |
Product | DNA ligase D |
Protein accession | YP_003091975 |
Protein GI | 255531603 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02777] DNA ligase D, 3'-phosphoesterase domain; [TIGR02778] DNA polymerase LigD, polymerase domain; [TIGR02779] DNA polymerase LigD, ligase domain |
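
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2014770
end_bp = 2017403
gene_length_bp = 2634
protein_length_aa = 877


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2017403 - 2014770 + 1 gives 2634 bp, and 2634 / 3 - 1 gives 877 aa, matching the table.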
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.170827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00786599 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGACTTG ACAAGTATAA CCATAAACGC GATTTTAACA AAACTGCTGA GCCAAAAGCT GGCAAATCTA AAGATGCTGA TCAATTACAT TTTGTAATCC AGAAACATGA CGCTTCCCAC CTTCATTACG ATTTCAGACT GGAAATGGAA GGCGTGTTAA AAAGCTGGGC AGTACCAAAA GGACCGTCAA CAGACCCTAA GGTGAAACGT CTGGCCATGA TGGTGGAGGA TCATCCGTAC GACTATAAAG ATTTTGAGGG CATAATTCCT AAAGGAGAAT ATGGTGGTGG AACAGTGATT GTTTGGGATG AGGGTACTTA TGAACCTATT GAAGAAATTA AAGGAAAAAA AGCTCAGGAT AAGTACTTGC GGAAACAGCT GAAGGAAGGA TCGCTGAAAA TAAAACTGCA GGGCAAAAAA CTAAAAGGGG AATTTGCCCT GGTTAAAACC CGCGGCATGG GCGAGAATGG ATGGTTGCTG ATCAAGCATA AGGATGATTA TGCAGCTACG GCAGATATTA CTAAACAGGA TAAATCAGTG CTATCCGGCA AAACCATTGC TGCCATGGAA CAGACCAGTG ATAAAGTATG GCAGCATGGT CATGAGGAAA AACTGGAAAA AGCACCCAAA AAAAAGATAC CCATAGGCAT GAAACCCATG CTGGCAACGC TGGTTGACGA GCCATTTGAT GATCCGGACT GGGTTTATGA GGTGAAATGG GACGGCTACA GGGCGCTCGG CTTCAGCAGG AAAAACGGTG ATGTACAATT GCTTTCACGC AATAACAAAC CATTCAATGA AAAATTTTAC CCCATTTACG AGGTTATGCA AAACTGGAAA ATTGATGCCG TTGTAGATGG GGAAATACTC GTGCTGAATG ACAAAGGAGT CTCCAACTTT GCTGAACTGC AAAACTGGCG CAGCGAAGCT GATGGCGAAC TGGTGTATTA CGTTTTTGAC CTGATCTGGT ATGATGGAAA AGACCTTACC GGCTTAAGCT TATTGCAGCG GCAGGCAATT TTAAAAAGCA TATTGCCCCA GGACGATGAC CGCATTCGCC TGAGTAAGGT ATTTACCGCG GAAGGAACCG AGTTTTTTGA GGCTGCCAAA AAAATGGGAT TGGAAGGCAT TATTGCCAAA AAAGGAAGCA GTACCTATAC ACCGGGCAAC CGAAGTACCG ACTGGCTGAA AATTAAAATT AACAAAAGGC AAGAGGTGGT GATAGCGGGT TTTACCAAAA ATAAGGATAG TTCAAAACAA TTCAGTTCTT TACTGCTGGG TGTTTATGAA AGGGGGCATC TGCAGTATGT AGGTAAGGTA GGTACGGGCT TTTCTGATCA GTTGCAAAAG GAAATGATGA AGCAGTTTGA ACCATTAATG ATTACTAAAA GTCCTTTTGA TGAAATTCCG GATGTGAACA AGCCTTCCCG TTTTCGTCCG AACCCACCTC AGGCAAAGGC AACCTGGTTA AAACCTGAAC TGGTTTGTGA AGTGGCTTAT GCAGAAGTCA CTTCAGATGG GGTATTCAGA CATCCTTCTT TTCAGGGCAT GCGTATAGAT AAGAAAGCAA AAGACGTAAT AAGGGAGGCT GCAGTAGAAA CGAAAGCTGT TGTAAATTTA GATATGGAGA CAGATGACAT GGCTAAAACA AATAAACATG CAAAGGCGAT TAAGCCGCCC AAGGCTGCTG CCCGCAAAAC TTTGCTGAAC CCTAAAGATG AAACCCAGGT TCGTAAAGTT TGCGGACATG AACTGAAATT TACTCATTTG AGCAAAGTAT ACTGGCCGGA AGATCAGGTA 
ACCAAAAGAG ACCTGTTTAA TTATTATTAC CAGGTTGCGG AGTACATATT GCCCTACCTG AAAGACCGGC CCATGTCATT GAATCGCTTT CCTGGTGGTA TTAATGGTCA GAGTTTTTAT CAGAAAGATG TAAAAGGAAA GGCACCGGAC TGGGCCAGAA CTTTTCCTTA TACTACGGGT GAGGGCGAAG CTAAAGAATA CCTGGTGGGG GATGATGAGG CCACCTTACT TTGGATGGTC TCTCTTGGCT GCATTGAAAT GAATCCCTGG TTTAGCAGGG TAAAATCGCC TGACAACCCG GACTATTGTG TGATCGACCT TGATCCGGAT AAGCAGAATT TTGATCAGGT TGTAGAAGCG GCCCTTACGG TAAAGGAGGT TTGCGACCAG ATGGATGTGC CAAGTTTTTG TAAAACATCC GGTTCTACCG GTATCCACAT TTATATCCCC CTTGCTGCAA AATACAGTTA TGACCAATCG CAGATGTTTG CAAGGATCAT TGTCAGTCTA GTCCATCAGC GGATTCCGGA ATATACCAGC CTGGAACGGA TGATTCCTGC GCGCAAGGGA AAAATGTACC TCGACTTTTT GCAAAACCGT CCGGGGGCGA CCATTGCCGG CCCTTATTCA TTAAGACCAA AACCAGGTGC AACGGTTTCT ATGCCCCTGC ATTGGGAGGA AGTAAAGCCT GGTTTAAAAA TGACTGATTT TACTATTTTT AATGCGCTGG AACGTTTAAA AAGTGAAGGT GATTTTTTTA GAGGCGTACT GGGAAAAGGC ATTGATCTGG AAAAAACGAT TCATAAAGCC AAAGGTGTTT TTGGCAATAG TTAA
|
Protein sequence | MGLDKYNHKR DFNKTAEPKA GKSKDADQLH FVIQKHDASH LHYDFRLEME GVLKSWAVPK GPSTDPKVKR LAMMVEDHPY DYKDFEGIIP KGEYGGGTVI VWDEGTYEPI EEIKGKKAQD KYLRKQLKEG SLKIKLQGKK LKGEFALVKT RGMGENGWLL IKHKDDYAAT ADITKQDKSV LSGKTIAAME QTSDKVWQHG HEEKLEKAPK KKIPIGMKPM LATLVDEPFD DPDWVYEVKW DGYRALGFSR KNGDVQLLSR NNKPFNEKFY PIYEVMQNWK IDAVVDGEIL VLNDKGVSNF AELQNWRSEA DGELVYYVFD LIWYDGKDLT GLSLLQRQAI LKSILPQDDD RIRLSKVFTA EGTEFFEAAK KMGLEGIIAK KGSSTYTPGN RSTDWLKIKI NKRQEVVIAG FTKNKDSSKQ FSSLLLGVYE RGHLQYVGKV GTGFSDQLQK EMMKQFEPLM ITKSPFDEIP DVNKPSRFRP NPPQAKATWL KPELVCEVAY AEVTSDGVFR HPSFQGMRID KKAKDVIREA AVETKAVVNL DMETDDMAKT NKHAKAIKPP KAAARKTLLN PKDETQVRKV CGHELKFTHL SKVYWPEDQV TKRDLFNYYY QVAEYILPYL KDRPMSLNRF PGGINGQSFY QKDVKGKAPD WARTFPYTTG EGEAKEYLVG DDEATLLWMV SLGCIEMNPW FSRVKSPDNP DYCVIDLDPD KQNFDQVVEA ALTVKEVCDQ MDVPSFCKTS GSTGIHIYIP LAAKYSYDQS QMFARIIVSL VHQRIPEYTS LERMIPARKG KMYLDFLQNR PGATIAGPYS LRPKPGATVS MPLHWEEVKP GLKMTDFTIF NALERLKSEG DFFRGVLGKG IDLEKTIHKA KGVFGNS
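
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first three blocks of the gene sequence, so its GC value is not expected to equal the 44% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
sample = clean_sequence("ATGGGACTTG ACAAGTATAA CCATAAACGC")
print(len(sample), gc_percent(sample))
```

Pasting the full gene sequence into `clean_sequence` in place of the sample should allow the reported gene length and GC content to be checked against the table.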
|
| |