Gene Information |
Locus tag | Phep_2044 |
Symbol | |
ID | 8253148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2360936 |
End bp | 2362762 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644935692 |
Product | DNA topoisomerase type IA central domain protein |
Protein accession | YP_003092311 |
Protein GI | 255531939 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
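The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2360936
end_bp = 2362762
gene_length_bp = 1827
protein_length_aa = 608

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete, unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and contributes no amino acid.
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.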
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0995937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00726991 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATTG TTATTGCAGA GAAACCTTCC GTGGGACGTG AATTGGCAAA GGTTTTTGGT GCTACAACTA AAAAGGATGG ATATATTGAA GGGAAAGGTT ATTCTTTTAC CTGGGCATTT GGCCATTTAT TACAACTGGC CCCGCCGCAG GAATATGGTT TTATAGGTTG GCGAAGACAG CATTTGCCTA TGCTGCCCAA GAAATTTAAA CTGGCTATCC GTAAAATCAA AACCAAGGAC GGCATGGTTG AAGATCCGGG TGTGCGGAAG CAGCTGGATA TCATTAAAAA GTTATTTGAT GAAGCTACAG AGATCATTGT GGCAACGGAT GCCGGGCGTG AAGGTGAACT CATTTTCCGC TATATTTATT ATTTCCTGAA ATGCAAGAAG CCTTTTAGAA GGCTCTGGAT TTCATCGCAG ACCGATGAAG CCATAAAAGA GGGGTTCAGG AACTTAAAGC CGGGTACAGA TTACGATACC CTGTTCAATT CTGCACACTG CAGGTCTGAA TCTGACTGGC TGGTAGGGAT GAACGCCACA CAGGCTTTAA GTATCTCGGC AGGAAACCGT TCGGTATTGT CGCTGGGCAG GGTACAGACA CCTACACTGG CCATGATCTG CTCCCGTTTT CTGGAGATCA AAAATTTTGT CCCCCAAACT TATTATCAGC TGGCCATACA GCTGGATAAG GACGGACAGC TGTTCAGGGC CATGTCGGTC AGCAATTTTG ATAAAAAGGA AGAAGCAGAG GAACTGCTGG CTAAAATTGA AGACGTGGCC TCGGGTTTTA GTAATGGAGG GAAGATTTTA AGTGTGGAAG CCAAGCCGCG TAAGGAACCA CCACCATTGC TGCATGACCT GAGCAGTTTG CAGCAGGAGG CCAATAAGCG CAAGGGCTTT ACGGCAGACC AGACCTTAAG TTTGCTCCAG GGTCTTTACG AAAGCAAGCT GGTTACTTAC CCGCGTACGG GCAGCCGGTA TATCGGCGAT GATATATTTG CGGGTGTGCC TGCTTTGATC GATAAGGTAA GGGGCCATAA AGATTTTGGA AAGCAGGCAG AGTTTCTGCT TACGGTTCCT TTAAACAAGC GCAGTGTAAA TGCGAAAAAG GTAACCGACC ACCATGCCAT TTTACCTACA GGCGAGTCCC CTTATCAGTT AAATGGTGAT AAACAAGCTG TTTATGATAT GGTAGTTGGA CGGATGATTG AGGCTTTTCA TCAGGAATGT GTAAAAGAGA TCACTAAGAT ATCTGTCGAA TCCGGTTCTT TATTTATTGC CAATGGCACG GTGATCCGTG CTGCGGGCTG GCGGTCGGTA TTTAATGAAT CGGATGAGGA GAAGAAGGAT GAGGATAACC CGGCATTGCC TAAGTTGAAA AAAGGAGAGG AGCTTCCGGT TACCAATAAG GCGTTGCTGG AAAAGCAAAC CAAACCTAAA GCAATGTACA ATGAGGCTTC TTTGTTAAAA GCACTGGAAA CTTCGGGTAA GGACATTGAA GATGAGGAAT TGAGGTACGC CATGAAGGAT AGCGGATTGG GTACACCAGC TACGCGTGCG GCCATCATCG AAACGCTCAT TAGCCGTGAA TACGTTTCCA GGGAAAAGCG GAACCTGGTG CCCACAACTA AAGGACTGGC AGTTTATGAT GTGGTAAAAG ACCAGAAAAT TGCCCAGGCT GAACTGACCG GACAATGGGA AAAAAGGCTG GAAGAGATCA GGTCTGGTGC TTCTGTAAGT GATTTTAAAG CCGAAATAGC CGATTACACC AAAACCATTA CCAATGAATT GCTTGCAGCG 
GGCTTAACAC TGGCAGAAAA AATATAA
|
Protein sequence | MKIVIAEKPS VGRELAKVFG ATTKKDGYIE GKGYSFTWAF GHLLQLAPPQ EYGFIGWRRQ HLPMLPKKFK LAIRKIKTKD GMVEDPGVRK QLDIIKKLFD EATEIIVATD AGREGELIFR YIYYFLKCKK PFRRLWISSQ TDEAIKEGFR NLKPGTDYDT LFNSAHCRSE SDWLVGMNAT QALSISAGNR SVLSLGRVQT PTLAMICSRF LEIKNFVPQT YYQLAIQLDK DGQLFRAMSV SNFDKKEEAE ELLAKIEDVA SGFSNGGKIL SVEAKPRKEP PPLLHDLSSL QQEANKRKGF TADQTLSLLQ GLYESKLVTY PRTGSRYIGD DIFAGVPALI DKVRGHKDFG KQAEFLLTVP LNKRSVNAKK VTDHHAILPT GESPYQLNGD KQAVYDMVVG RMIEAFHQEC VKEITKISVE SGSLFIANGT VIRAAGWRSV FNESDEEKKD EDNPALPKLK KGEELPVTNK ALLEKQTKPK AMYNEASLLK ALETSGKDIE DEELRYAMKD SGLGTPATRA AIIETLISRE YVSREKRNLV PTTKGLAVYD VVKDQKIAQA ELTGQWEKRL EEIRSGASVS DFKAEIADYT KTITNELLAA GLTLAEKI
|
| |
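The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the pipeline that produced the record: it assumes the sequence is read in frame from its first base, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest), with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAGATTG TTATTGCAGA GAAACCTTCC"))
```

The first ten codons of the gene give the first ten residues of the listed protein, and the final 27 bases (`GGCTTAACAC TGGCAGAAAA AATATAA`) translate to its last eight residues followed by the stop codon.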