Gene Phep_2044 details


Gene Information

Locus tag: Phep_2044
Symbol:
ID: 8253148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2360936
End bp: 2362762
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 46%
IMG OID: 644935692
Product: DNA topoisomerase type IA central domain protein
Protein accession: YP_003092311
Protein GI: 255531939
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
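
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2360936
end_bp = 2362762
gene_length_bp = 1827
protein_length_aa = 608

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is
# not part of the translated protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.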


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0995937
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00726991
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATTG TTATTGCAGA GAAACCTTCC GTGGGACGTG AATTGGCAAA GGTTTTTGGT 
GCTACAACTA AAAAGGATGG ATATATTGAA GGGAAAGGTT ATTCTTTTAC CTGGGCATTT
GGCCATTTAT TACAACTGGC CCCGCCGCAG GAATATGGTT TTATAGGTTG GCGAAGACAG
CATTTGCCTA TGCTGCCCAA GAAATTTAAA CTGGCTATCC GTAAAATCAA AACCAAGGAC
GGCATGGTTG AAGATCCGGG TGTGCGGAAG CAGCTGGATA TCATTAAAAA GTTATTTGAT
GAAGCTACAG AGATCATTGT GGCAACGGAT GCCGGGCGTG AAGGTGAACT CATTTTCCGC
TATATTTATT ATTTCCTGAA ATGCAAGAAG CCTTTTAGAA GGCTCTGGAT TTCATCGCAG
ACCGATGAAG CCATAAAAGA GGGGTTCAGG AACTTAAAGC CGGGTACAGA TTACGATACC
CTGTTCAATT CTGCACACTG CAGGTCTGAA TCTGACTGGC TGGTAGGGAT GAACGCCACA
CAGGCTTTAA GTATCTCGGC AGGAAACCGT TCGGTATTGT CGCTGGGCAG GGTACAGACA
CCTACACTGG CCATGATCTG CTCCCGTTTT CTGGAGATCA AAAATTTTGT CCCCCAAACT
TATTATCAGC TGGCCATACA GCTGGATAAG GACGGACAGC TGTTCAGGGC CATGTCGGTC
AGCAATTTTG ATAAAAAGGA AGAAGCAGAG GAACTGCTGG CTAAAATTGA AGACGTGGCC
TCGGGTTTTA GTAATGGAGG GAAGATTTTA AGTGTGGAAG CCAAGCCGCG TAAGGAACCA
CCACCATTGC TGCATGACCT GAGCAGTTTG CAGCAGGAGG CCAATAAGCG CAAGGGCTTT
ACGGCAGACC AGACCTTAAG TTTGCTCCAG GGTCTTTACG AAAGCAAGCT GGTTACTTAC
CCGCGTACGG GCAGCCGGTA TATCGGCGAT GATATATTTG CGGGTGTGCC TGCTTTGATC
GATAAGGTAA GGGGCCATAA AGATTTTGGA AAGCAGGCAG AGTTTCTGCT TACGGTTCCT
TTAAACAAGC GCAGTGTAAA TGCGAAAAAG GTAACCGACC ACCATGCCAT TTTACCTACA
GGCGAGTCCC CTTATCAGTT AAATGGTGAT AAACAAGCTG TTTATGATAT GGTAGTTGGA
CGGATGATTG AGGCTTTTCA TCAGGAATGT GTAAAAGAGA TCACTAAGAT ATCTGTCGAA
TCCGGTTCTT TATTTATTGC CAATGGCACG GTGATCCGTG CTGCGGGCTG GCGGTCGGTA
TTTAATGAAT CGGATGAGGA GAAGAAGGAT GAGGATAACC CGGCATTGCC TAAGTTGAAA
AAAGGAGAGG AGCTTCCGGT TACCAATAAG GCGTTGCTGG AAAAGCAAAC CAAACCTAAA
GCAATGTACA ATGAGGCTTC TTTGTTAAAA GCACTGGAAA CTTCGGGTAA GGACATTGAA
GATGAGGAAT TGAGGTACGC CATGAAGGAT AGCGGATTGG GTACACCAGC TACGCGTGCG
GCCATCATCG AAACGCTCAT TAGCCGTGAA TACGTTTCCA GGGAAAAGCG GAACCTGGTG
CCCACAACTA AAGGACTGGC AGTTTATGAT GTGGTAAAAG ACCAGAAAAT TGCCCAGGCT
GAACTGACCG GACAATGGGA AAAAAGGCTG GAAGAGATCA GGTCTGGTGC TTCTGTAAGT
GATTTTAAAG CCGAAATAGC CGATTACACC AAAACCATTA CCAATGAATT GCTTGCAGCG
GGCTTAACAC TGGCAGAAAA AATATAA
 
Protein sequence
MKIVIAEKPS VGRELAKVFG ATTKKDGYIE GKGYSFTWAF GHLLQLAPPQ EYGFIGWRRQ 
HLPMLPKKFK LAIRKIKTKD GMVEDPGVRK QLDIIKKLFD EATEIIVATD AGREGELIFR
YIYYFLKCKK PFRRLWISSQ TDEAIKEGFR NLKPGTDYDT LFNSAHCRSE SDWLVGMNAT
QALSISAGNR SVLSLGRVQT PTLAMICSRF LEIKNFVPQT YYQLAIQLDK DGQLFRAMSV
SNFDKKEEAE ELLAKIEDVA SGFSNGGKIL SVEAKPRKEP PPLLHDLSSL QQEANKRKGF
TADQTLSLLQ GLYESKLVTY PRTGSRYIGD DIFAGVPALI DKVRGHKDFG KQAEFLLTVP
LNKRSVNAKK VTDHHAILPT GESPYQLNGD KQAVYDMVVG RMIEAFHQEC VKEITKISVE
SGSLFIANGT VIRAAGWRSV FNESDEEKKD EDNPALPKLK KGEELPVTNK ALLEKQTKPK
AMYNEASLLK ALETSGKDIE DEELRYAMKD SGLGTPATRA AIIETLISRE YVSREKRNLV
PTTKGLAVYD VVKDQKIAQA ELTGQWEKRL EEIRSGASVS DFKAEIADYT KTITNELLAA
GLTLAEKI
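
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first 30 bases and on the final line of the gene sequence. It assumes the standard NCBI amino acid string for table 11 with codons ordered over the bases T, C, A, G. The helper `translate` is a hypothetical name introduced for illustration, and it ignores the alternative start codons of table 11, which do not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases cycle in the order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_bases = "ATGAAGATTG TTATTGCAGA GAAACCTTCC"
# Final line of the gene sequence (in frame, since 1800 bases precede it).
last_bases = "GGCTTAACAC TGGCAGAAAA AATATAA"

print(translate(first_bases))  # MKIVIAEKPS
print(translate(last_bases))   # GLTLAEKI
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.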