Gene Phep_3470 details


Gene Information

Locus tag: Phep_3470
Symbol:
ID: 8254590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4122690
End bp: 4125623
Gene Length: 2934 bp
Protein Length: 977 aa
Translation table: 11
GC content: 42%
IMG OID: 644937122
Product: DNA topoisomerase IV subunit A
Protein accession: YP_003093725
Protein GI: 255533353
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID:
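
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4122690, 4125623)
print(length)                  # 2934
print(protein_length(length))  # 977
```

Both values match the Gene Length and Protein Length fields of this record.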


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0658423
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAG AAATAGAAAA CAACATAAAC GAAGAAAATA AACATACCGT AATTCCAATC 
AACGGACTTT ATGAAAACTG GTTCCTCGAC TATGCTTCTT ATGTAATTCT TGACCGCGCT
GTGCCACACA TCAATGATGG CTTGAAGCCG GTGCAGCGAC GTATCATGCA TTCTTTGAAA
GAGATGGATG ATGGACGCTT CAATAAGGCA GCCAACGTGA TTGGAAATAC AATGAAATAC
CATCCGCATG GGGATGCTTC CATAGGTGAT GCCATGGTGC AGATTGGACA AAAAGACCTG
CTGATTGATT GCCAGGGTAA CTGGGGAGAC CCGATTACCG GTGATAATGC TGCTGCGCCG
CGTTATATTG AGGCCCGTTT ATCTAAATTT GCCAATGAAG TTGTGTTTAA TGGTGATACA
ACCATCTGGC AGTTGAGTTA TGACGGGCGT AACAACGAAC CGGTTACCTT GCCTGTTAAA
TTCCCCCTGT TGCTTGCCCA GGGAGCTGAA GGTATTGCCG TTGGTTTAGC AACCAAAGTA
ATGCCACATA ACTTTATCGA ATTGCTGGAT GCTTCTATTG AAGCATTGCA AGGGCATCGG
CCTAACATTT TGCCCGATTT TTTTACAGGT GGTATGGCCG ATTTTTCTGC TTACAACGAG
GGGATGCGTG GCGGCCGTAT CAGGGTGAGG GCCAAAATCA CCGAAAAGGA TAAAAAAACA
CTGGTTATTA CCGAAATACC TTACAGCACG ACTACCGGAT CGGTAATTGA CAGTATCTTA
TCGGCCAACG ACAAGGGCAA GATCAAGATC AAAAAGATTG AGGACAATAC CGCTGCCCAT
GTCGAGATCG TGATCCAGCT GGCACCAGGT ATTTCTCCTG ATGTGACCAT TGATGCCTTA
TACGCCTTTA CTTCATGTGA GGTTTCAATA TCACCAAATA CCTGCATCAT TAAGGATGAA
AAGCCTCAGT TTTTAAGTGT TAATGATATC CTGATAGAGA ATTCGATGCA CACCAAGGCC
CTGTTGAAAA AGGAACTGGA AATTAAACTG CATGAATTAC AGGAAAAGAT ATTTTTCAGC
TCATTGCTAA AGATATTTAT TCAGGAAGGG ATGTATAAAA ATGCTGAATA TGAAAATTCA
GTCAATTTTG AAATGGTGGT AGAGGTTTTG AACAGGTTGT TTGAGCCTTT TAAACCAGGT
CTTTACAGAG AAATACTTCC TGAGGATTTT AAAAAGCTGA TTGATAAACC AATGAGCAGC
ATTACCCGCT TTGATGTTAA AAAAGCGGAT GAACAGATGA AAGCCCTTTC TGACGAGATC
AAAGTGGTTA AAAACCATTT ACGGCATTTA ACGGAATATG CCATTGCCTG GTTTCAGAAA
TTAAAGGACA AGTATGGTAA GGGCAGGGAG CGTAAAACTG AGATCCGTTT GTTTGATCGG
GTTGAGGCAT CTAAAGTAGC CTTGGCCAAT GTTAAACTTT ACATGAACCG TGAAGATGGT
TTTATAGGGA CCGGATTGCG GAAGGATGAA TTTGTTGCCG ATTGTTCAGA TATTGATGAG
CTCATTGTTT TTAGAGAAGA CGGTAAATGC ATCATTACGA AAGTTGCCGA TAAAACTTTT
GTAGGTAAGG GTATCCTGCA TGCCCAGGTA TTTAAAAAGG GGGATGAACG GACCATTTAC
AACATGATCT ATAAAGATGG GGCGAGCGGG GTTTCTTATA TTAAGCGTTT TGCAGTTGTA
GGGGTTACCC GTGATAAGGA ATACGATCTG ACAAAAGGGA GTAAGGGCTC AAAAGTGCTG
TATTTTACAG CTAATCCTAA TGGAGAAGCC GAAATTGTGA CTGTACAGCT TAAACCACAT
AGCAAGTTAA AAAAGCTGCA GTTCGATCTC GATTTTGCTG AAATTGCGAT AAAAGGACGC
GGTTCACAGG GAAATATTGT TTCTAAATAT CCGGTTAAAA AAATACTGCT GAAAAGCAAA
GGCGTATCTA CACTTTCGGG ACTAAAAATC TGGTATGATG ATTTGCTAAG AAGGTTAAAT
GTGGACGGAA GGGGCAAATA TCTTGGTGAA TTTGATGGCG AGGATAAAAT ATTGCAGGTA
CATAAAGACG GCTGGTATGA ACTGAGTACT TTTGAGCTGA GCAACCACTT TGATGCCGAT
CTGATCTTAA TTCAGAAGTT TGACCCTGAA AAACCTTTTG CAGTTGTGCA ATATGAGGGT
AAAGCCAAAA ACTATTTTAT CAAAAGGTTC CTTTTTGAGG CGATTGCTGT AGGCAAAAAA
GTAAGTCTGA TTAGTGAGGA GAACGGATCG AAGTTCCTTT ACCTCAGCAG CAATCCGGCA
GCGGTTTTAA CAGTTGATGT ATTGAAAGGA AAAACCCAGG TTCCGGAAAC ACTTGAAATT
GTTCTTGCCG AATTTATTGA TGTAAAAGGA ATTAAAGCCA ATGGAAACAG GCTTACTGCC
CATGAAGTTA AAAATATAAC CATTTCAAAT CACACGGAAA TTGAACCTGC TGAAGAAGTT
AAAGTTGTTT CTGTGGCAAA TGAGGCTGAG GATGAGTTTC AAGAGGATGA AGCCGCTGCA
ACGAATATAG AAAATGGGGA AGAACTGGTT GATCCTGCAT TGAGTAAAGA TGCAGATGAG
GAGGAAGCTC CGTTACCTGA ACCTGAAATT CCCGCTACTG AAAAGCCTGC TGCGGCCGAA
AAACCTAAAA AACAATGGGA CAGCGGGGCC ACCAAACCTG CCGAAGCACC TAAGCCAAAA
AAAGAAACGG GTCCTGCCAA AGAAAAGTCT GCTGATGAAA CAAAACCGAA AAAAGAGCCC
ATAGCCAAAC CTGAGCCGGA ACAGGAAGAG AAACCCGCAA AAAAAATAGA TTTTGAGATC
ACCAATCCGG ATGATATCAA AATTGATGAT AAAGGACAGC TAGGGTTCTT TTAG
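
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following Python sketch strips the whitespace and computes a GC percentage that can be compared with the GC content field of this record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row is pasted here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence; paste all rows to evaluate the whole gene.
first_row = "ATGAGCGAAG AAATAGAAAA CAACATAAAC GAAGAAAATA AACATACCGT AATTCCAATC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```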
 
Protein sequence
MSEEIENNIN EENKHTVIPI NGLYENWFLD YASYVILDRA VPHINDGLKP VQRRIMHSLK 
EMDDGRFNKA ANVIGNTMKY HPHGDASIGD AMVQIGQKDL LIDCQGNWGD PITGDNAAAP
RYIEARLSKF ANEVVFNGDT TIWQLSYDGR NNEPVTLPVK FPLLLAQGAE GIAVGLATKV
MPHNFIELLD ASIEALQGHR PNILPDFFTG GMADFSAYNE GMRGGRIRVR AKITEKDKKT
LVITEIPYST TTGSVIDSIL SANDKGKIKI KKIEDNTAAH VEIVIQLAPG ISPDVTIDAL
YAFTSCEVSI SPNTCIIKDE KPQFLSVNDI LIENSMHTKA LLKKELEIKL HELQEKIFFS
SLLKIFIQEG MYKNAEYENS VNFEMVVEVL NRLFEPFKPG LYREILPEDF KKLIDKPMSS
ITRFDVKKAD EQMKALSDEI KVVKNHLRHL TEYAIAWFQK LKDKYGKGRE RKTEIRLFDR
VEASKVALAN VKLYMNREDG FIGTGLRKDE FVADCSDIDE LIVFREDGKC IITKVADKTF
VGKGILHAQV FKKGDERTIY NMIYKDGASG VSYIKRFAVV GVTRDKEYDL TKGSKGSKVL
YFTANPNGEA EIVTVQLKPH SKLKKLQFDL DFAEIAIKGR GSQGNIVSKY PVKKILLKSK
GVSTLSGLKI WYDDLLRRLN VDGRGKYLGE FDGEDKILQV HKDGWYELST FELSNHFDAD
LILIQKFDPE KPFAVVQYEG KAKNYFIKRF LFEAIAVGKK VSLISEENGS KFLYLSSNPA
AVLTVDVLKG KTQVPETLEI VLAEFIDVKG IKANGNRLTA HEVKNITISN HTEIEPAEEV
KVVSVANEAE DEFQEDEAAA TNIENGEELV DPALSKDADE EEAPLPEPEI PATEKPAAAE
KPKKQWDSGA TKPAEAPKPK KETGPAKEKS ADETKPKKEP IAKPEPEQEE KPAKKIDFEI
TNPDDIKIDD KGQLGFF
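
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. The Python sketch below ignores alternative start codons, which is harmless here because this gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGAGCGAAG AAATAGAAAA CAACATAAAC GAAGAAAATA AACATACCGT AATTCCAATC"
print(translate(first_row))  # MSEEIENNINEENKHTVIPI
```

The output matches the first 20 residues of the protein sequence listed above.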