Gene Phep_4036 details


Gene Information

Locus tag: Phep_4036
Symbol:
ID: 8255170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4879252
End bp: 4881291
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 47%
IMG OID: 644937700
Product: excinuclease ABC, B subunit
Protein accession: YP_003094289
Protein GI: 255533917
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
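
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4879252
END_BP = 4881291
GENE_LENGTH_BP = 2040
PROTEIN_LENGTH_AA = 679


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the range spans 2040 bp, and 2040 / 3 = 680 codons, one of which is the stop codon.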


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.545808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTC AAATCGTTTC AGATTATAAA CCAACCGGAG ATCAGCCTGC TGCGATTAAA 
CAACTGGTTG AAGGTGTAAA CAACGAAGAT CATTATCAGA CTTTACTAGG GGTGACCGGT
TCGGGCAAGA CCTTTACCAT AGCGAATGTG ATACAGCAGA CCCAGAAACC CACACTGATC
CTGAGCCATA ATAAAACACT GGCAGCACAG CTTTATGGGG AATTTAAACA GTTTTTTCCG
GAAAATGCAG TGAACTATTT TGTTTCTTAT TATGATTATT ACCAGCCTGA AGCTTTCATT
GCCAGCAGCA ATACCTATAT AGAGAAAGAT TTAAGTATCA ATGAGGAAAT AGAGAAATTG
CGGCTGCGGA CGACTTCTGC CCTGATGTCG GGCCGGAGGG ATGTCATCGT GGTCTCTTCT
ATCTCCTGTA TTTATGGTAT GGGCAATCCG GAAGATTTTT CGAGATCGGT ATTCCGCTTT
TCGGTGGGGT TACGGATTTC AAGGAATTCC TTTCTGCATA GCCTGGTGGA GATCCTGTAT
GCCCGTACCA CCACTGATTT TAAGCGCGGT ACTTTCAGGG TAAAAGGCGA TACGGTTGAT
ATTTTTCCGG CCTACCTGGA TAATGCTTAC CGTGTTTCTT TTTTTGGGGA TGACATTGAA
GCGCTGAGCG TGATAGACCC GGTTACAGGA AAGACACTGG AAAAGCTGGA AGACATGGCG
ATCTATCCTG CCAATTTGTT TGTCACACCT AAGGAAAGAT TTAATTCATC GATATGGGGG
ATACAGGAAG AACTGGAGAT CAGGAAGAAC CAATTGATTG GCGACCGGCA TTTGCTGGAA
GCAAAGCGGC TGGAAGAAAG GGTGAACTTT GATATAGAGA TGATGAAGGA ACTGGGCTAT
TGCTCAGGTA TAGAAAACTA CTCCCGCTTT TTTGACGGGA GGGCGCCGGG AATGCGGCCC
TTCTGTTTGC TGGATTACTT TCCGGATGAT TATTTAATGG TGATTGATGA AAGCCATGTT
ACGGTACCAC AGATCAGGGC GATGTATGGT GGCGACAGGT CGCGGAAAAT GTCGCTTGTA
GAATACGGGT TCCGTTTGCC ATCGGCCCTG GACAACAGGC CCCTGAACTT TGATGAGTTT
GAACGTCTGG CACCACAGAC CATTTATGTA AGTGCTACCC CGGCAGATTA TGAATTGCAG
AAATCGGAAG GAATTGTGAT TGAACAGGTG ATCAGGCCTA CAGGTTTACT GGATCCTTTG
ATTGATGTGC GGCCGGCAGT TAACCAGGTG GATGACCTGC TGGATGAAAT TGACAAGACC
ATTAAGCTGG GAGACAGGGT ACTGGTAACT ACACTGACCA AAAGGATGGC AGAGGAGCTG
ACCAAATATA TGGACCGGCT GAACATCAAA TGCCGTTATA TCCACTCGGA AGTGAAAACG
CTGGAAAGGG TAGAGATCTT ACGGGGGCTG CGCCTGGGTG AATTTGATGT TTTGATCGGG
ATTAACCTGT TGCGGGAGGG GCTTGACCTG CCGGAAGTAT CGCTGGTAGC TATTCTGGAT
GCCGATAAGG AAGGCTTTTT GCGTTCTGAC CGTGCGCTGA TCCAGACCAT TGGCCGTGCA
GCAAGGAATG ACAGGGGACG GGTGATCATG TATGCAGACA ACATGACCGA TTCGATGGAA
CGGACGATTG AGGAAACCAA CAGGCGGAGG GAAAAGCAGG TGGCCTATAA CCTGGAGCAT
GGAATTGTAC CTAAAACGGT TGGTAAGAGC AGGGAGGCGA TAATGGAGCA GAGCTCGGTA
CTGGACTTCT CGTCGGGTGA GCGCAAGCGG GCAAAACCTT ATGTGGAGGT AGATGAGGTG
AGCATTGCTG CTGATCCTGT TGTGCAGTAC ATGACCAAAC CGGAAATGCA GAAATCTATT
GATAAAACCC GTAAGGAAAT GGCTAAGGCA GCAAAAGACA TGGACTTTTT ACTTGCAGCC
AGGTTGCGTG ACGAGATGTT TGCAATGGAG AAATTATTTG AAGAAAAATT TAGTAAGTAA
 
Protein sequence
MKFQIVSDYK PTGDQPAAIK QLVEGVNNED HYQTLLGVTG SGKTFTIANV IQQTQKPTLI 
LSHNKTLAAQ LYGEFKQFFP ENAVNYFVSY YDYYQPEAFI ASSNTYIEKD LSINEEIEKL
RLRTTSALMS GRRDVIVVSS ISCIYGMGNP EDFSRSVFRF SVGLRISRNS FLHSLVEILY
ARTTTDFKRG TFRVKGDTVD IFPAYLDNAY RVSFFGDDIE ALSVIDPVTG KTLEKLEDMA
IYPANLFVTP KERFNSSIWG IQEELEIRKN QLIGDRHLLE AKRLEERVNF DIEMMKELGY
CSGIENYSRF FDGRAPGMRP FCLLDYFPDD YLMVIDESHV TVPQIRAMYG GDRSRKMSLV
EYGFRLPSAL DNRPLNFDEF ERLAPQTIYV SATPADYELQ KSEGIVIEQV IRPTGLLDPL
IDVRPAVNQV DDLLDEIDKT IKLGDRVLVT TLTKRMAEEL TKYMDRLNIK CRYIHSEVKT
LERVEILRGL RLGEFDVLIG INLLREGLDL PEVSLVAILD ADKEGFLRSD RALIQTIGRA
ARNDRGRVIM YADNMTDSME RTIEETNRRR EKQVAYNLEH GIVPKTVGKS REAIMEQSSV
LDFSSGERKR AKPYVEVDEV SIAADPVVQY MTKPEMQKSI DKTRKEMAKA AKDMDFLLAA
RLRDEMFAME KLFEEKFSK
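
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce this record: it assumes the sequence is given in the coding orientation, relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted), and uses hypothetical names (`CODON_TABLE`, `clean`, `translate`). It does not handle ambiguous bases, and it does not apply the rule that an alternative start codon is read as M, which is not needed here because the gene begins with ATG.

```python
# Codon table in NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence above.
    print(translate("ATGAAATTTC AAATCGTTTC AGATTATAAA"))  # MKFQIVSDYK
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural way to compare it against the complete 679 aa protein.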