Gene Phep_0387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0387 
Symbol 
ID8251472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp458472 
End bp460268 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content39% 
IMG OID644934035 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003090673 
Protein GI255530301 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.971851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTT TCGATCATAA AAAGGCCCTG ACCAACATAC CACATAAACC TGGTATTTAC 
CAGTACTGGG ATGCAGAAGG TACGCTTATT TATATCGGTA AAGCTAAAGA TCTACGAAAC
CGGGTAGGCT CCTATTTCAA CAAAGACAAT CAGATGAACG GAAAAACCAG GGTACTGGTT
TCCCGGATCC GTAAAATTAC CTTTACCATT GTTGATACAG AAATAGACGC CTGGCTACTG
GAAAACAGTC TGATCAAAAA GCATCAGCCA AGATATAACA TCCTGCTCAA AGATGATAAA
ACCTATCCCT GGATCATCAT TAAAAAAGAA CCTTTCCCCA GGGTATACTG GACAAGGAAA
ATGATAAAAG ACGGATCAAC TTATTTTGGC CCTTATGCTT CTGTTGGTAT GATGCACACC
ATCATCGACC TTATTAAAGA AACCTATCCC TTAAGAACCT GTAACCTGCC ACTCAGCAAA
AAAAATATTG ACGATGGTAA ATTCAAGGTA TGCCTTGAAT ACCAGATCGG GAACTGCAAA
GGGCCATGTC AGGCCTATCA GACCGAACCC GATTACGATT CAAATATTGA AGAGATCAAA
GATATTCTCA ATGGAAAAAT CGGCAATGTC ATTAAAGACG TAAAACGGGT TATTAAAAAG
TCAGTAGATG AATTAAATTT TGAACTAGCC CACCAGTACC AGCGTAAATT GATGGTATTA
GAAAAATACC AGAGCAAATC AACGGTAGTA AACAGTGCCA TCACCAATGT AGATGTAGTT
AGTATCGCTT CTGATGAACG TTACGCTTTT GTGAATTACC TTAAAGTAAT GAACGGAAGT
ATCATTCAAA CACAAACCAT TGAGATTAAG AAACGCCTGG ATGAAACTGA TGAAGAGCTT
CTTACTATTG CCATAATGGA ATTCCGCACC AGGTTTAATA GTACCTCAAA AGAGATCATT
GTCCCTTTTG ACATTACATT AACGGATGAA AATCTTAAGT TTACTTTACC CAAACTGGGG
GAAAAGAAAA AACTGCTTGA ACTCTCACAG AAAAACGTAT TGTTCTTTAA AAAAGAGAAG
CTAAACCAGT ACGAGAAATT AAATCCTGAT CTGAGAACAG ACCGGATCCT TAGTCAGATG
CAGAAAGACC TTGGGCTTAC AAAAAGCCCC AAACACATCG AATGCTTTGA CAATTCCAAC
TTCCAGGGCA AGTACCCGGT TTCTGCCATT GTTGTTTTTA AGGACGCCAA ACCTTCTAAA
AAAGATTACA GGCATTTCAA TGTGAAAACA GTAGAAGGAC CAAATGATTT TGCCACCATG
GAAGAGGCTG TTTACAGGCG TTATAAGCGC ATGCTGGAAG AAGAGAACAC CTTACCTGAA
CTCATCATCA TTGATGGAGG TAAGGGCCAG CTTTCCTCTG CCATGAACAG TTTAAAGAAA
CTGGGCATAG AAAAACGGGT TACTGTAATC GGAATAGCTA AAAGATTGGA AGAATTGTTT
TTCCCCGGTG ATCCATATCC TTTATACCTG GATAAAAAAT CGGAAACCTT AAAAGTAATT
CAGCAGCTAC GCGATGAGGC TCACCGCTTT GGTATCACCT TCCACCGAAA AAAGAGAGAT
CAGGGCACAC TGAAAACAGA ATTGGAACAA ATTCCCGGCA TTGGTAAAAC CACTGCCGAT
AAACTACTCA GGCATTTCAA ATCTGTTAAA AAAATAAAAG AGGCTAAAGA AGAAGAGTTA
ACTATGGTAC TTAATAAAAC TCAGGTAAAA ACCCTTTTAG ATTACTTTAC AAGATAA
 
Protein sequence
MSAFDHKKAL TNIPHKPGIY QYWDAEGTLI YIGKAKDLRN RVGSYFNKDN QMNGKTRVLV 
SRIRKITFTI VDTEIDAWLL ENSLIKKHQP RYNILLKDDK TYPWIIIKKE PFPRVYWTRK
MIKDGSTYFG PYASVGMMHT IIDLIKETYP LRTCNLPLSK KNIDDGKFKV CLEYQIGNCK
GPCQAYQTEP DYDSNIEEIK DILNGKIGNV IKDVKRVIKK SVDELNFELA HQYQRKLMVL
EKYQSKSTVV NSAITNVDVV SIASDERYAF VNYLKVMNGS IIQTQTIEIK KRLDETDEEL
LTIAIMEFRT RFNSTSKEII VPFDITLTDE NLKFTLPKLG EKKKLLELSQ KNVLFFKKEK
LNQYEKLNPD LRTDRILSQM QKDLGLTKSP KHIECFDNSN FQGKYPVSAI VVFKDAKPSK
KDYRHFNVKT VEGPNDFATM EEAVYRRYKR MLEEENTLPE LIIIDGGKGQ LSSAMNSLKK
LGIEKRVTVI GIAKRLEELF FPGDPYPLYL DKKSETLKVI QQLRDEAHRF GITFHRKKRD
QGTLKTELEQ IPGIGKTTAD KLLRHFKSVK KIKEAKEEEL TMVLNKTQVK TLLDYFTR