Gene Phep_1241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1241 
Symbol 
ID8252339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1460548 
End bp1462713 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content44% 
IMG OID644934894 
Productpeptidase S9B dipeptidylpeptidase IV domain protein 
Protein accessionYP_003091519 
Protein GI255531147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.349009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC TGATTTTACT GATCTATCTG TTGGCTGTAG CATCTGGTAG TTTTGCACAG 
CAGCAGCAGC AATTGAGCAT GCAGGATGCG ATGAGCAATG CCCGGACAAC GCTGGCACCC
GAAAACTTAT CCCAACTTCA ATTTATTTAT GGAACAGAGG ATTATGTATA TGCCAAACGT
ATCGGCAATA GTCCGGTTTG GTTAAGTGGC AATGCCAAAT CAAAGGAAGA CCAGCCTTTT
CTGACCTTAA CACAATTAAA CCAGAAATTA AGGAACGCGA AGAAGGATAC TTTGAAGATG
ATGCCCGTGA TCCAGTTTAA CCAGGGGCCG GAATGGATTT TAAACCTTAA CGGTAGTAAA
GTAGCGATCA ACCCGGTTAA AAATACGGTA GATGTATTGG TAGACCAGTC GTTAATGGCA
AAAACAAACG CAGAGGAAAG CAAGGCCGGT TATGTGGCTT ATCTGGATAA TTTCAACTTG
TTTGTTGCTA AAGACGGGGA TCGGAAACAG GTTACTACTG ATGGTAACAG TGATATTGTT
TACGCTTCTT CGGTACACAG GGAAGAGTTC GGGATCAGTA AAGGAACTTT CTGGAGTAAT
AATGGTAAGG TGCTTGCTTT CTACAGAATG GACCAGCGAA TGGTTACAGA TTATCCGATC
ATCGACTGGA CCAGCCGGCC TGCTCACAAT GTAAACATCA AATATCCTAT GGCGGGTGAC
AAGAGCCATC ATGTTACTGT GGGGGTGTAT CATGCAGAAA CTAAAGCTGT AGTGTATTTG
AAAACCGGCG AGCCGGCAGA GCAGTATTTA ACAAATATTG CCTGGAGTCC GGATGATAAA
TATGTTTATA TAGCGGTATT GAACCGTGGA CAAAATCACA TGAAGCTAAA CCAGTACGAC
GCGGCTACAG GCGATTTTGT GAAAACCTTA TTTGAAGAGA AAGATGATAA ATATGTAGAG
CCACTGGTGC CGATGTTATT CCTGAAAAAT GATCCTTCAA AATTTATATG GCAAAGCAAC
AGGGATGGCT GGAACCATTT ATACCTGTAC GATTTAAAAG GCAGGGTGGT AAAACAACTA
ACCAGGGGGG CATGGGAAGT GCTGGAGGTA AAAGGTTTTG ATGCTAAAGG TGAGCGGCTG
TTTTACGTTT CAACGGAAGA GTCGCCGGTA ACCAGGAATT TATATGTATT AAATGTGAAA
TCTGGTCAGT CGCGCAGGCT TACATCTGCT TTTGCGGTAC ACAATACGCA GGTAAGCATT
TCCGGAAATA CTGTAATTGA TGTTTACAGT ACACCTGATG TGCCCAGGGT GATCCAGCTT
GTAGAAACAC CTGGTTCAAA AGCTAAGTTA TTGTTGAAGT CTGCAAACCC CTTGTCGGCT
TATGCTACAG AAAACTCATC GATATTTACC ATTAAAAGTA AATCGGGTGA GGACTTGTAT
ATGAACCTGT ACAAGCCGGT AAATTATGAT GCCGGTAAAA AATATCCTGT AGTGGTTTAC
TGGTATGGCG GTCCGCATGC ACAGCTGATC ACCAACAGCT GGAATGCCGG TGCAGGCGAT
TACTGGTCGC GGTATATGGC GCAACGGGGT TATGTAGTGC TTACGGTTGA TGTAAGGGGT
AGCGACAACA GGGGCAGGGC CTTTGAACAA TCTATGTTCC GCAGGGCAGG TGAGGTACAG
ATGGAAGATA TGATGAGTGC CGTGGATTAT CTGAAAGCTC AGCCTTATGT AGATGCAGCC
AACATGGGCT TATTTGGCTG GAGCTTTGGT GGCTTTGCCA CTACAGATTT TATGCTGACC
CACCCGGGTG TGTTTAAAGC TGCCGTAGCT GGCGGGCCGG TAATAAACTG GGCCTTTTAT
GAGATCATGT ATACCGAACG TTATATGGAT ACCCCACAGG AAAACCCTGA AGGTTATGCC
GCGACTTACC TGAGTAACCG TGTTGATCAG CTGAAAGGAA AGTTATTGCT TATCCATGGA
TTACAGGATC CGGTTGTAGT ACAGCAGCAT TCGGTCGATT TTGTGAAACA TGCGGTTGAT
AAAGGTGTAC AGGTAGATTA CATGATCTAT CCTGGTCATG AGCACAATGT ATTGGGTAAA
GACCGGGTGC AGCTGTATCA GAAAGTAACG GATTATTTTG AACTGTACCT GAAAGGGGGA
AAATAA
 
Protein sequence
MKRLILLIYL LAVASGSFAQ QQQQLSMQDA MSNARTTLAP ENLSQLQFIY GTEDYVYAKR 
IGNSPVWLSG NAKSKEDQPF LTLTQLNQKL RNAKKDTLKM MPVIQFNQGP EWILNLNGSK
VAINPVKNTV DVLVDQSLMA KTNAEESKAG YVAYLDNFNL FVAKDGDRKQ VTTDGNSDIV
YASSVHREEF GISKGTFWSN NGKVLAFYRM DQRMVTDYPI IDWTSRPAHN VNIKYPMAGD
KSHHVTVGVY HAETKAVVYL KTGEPAEQYL TNIAWSPDDK YVYIAVLNRG QNHMKLNQYD
AATGDFVKTL FEEKDDKYVE PLVPMLFLKN DPSKFIWQSN RDGWNHLYLY DLKGRVVKQL
TRGAWEVLEV KGFDAKGERL FYVSTEESPV TRNLYVLNVK SGQSRRLTSA FAVHNTQVSI
SGNTVIDVYS TPDVPRVIQL VETPGSKAKL LLKSANPLSA YATENSSIFT IKSKSGEDLY
MNLYKPVNYD AGKKYPVVVY WYGGPHAQLI TNSWNAGAGD YWSRYMAQRG YVVLTVDVRG
SDNRGRAFEQ SMFRRAGEVQ MEDMMSAVDY LKAQPYVDAA NMGLFGWSFG GFATTDFMLT
HPGVFKAAVA GGPVINWAFY EIMYTERYMD TPQENPEGYA ATYLSNRVDQ LKGKLLLIHG
LQDPVVVQQH SVDFVKHAVD KGVQVDYMIY PGHEHNVLGK DRVQLYQKVT DYFELYLKGG
K