Gene Phep_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1901 
Symbol 
ID8253005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2195639 
End bp2198893 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content49% 
IMG OID644935552 
Productamidohydrolase 
Protein accessionYP_003092171 
Protein GI255531799 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACAT CACTACTCCT GTTAATTCTT GTATTCTCTT TTTTTCAATC GTCATTCTCC 
CAATCTCCCA CACCTGGTCC CGACAAACAA TGGGACATAG AAAAATATAA GGGCAATACC
AAAACCTTTA CCTTTAACAC AGATGAAGGT ACATGGATGA ACCTGGATGT AAGCCCTGAT
GGCAAGGATA TCGTGTTTGA CCTGTTGGGC GACCTCTACA TCATGCCCAT TAGCGGGGGT
TCAGCCGTTT TGTTAAGCAG CGGGCCGGCC TGGGATATCC AGCCCCGCTT TAGCCCCAAT
GGCCGGTACA TATCCTATAC CAGTGATAAA AGCGGTGCCG ACAACATCTG GATCATGAAC
CGTGATGGCT CCGGCAAAAG GCAGATATCC AAAGAAACTT TCCGTTTATT GAACAATGCA
ACCTGGATGC CCAACAGTGA GTACCTCATT GCACGCAAAC ACTTTACCGG TACCCGTTCG
CTGGGCGCGG GGGAAATGTG GATGTACAGC ATTTATGGTG GTGAAGGTGT ACAGCTGACC
AAGAGAAAAA ATGACCAGCA GGATGCCGGA GAACCAAATG TTTCGCCGGA TGGCAAATAC
CTTTATTTTA GTGAGGACAT GTCGCCCGGC CCTAACTTCG AATACAGCAA AGACCCTAAC
GGCATGATCT ATGCCATCCG TCAGTTAGAC CTGACCAGCG GCAAAATTAC CAGCCTGATT
GCTCAGCCAG GCGGGGCAGC CCGGCCACAG GTTTCCCCGG ATGGAAAAAT GATGGCCTAT
GTAAAACGCA TGCGCTTAAA ATCTGTACTG GTATTGCAAA ACCTGCAAAC GGGAGAAGAA
TGGCCCATCT ATGAAGACCT GTCGCACGAC CAGCAGGAAA CCTGGGCCAT TTTTGGCGTT
TACCCCAATT ATGCCTGGAC ACCCGATGGC AAGAGCATCA TTTTCTATGC CCGTGGCAAG
ATCAAAAAGA TCGACCTCAA TTCATTGTTC ACCAATAACC TCCCATTTCA TGTCACCGGT
TCACAAACCA TACAGCAGGC CCTGCATTAT CAGCAGCAGG TGTTCAGCAA TGATTTTACG
GTAAAAATGA TCCGGCAGCT TAACACTTCT GCCGACGGCA AATTTGTTGT ATTTAATGCC
GCGGGCTTCC TCTATAAAAA AGACCTGCCA AACGGGCTGC CTGAAAGGGT AAGCAAGGGT
ATCGACTTTG AGTTCGAGCC CGAGATCAGT GCCGACGGCA AATCTGTCAT TTATACCACC
TGGAGTGATG AATTTAAAGG GGCTATTAAA AAAACCGACC TGAAATCGGG TAAAACAGTT
ACGCTGACCA CCGAAAAAGG CTTTTATTAT TCTCCTTCTT TTTCCAATAA AGGCGATAAG
ATCGTGTTCC GAAAAGGAGT GGGCAACGAC GTGCTTGGTT ATGCTTTTGG CCGGGGAACA
GGAATTTTTA CCATGCCCGC CAGTGGTGGC CCGAAAACAC TGATCACCGA GAATGGCATC
AGGCCCAAAT TCAATGCCGA CGACAGCAGG ATCTATTTTC AATCCAGCGA AGACGGAAAA
AAGGCATTCA AAAGCATAGA CCTGAATGGC GCCAACGAAC GGACGCACTA TACTTCAACA
TACGCCACAC AATTTGCACC CAGTCCGGAT GGCAAATGGA TGGCCTTTAC AGAGCTTTTC
AACATTTACG TTACACCGAT GGTAACTACG GGCAGGCCGC TCGACCTGTC GGCCGGCAAC
AAAACCATAC CACTTACCCG CATCACCAAA GATGCCGGCA CCTATATCCA CTGGAGCCAC
GACAGCAATA AATTGTTCTG GACACTGGGC GAACAGTATT TTAGCCGTGA TGTAAAAACA
GCATTCAACT TTACCGATGG CTCAACAGCG CAGATCAAAG CACCCGACAG TACTGGCCTG
GCCATAGGCT TAAAACTAAA GACCGATGTT CCAACCGGCC TCACCGCACT TACAGGTGCC
AGGATCATCA CCATGAAAGG CGATGAGGTG ATAGAGAACG GTGCTATACT CATTGAAAAC
AATAAAATTG TATCCATTGG CAAAAACCTG CCCCTGCCAG CAAACACAAG GGTAATTGAT
GTAACGGGAA AAACCATTAT GCCGGGTATG GTAGATGTGC ATGCACATCT GCGCAGCAGT
CCGGATGGCA TTACACCGCA GCAGGACTGG TCGTACCTGG CCAACCTGGC CTTTGGGGTA
ACCACCTCGC ACGACCCCTC CAGCAATACA GAAATGGTGT TCAGTCAGTC GGAAATGATC
AAAGCAGGCC GGCTTACCGG CCCCAGGTTA TATTCTACAG GCTCCATCCT CTATGGTGCC
GATGGCGATT TTAAGGTAGT CATCAACAGC CTGGACGATG CCCTGTCTAA CCTGAGAAGA
CTAAAAGCAG TGGGTGCCTT TTCTGTAAAA TCCTACAACC AGCCCCGTCG CGATCAGCGT
CAGCAGATCA TCGAGGCTGC AAGGCAATTA AAAATGATGG TTGTGCCAGA AGGCGGCTCT
ACCTTTTTTA CCAATATGAA CATGATTGCC GACGGGCATA CGGGCATTGA ACACAGCATT
CCGGTTGCAC CTGTTTTTAA AGATGTAACC ACCTTCTGGA ACAAAACTGA GGTAGCTTAT
ACCCCTACAC TCATTGTTAG CTATGGCAGC CAGTGGGGAG AAAATTACTG GTACGACCGT
ACCAATGTAT GGGAAAATGA GCGCCTGATG GCCTTTACCC CCAGGTCCAT CATAGACCCG
CGGGCCAGGA GAAGAACAAC GTCAGAATAT GGCGATTACG GTCATATTGA AGTGGCCAAA
ACGGCCAGGC AGATTGCCGA GGGTGGCACC AAAGTAAATT TAGGCGCGCA TGGGCAGATC
CAGGGACTGG GTGCACACTG GGAACTCTGG ATGCTGGCAC AGGGTGGTAT GACCCCCTTA
CAGGCCATCA GGTGTGCCAC CATAAACGGC GCGGCCTATC TGGGTATGGA CAAAGAGATC
GGCTCACTGG AAATCGGAAA ACTGGCCGAT CTGATCGTGA TGGATGCCAA TCCGCTGGAC
GACATCAGGA ATTCGGAAAA AATTAAATAC GTAATGATCA ACGGCCGTAT TTTTGACAGC
CTGTCTATGA ATGAAATAGG CAACCGCGAA AAGGTACGGG GTAAATTGTG GTTTGAGACT
GGAAAGGGAA TGGTTTACAC CTTCCCGACC GGCAATGCCG AAACCTGGAC TTATACCATT
CCCAATTGCG AATAA
 
Protein sequence
MRTSLLLLIL VFSFFQSSFS QSPTPGPDKQ WDIEKYKGNT KTFTFNTDEG TWMNLDVSPD 
GKDIVFDLLG DLYIMPISGG SAVLLSSGPA WDIQPRFSPN GRYISYTSDK SGADNIWIMN
RDGSGKRQIS KETFRLLNNA TWMPNSEYLI ARKHFTGTRS LGAGEMWMYS IYGGEGVQLT
KRKNDQQDAG EPNVSPDGKY LYFSEDMSPG PNFEYSKDPN GMIYAIRQLD LTSGKITSLI
AQPGGAARPQ VSPDGKMMAY VKRMRLKSVL VLQNLQTGEE WPIYEDLSHD QQETWAIFGV
YPNYAWTPDG KSIIFYARGK IKKIDLNSLF TNNLPFHVTG SQTIQQALHY QQQVFSNDFT
VKMIRQLNTS ADGKFVVFNA AGFLYKKDLP NGLPERVSKG IDFEFEPEIS ADGKSVIYTT
WSDEFKGAIK KTDLKSGKTV TLTTEKGFYY SPSFSNKGDK IVFRKGVGND VLGYAFGRGT
GIFTMPASGG PKTLITENGI RPKFNADDSR IYFQSSEDGK KAFKSIDLNG ANERTHYTST
YATQFAPSPD GKWMAFTELF NIYVTPMVTT GRPLDLSAGN KTIPLTRITK DAGTYIHWSH
DSNKLFWTLG EQYFSRDVKT AFNFTDGSTA QIKAPDSTGL AIGLKLKTDV PTGLTALTGA
RIITMKGDEV IENGAILIEN NKIVSIGKNL PLPANTRVID VTGKTIMPGM VDVHAHLRSS
PDGITPQQDW SYLANLAFGV TTSHDPSSNT EMVFSQSEMI KAGRLTGPRL YSTGSILYGA
DGDFKVVINS LDDALSNLRR LKAVGAFSVK SYNQPRRDQR QQIIEAARQL KMMVVPEGGS
TFFTNMNMIA DGHTGIEHSI PVAPVFKDVT TFWNKTEVAY TPTLIVSYGS QWGENYWYDR
TNVWENERLM AFTPRSIIDP RARRRTTSEY GDYGHIEVAK TARQIAEGGT KVNLGAHGQI
QGLGAHWELW MLAQGGMTPL QAIRCATING AAYLGMDKEI GSLEIGKLAD LIVMDANPLD
DIRNSEKIKY VMINGRIFDS LSMNEIGNRE KVRGKLWFET GKGMVYTFPT GNAETWTYTI
PNCE