Gene Phep_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1803 
Symbol 
ID8252906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2100989 
End bp2104147 
Gene Length3159 bp 
Protein Length1052 aa 
Translation table11 
GC content44% 
IMG OID644935454 
Productamidohydrolase 
Protein accessionYP_003092074 
Protein GI255531702 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000234818 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAAAC TGATCAATTT TATAGCATTA CTCTGTTTGC TTTTTGCAGG TAATGCAGGT 
TTAGCGCAAC TTCCGATTCA GGCGGAGCGT GCCGTTTCCT TTACGACAAA AGAAGGAAGT
AACATGAGCG TGGACCTTTC TCCTGATGGT AAGACCGTTG TTTTTTGTTT GCTGGGAGAT
TTGTATACGG TATCGTCAAA AGGTGGAATT GCTACACAAA TTACACGAGG AATTGCGATT
AATGACTTGC CTGTTTGGAG CCCTGACGGA AAGAGGATCG CCTACATCAG CGATAGATCA
GGTGATGACC GTCTTACCGT CAGGAATGTG TCAGGCAACG CGATTCAAAC CTTTGAGGGG
AAATTGCCAG GAGTTCCGGT TTGGTTTGGG CCAAATGATT GGGTTACTAC CTCCAATGAT
TTTGACAGAC ACTATCCTTT GTACCATTTG ACAGGAGGTG AGGTTGACGA TTCTAAAAAC
ATTTCCAATG TTGTGGGGTT CTCATCAGAC TACAAATTTA TTTATTACAT GCATAGAGAA
GCATCAAATA GCTTAGTTAT TTATCAGCAT GCAAAATCCA GGGGCGAAGA AAAAATATTG
ATTGAACTAC AGGGGATTGC CGCAAAGGCT GCAAAGCGGA TCAAAGTATC GCAGGATTGT
AATTGGTTAA GCTATTTAAT GACAGAAGGT GTATGGTGCA GTTTAAGGCT GGTTGATCTG
TCGTCAAAAA AAGAACGGGT ACTTGCCAGA TGGGAGCAAC ATGTTCCTGG AATCGGTAAC
AGTTTACCTA ACTATAATTT TTCGGGTGAT TCAAAAAAGA TCCTGATCGG TTATGGGGGA
AAGATCCATA TGATTGAGAT AAGAACAGGC AAAGATGAAA TTATCCCCTT TACTGCCAAC
GTAAAGGTAG ATATGGGAAA GCCTAATACT GCTACGTTTA AAGTTTCTCA GGATTCGCTG
CAGGTCAAGT ATATGCGTTC GGCCTGCGCA AGCCCTGAAG GCAGGCAATT GGTGTTTTCT
GCGTTGAACC GGATCTATAT CATGGATTTA CCCGGAGGTA AGCCCCGGAT ATTGGTGAAA
CAGCCTTTTA GCCAGTTTCA GCCGGCATGG TCGGCGGATG GGCAATGGAT TACTTTTGTA
AGCTGGAGCG ATGCTGAGTT TGGACAGGTC TGGAAAGTGG ATAAAAATGG TGACAGTCTA
ACACAAATAT CACATAAGGC CGGGGTTTAC CATTATCCAA ACTGGTCTCC TGATGGAAAA
TCTATTGCGG TTACAAAAGG GCGTAAAGTG TGGCAGGGTA AGCCGATGCT TGGGGACAGA
GATGGGCCTG GAATCGGTCA ATTAATTACC CTTGAATTGC AAAATGGGAA CCAAAAAGTG
ATTGCAGATA GTGTTCCACT TTCTAACAGA ACTACGTTTT CAGCAAATGG TGAAGGGCTC
ATTTATGCAC CCTCAAGGGT AGGAAAGAGT GGGGTATTTC CATTTTTGGT ATCCAAAGAT
CAGGAAGGGA AAGTGAATGT TTTAGCAACC GCAAGATATG AGGGAATAGG TAGTGAATTA
TTTTTGCGTC AGATTATTCA ATCACCGGAT GGCAGATATT TCGTATACCT GAATGAAGAA
AATTTGCATT TGGTTCCTGT TGATCCTTCA GGAGCACCGA CAATATTGTA TGATACCGAA
AAAAAAAATC CTGTAATACG TTTTGCCAAG GGTGGTTTTG ATCCGCACTG GGAAAAGGGG
GGGGAGGTGT TGAGCTGGTC TTTCGCCAAC CAATATTTCC GGATTGACCC GGATATGATT
GTTGCAGCAG CAATTGCAGC TGCCGGGCAA CGAAAAAAAA TGGGTTTGGC TGAATCGGGA
ATACTGGATG TAGAAATTGT TCCGGACGAG AGTATTGACA TTAACCTTAA GGTTGCTCAA
CAAGTGGCTA ATGGGATGCT GGCTTTAAAA AATGCCAGGA TCATTACGGC CAGGGAAAAT
GAAGTTATTG AAAATGGCAC CATTTTAATT CGTGACGGAC GTTTTGTAGC CGCAGGTAAA
AACGCGGAAG TAAATATTCC GCCGGGTACA AAAGTTATGG ATATGCTGGG AAAAACGATT
ATGCCCGGCC TGATAGACTT ACATGATCAC CTGCGCCCGC CAGCAGAAGT TTTTCCTCAG
CAACCATGGA GTTTTTTTGC AGGACTGGCT TATGGTGTAA CCACCGCGAG GGAACCTTCT
GGAAGCCATG ATTCTTTTGG GTATGAGGAA TTGTTGAAAA CCGGACAGAT GACTGGCCCG
AGGTTTTTTA ATGTAGGTTA TGCAGTTAGG GAAGATAGGT ACCCGAATAT GAATGACCTG
AACGAAGCCT ATATTATTGC CCAAAACCGT AAACGTATGG GGGCTATAGC GGTTAAGCAG
TATGCGCAAC CGACTCGCTT AAAACGACAG TTGTTATTAC TGGCCTGCGA GCAGGCAGGG
CTAAATATGA CCAATGAAGT TGAAAAAGAT ATGCGAGGGT TTATCGGCCA CATCAAAGAC
AGTACTTTCG GTATAGAGCA CAACCCGCTA TGGGGTGAAG TGTATAATGA TGTCATCCAG
CTGATCGCAA AATCTGGTGT TTACTTAACA CCAACTTTGC AGGTGGCTTA TGGAACTGAG
CTGGGAAGGA ACCATTTTTT AGAAAAATAT GCTCAGCCTG ATGCAAAAAT GAAGCGTTTT
TATCCGGAAG AAGAAATAAA ACGCCGCCAG GAAGAGCTGA AAAAGCTGAA AATTTACGCA
GAAGCACATC AGGAGCTGCC TTCATTTGTG AACCAAAGTA AAGTTGATGC TGCCATCCGT
CATGCAGGCG GGAGGGTTAC TATGGGTAGC CATGGCAATG ACCCGGCATT GGGTGCTCAT
TTTGAAATTT GGGCGCTGCA AATGGGGGGA CTGACGAACC TGGAAGCGAT ACAGGCCGCG
ACGATTATGG CTGCGGGCGG CCTGGGGATG CAGGAAGATC TGGGTTCTAT AGAGCCAGGA
AAGATTGCTG ACCTGATTAT TTTGAATAAA AATCCTTTAG ACAATATCAG GAACACCATG
GAAATACAAA GCGTAATGAA AGACGGCGTT TTGTATGATG GCAATACACT GGATGAAATA
TGGCCAAAGG CTAAGAAATT CCAAACGATT AAAAACTAA
 
Protein sequence
MMKLINFIAL LCLLFAGNAG LAQLPIQAER AVSFTTKEGS NMSVDLSPDG KTVVFCLLGD 
LYTVSSKGGI ATQITRGIAI NDLPVWSPDG KRIAYISDRS GDDRLTVRNV SGNAIQTFEG
KLPGVPVWFG PNDWVTTSND FDRHYPLYHL TGGEVDDSKN ISNVVGFSSD YKFIYYMHRE
ASNSLVIYQH AKSRGEEKIL IELQGIAAKA AKRIKVSQDC NWLSYLMTEG VWCSLRLVDL
SSKKERVLAR WEQHVPGIGN SLPNYNFSGD SKKILIGYGG KIHMIEIRTG KDEIIPFTAN
VKVDMGKPNT ATFKVSQDSL QVKYMRSACA SPEGRQLVFS ALNRIYIMDL PGGKPRILVK
QPFSQFQPAW SADGQWITFV SWSDAEFGQV WKVDKNGDSL TQISHKAGVY HYPNWSPDGK
SIAVTKGRKV WQGKPMLGDR DGPGIGQLIT LELQNGNQKV IADSVPLSNR TTFSANGEGL
IYAPSRVGKS GVFPFLVSKD QEGKVNVLAT ARYEGIGSEL FLRQIIQSPD GRYFVYLNEE
NLHLVPVDPS GAPTILYDTE KKNPVIRFAK GGFDPHWEKG GEVLSWSFAN QYFRIDPDMI
VAAAIAAAGQ RKKMGLAESG ILDVEIVPDE SIDINLKVAQ QVANGMLALK NARIITAREN
EVIENGTILI RDGRFVAAGK NAEVNIPPGT KVMDMLGKTI MPGLIDLHDH LRPPAEVFPQ
QPWSFFAGLA YGVTTAREPS GSHDSFGYEE LLKTGQMTGP RFFNVGYAVR EDRYPNMNDL
NEAYIIAQNR KRMGAIAVKQ YAQPTRLKRQ LLLLACEQAG LNMTNEVEKD MRGFIGHIKD
STFGIEHNPL WGEVYNDVIQ LIAKSGVYLT PTLQVAYGTE LGRNHFLEKY AQPDAKMKRF
YPEEEIKRRQ EELKKLKIYA EAHQELPSFV NQSKVDAAIR HAGGRVTMGS HGNDPALGAH
FEIWALQMGG LTNLEAIQAA TIMAAGGLGM QEDLGSIEPG KIADLIILNK NPLDNIRNTM
EIQSVMKDGV LYDGNTLDEI WPKAKKFQTI KN