Gene Phep_4175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4175 
Symbol 
ID8255310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5048294 
End bp5049604 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content45% 
IMG OID644937840 
Productamidohydrolase 
Protein accessionYP_003094428 
Protein GI255534056 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000266471 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000965062 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAAGT ACATTTTAAT CCTCTTACCA TTAGCCATTA GCCTGAACTG TCTGGCGCAG 
GCCAATATTT CTCCGGCAAA ACCACAGGAT ACAAGAACAG TGATTATGGG GGCAAAACTA
CACATCGGTA ACGGACAGGT TGTAGAAACC GGATACCTGA TATTTGATAA AGGAAAAATT
ACCGGTGTAG GAGATGCTAC AGTTGCCCGT ATTGACCTTA CAGGCGCCAT AGTAATTACC
GCAAATGGTA AACAGGTATA TCCGGGTTTC ATAGCGCCGG TTACCAATCT CGGACTGGTA
GAGATCAGTT CAGTAAAGGC AACATTGGAT TATAATGAAA TTGGCGAACT GAACCCACAC
ATCAGGGCCC TGGTAGCCTA TAATACAGAT TCAAAAGTAC CCGCTACCAT AAGGAGCAAT
GGTGTGCTGA TGGCTCAGAT TACACCTCAG GGTGGCACCT TGTCGGGCAG CTCATCAGTT
GTACAGCTGG ATGCCTGGAA CTGGGAAGAT GCAGCAATAA GAAAAGACGA TGCCCAGCAT
TTAAACTGGC CGGTTACCTC AAAATTCAGC AGTTCGGGAA ACAGGGCCAT GGCCCAGGCC
GAAGTTTTTA AAGAGCGCAC ACAGCAGGCG ATTAATGACC TTGAACAGCT CTTTGCAGAA
GCAAAAGCTT ATGCCGAAAC AGATAAACCA GCAGTGGTAA ATGCCCGTCT GGCTGCCATG
AGGCACCTGT TTGATGGTTC GCAAAAACTG TTTATCCATG CAAATGCAGA GAAAGACATC
ATTACTGCAG TAAAATTTGC AAAAAAATAT GGCATTACAC CGGTCCTGGT TGGTGGAGAT
GAAGCCTATC TGGCTATCCC CTTCTTAAAA GAGAACAATA TTACGGTGGT GGTAAAGGAG
CCACACAATT TGCCCAATAA CAGCGACGAC GATGTAAACC TGCCCTATAA AAATGCAGGT
TTACTGGCCA ATGCCGGAAT CAATGTAGTG ATGAGCCTGC ACAGCTACTG GCAACTGCGG
AACCTGCCTT TTATGGCAGG AACAATAACG GCCTGGGGTC TTGACAAAGA AAAAGCTTTA
CAAACCATTA CCTTAAACAC CGCTAAAGCA TTAGGTATTG AAAAGATTGC GGGTAGCCTG
GAAATCGGTA AGGATGCGAC TTTCTTTATT TCGTCCGGGG ATGCGTTGGA CATGAAGACC
AATAAAGTGG AAAGAGCGTT TATCCAGGGC AGGGATATCA ATCTGGATAA TCTTCATAAA
CAATTAGACA AAAAATTCAG TGACAAGTAC CTGCTGAAAA GCTTAAAATA A
 
Protein sequence
MNKYILILLP LAISLNCLAQ ANISPAKPQD TRTVIMGAKL HIGNGQVVET GYLIFDKGKI 
TGVGDATVAR IDLTGAIVIT ANGKQVYPGF IAPVTNLGLV EISSVKATLD YNEIGELNPH
IRALVAYNTD SKVPATIRSN GVLMAQITPQ GGTLSGSSSV VQLDAWNWED AAIRKDDAQH
LNWPVTSKFS SSGNRAMAQA EVFKERTQQA INDLEQLFAE AKAYAETDKP AVVNARLAAM
RHLFDGSQKL FIHANAEKDI ITAVKFAKKY GITPVLVGGD EAYLAIPFLK ENNITVVVKE
PHNLPNNSDD DVNLPYKNAG LLANAGINVV MSLHSYWQLR NLPFMAGTIT AWGLDKEKAL
QTITLNTAKA LGIEKIAGSL EIGKDATFFI SSGDALDMKT NKVERAFIQG RDINLDNLHK
QLDKKFSDKY LLKSLK