Gene Phep_4272 details

Gene Information

Locus tag: Phep_4272
Symbol:
ID: 8255408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 5149902
End bp: 5151146
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 52%
IMG OID: 644937938
Product: imidazolonepropionase
Protein accession: YP_003094525
Protein GI: 255534153
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
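
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5149902
end_bp = 5151146
gene_length_bp = 1245
protein_length_aa = 414

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 5151146 - 5149902 + 1 = 1245, and 1245 / 3 - 1 = 414.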


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCC TGCTGCTCAT CAATATTGGC TGCCTTGTTG GCCTCCACCC CGGCAACACA 
AGGCAGTTAA AGGGCGCACA GCTGGGCGAA CTCCCGCTGA TGGAAAACGC CTGGTTGTTG
TGCGAAGACG GAAAGATCTC GGATTTTGGA AAGATGGGCA GCCTGCCTGC ACAATTGCCA
AATGCCCTGC ATACACACGA TGCTAAAAAG GGCTATGTCT TTCCTTCCTG GTGCGATTCA
CATTCCCATA TAGTTTTTGC GGCCTCCCGG GAAGAGGAAT TCGAGATGAA AATAGCCGGA
AAAAGCTATG AAGAGATCGC TGCTGCAGGA GGTGGCATCT TAAATTCTGC ACGTAAGCTG
CAGCACGCTT CAGCAGATGC GCTTTATGAT GCTGCAGCGC TGCGGATGAA CGACATGATC
AGGCAGGGTA CAGGCGCTGT AGAGATCAAG AGCGGCTATG GATTGACCAC AGCCAGCGAA
CTCAAAATGC TGAGGGTGAT CCGCAGATTA AAAGAATCTT TCCCCATTCC GGTTAAAGCT
TCCTTTTTAG CAGCACATGC CTATCCCCCG GAATATAAGA ACGACCATGC CGCTTACATT
AAGCTGATTA CCGATGAAAT GCTGCCCAGG ATTGCGGATG AAGGCCTGGC CGACTATATG
GACGTTTTTT GCGAGCAGGG CTTCTTTTCT GTGGCTGAAA CGGATGAGCT GCTTGCTGCA
GCAGCGGGTT ACGGACTAAA ACCTAAAATC CATGCCAACC AGTTATCGGT ATCGGGCGCG
GTACAGTTGG GGGTAAAGCA CCAGGCCGTG TCGGTAGACC ACCTGGAAGT TACAGATGAG
GCCGTCATCA GCAGTTTGCA AAACAGCCAT ACCATCGCTA CCTTATTGCC TTCCTGTTCT
TTTTACATCA ATATCCCCTA TGCCAACGCC AGGGGGCTGA TCAATGCCGA TATCCCTGTA
GCCATAGCCA GCGATTACAA CCCGGGCTCT ACCCCTTCCG GCAACATGAA CCTGGTCGTG
TCCCTGGCCT GCATCAAACT GCGGATGCAG CCCCGGGAAG CCATCAATGC GGCTACCCTG
AACGGGGCTG CGGCTATGGA GCTGAGCGGG GAAACCGGTA GCATTACCAA AGGTAAAAAA
GCCAACCTGT TCATCACCAG GCCCATGCCT TCCCTTGCCT TTCTGCCCTA TAGCTTCGGA
CAGTCGCAAA TAGAAAGCAT TATCCTTAAC GGAAAGATCT GCTGA
 
Protein sequence
MAALLLINIG CLVGLHPGNT RQLKGAQLGE LPLMENAWLL CEDGKISDFG KMGSLPAQLP 
NALHTHDAKK GYVFPSWCDS HSHIVFAASR EEEFEMKIAG KSYEEIAAAG GGILNSARKL
QHASADALYD AAALRMNDMI RQGTGAVEIK SGYGLTTASE LKMLRVIRRL KESFPIPVKA
SFLAAHAYPP EYKNDHAAYI KLITDEMLPR IADEGLADYM DVFCEQGFFS VAETDELLAA
AAGYGLKPKI HANQLSVSGA VQLGVKHQAV SVDHLEVTDE AVISSLQNSH TIATLLPSCS
FYINIPYANA RGLINADIPV AIASDYNPGS TPSGNMNLVV SLACIKLRMQ PREAINAATL
NGAAAMELSG ETGSITKGKK ANLFITRPMP SLAFLPYSFG QSQIESIILN GKIC
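
The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be normalized and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database, and only the first line of the gene sequence is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, used as a small demonstration.
first_line = "ATGGCTGCCC TGCTGCTCAT CAATATTGGC TGCCTTGTTG GCCTCCACCC CGGCAACACA "
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(round(gc_percent(seq), 1))   # GC percentage of this fragment only
```

To compare against the GC content field of the record, the full gene sequence would be passed through `clean_sequence` instead of a single line.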