Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4175 |
Symbol | |
ID | 8255310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 5048294 |
End bp | 5049604 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644937840 |
Product | amidohydrolase |
Protein accession | YP_003094428 |
Protein GI | 255534056 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000266471 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000965062 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAAGT ACATTTTAAT CCTCTTACCA TTAGCCATTA GCCTGAACTG TCTGGCGCAG GCCAATATTT CTCCGGCAAA ACCACAGGAT ACAAGAACAG TGATTATGGG GGCAAAACTA CACATCGGTA ACGGACAGGT TGTAGAAACC GGATACCTGA TATTTGATAA AGGAAAAATT ACCGGTGTAG GAGATGCTAC AGTTGCCCGT ATTGACCTTA CAGGCGCCAT AGTAATTACC GCAAATGGTA AACAGGTATA TCCGGGTTTC ATAGCGCCGG TTACCAATCT CGGACTGGTA GAGATCAGTT CAGTAAAGGC AACATTGGAT TATAATGAAA TTGGCGAACT GAACCCACAC ATCAGGGCCC TGGTAGCCTA TAATACAGAT TCAAAAGTAC CCGCTACCAT AAGGAGCAAT GGTGTGCTGA TGGCTCAGAT TACACCTCAG GGTGGCACCT TGTCGGGCAG CTCATCAGTT GTACAGCTGG ATGCCTGGAA CTGGGAAGAT GCAGCAATAA GAAAAGACGA TGCCCAGCAT TTAAACTGGC CGGTTACCTC AAAATTCAGC AGTTCGGGAA ACAGGGCCAT GGCCCAGGCC GAAGTTTTTA AAGAGCGCAC ACAGCAGGCG ATTAATGACC TTGAACAGCT CTTTGCAGAA GCAAAAGCTT ATGCCGAAAC AGATAAACCA GCAGTGGTAA ATGCCCGTCT GGCTGCCATG AGGCACCTGT TTGATGGTTC GCAAAAACTG TTTATCCATG CAAATGCAGA GAAAGACATC ATTACTGCAG TAAAATTTGC AAAAAAATAT GGCATTACAC CGGTCCTGGT TGGTGGAGAT GAAGCCTATC TGGCTATCCC CTTCTTAAAA GAGAACAATA TTACGGTGGT GGTAAAGGAG CCACACAATT TGCCCAATAA CAGCGACGAC GATGTAAACC TGCCCTATAA AAATGCAGGT TTACTGGCCA ATGCCGGAAT CAATGTAGTG ATGAGCCTGC ACAGCTACTG GCAACTGCGG AACCTGCCTT TTATGGCAGG AACAATAACG GCCTGGGGTC TTGACAAAGA AAAAGCTTTA CAAACCATTA CCTTAAACAC CGCTAAAGCA TTAGGTATTG AAAAGATTGC GGGTAGCCTG GAAATCGGTA AGGATGCGAC TTTCTTTATT TCGTCCGGGG ATGCGTTGGA CATGAAGACC AATAAAGTGG AAAGAGCGTT TATCCAGGGC AGGGATATCA ATCTGGATAA TCTTCATAAA CAATTAGACA AAAAATTCAG TGACAAGTAC CTGCTGAAAA GCTTAAAATA A
|
Protein sequence | MNKYILILLP LAISLNCLAQ ANISPAKPQD TRTVIMGAKL HIGNGQVVET GYLIFDKGKI TGVGDATVAR IDLTGAIVIT ANGKQVYPGF IAPVTNLGLV EISSVKATLD YNEIGELNPH IRALVAYNTD SKVPATIRSN GVLMAQITPQ GGTLSGSSSV VQLDAWNWED AAIRKDDAQH LNWPVTSKFS SSGNRAMAQA EVFKERTQQA INDLEQLFAE AKAYAETDKP AVVNARLAAM RHLFDGSQKL FIHANAEKDI ITAVKFAKKY GITPVLVGGD EAYLAIPFLK ENNITVVVKE PHNLPNNSDD DVNLPYKNAG LLANAGINVV MSLHSYWQLR NLPFMAGTIT AWGLDKEKAL QTITLNTAKA LGIEKIAGSL EIGKDATFFI SSGDALDMKT NKVERAFIQG RDINLDNLHK QLDKKFSDKY LLKSLK
|
| |