Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1803 |
Symbol | |
ID | 8252906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2100989 |
End bp | 2104147 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644935454 |
Product | amidohydrolase |
Protein accession | YP_003092074 |
Protein GI | 255531702 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00000234818 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAAAC TGATCAATTT TATAGCATTA CTCTGTTTGC TTTTTGCAGG TAATGCAGGT TTAGCGCAAC TTCCGATTCA GGCGGAGCGT GCCGTTTCCT TTACGACAAA AGAAGGAAGT AACATGAGCG TGGACCTTTC TCCTGATGGT AAGACCGTTG TTTTTTGTTT GCTGGGAGAT TTGTATACGG TATCGTCAAA AGGTGGAATT GCTACACAAA TTACACGAGG AATTGCGATT AATGACTTGC CTGTTTGGAG CCCTGACGGA AAGAGGATCG CCTACATCAG CGATAGATCA GGTGATGACC GTCTTACCGT CAGGAATGTG TCAGGCAACG CGATTCAAAC CTTTGAGGGG AAATTGCCAG GAGTTCCGGT TTGGTTTGGG CCAAATGATT GGGTTACTAC CTCCAATGAT TTTGACAGAC ACTATCCTTT GTACCATTTG ACAGGAGGTG AGGTTGACGA TTCTAAAAAC ATTTCCAATG TTGTGGGGTT CTCATCAGAC TACAAATTTA TTTATTACAT GCATAGAGAA GCATCAAATA GCTTAGTTAT TTATCAGCAT GCAAAATCCA GGGGCGAAGA AAAAATATTG ATTGAACTAC AGGGGATTGC CGCAAAGGCT GCAAAGCGGA TCAAAGTATC GCAGGATTGT AATTGGTTAA GCTATTTAAT GACAGAAGGT GTATGGTGCA GTTTAAGGCT GGTTGATCTG TCGTCAAAAA AAGAACGGGT ACTTGCCAGA TGGGAGCAAC ATGTTCCTGG AATCGGTAAC AGTTTACCTA ACTATAATTT TTCGGGTGAT TCAAAAAAGA TCCTGATCGG TTATGGGGGA AAGATCCATA TGATTGAGAT AAGAACAGGC AAAGATGAAA TTATCCCCTT TACTGCCAAC GTAAAGGTAG ATATGGGAAA GCCTAATACT GCTACGTTTA AAGTTTCTCA GGATTCGCTG CAGGTCAAGT ATATGCGTTC GGCCTGCGCA AGCCCTGAAG GCAGGCAATT GGTGTTTTCT GCGTTGAACC GGATCTATAT CATGGATTTA CCCGGAGGTA AGCCCCGGAT ATTGGTGAAA CAGCCTTTTA GCCAGTTTCA GCCGGCATGG TCGGCGGATG GGCAATGGAT TACTTTTGTA AGCTGGAGCG ATGCTGAGTT TGGACAGGTC TGGAAAGTGG ATAAAAATGG TGACAGTCTA ACACAAATAT CACATAAGGC CGGGGTTTAC CATTATCCAA ACTGGTCTCC TGATGGAAAA TCTATTGCGG TTACAAAAGG GCGTAAAGTG TGGCAGGGTA AGCCGATGCT TGGGGACAGA GATGGGCCTG GAATCGGTCA ATTAATTACC CTTGAATTGC AAAATGGGAA CCAAAAAGTG ATTGCAGATA GTGTTCCACT TTCTAACAGA ACTACGTTTT CAGCAAATGG TGAAGGGCTC ATTTATGCAC CCTCAAGGGT AGGAAAGAGT GGGGTATTTC CATTTTTGGT ATCCAAAGAT CAGGAAGGGA AAGTGAATGT TTTAGCAACC GCAAGATATG AGGGAATAGG TAGTGAATTA TTTTTGCGTC AGATTATTCA ATCACCGGAT GGCAGATATT TCGTATACCT GAATGAAGAA AATTTGCATT TGGTTCCTGT TGATCCTTCA GGAGCACCGA CAATATTGTA TGATACCGAA AAAAAAAATC CTGTAATACG TTTTGCCAAG GGTGGTTTTG ATCCGCACTG GGAAAAGGGG GGGGAGGTGT TGAGCTGGTC TTTCGCCAAC CAATATTTCC GGATTGACCC GGATATGATT GTTGCAGCAG CAATTGCAGC TGCCGGGCAA CGAAAAAAAA TGGGTTTGGC TGAATCGGGA ATACTGGATG TAGAAATTGT TCCGGACGAG AGTATTGACA TTAACCTTAA GGTTGCTCAA CAAGTGGCTA ATGGGATGCT GGCTTTAAAA AATGCCAGGA TCATTACGGC CAGGGAAAAT GAAGTTATTG AAAATGGCAC CATTTTAATT CGTGACGGAC GTTTTGTAGC CGCAGGTAAA AACGCGGAAG TAAATATTCC GCCGGGTACA AAAGTTATGG ATATGCTGGG AAAAACGATT ATGCCCGGCC TGATAGACTT ACATGATCAC CTGCGCCCGC CAGCAGAAGT TTTTCCTCAG CAACCATGGA GTTTTTTTGC AGGACTGGCT TATGGTGTAA CCACCGCGAG GGAACCTTCT GGAAGCCATG ATTCTTTTGG GTATGAGGAA TTGTTGAAAA CCGGACAGAT GACTGGCCCG AGGTTTTTTA ATGTAGGTTA TGCAGTTAGG GAAGATAGGT ACCCGAATAT GAATGACCTG AACGAAGCCT ATATTATTGC CCAAAACCGT AAACGTATGG GGGCTATAGC GGTTAAGCAG TATGCGCAAC CGACTCGCTT AAAACGACAG TTGTTATTAC TGGCCTGCGA GCAGGCAGGG CTAAATATGA CCAATGAAGT TGAAAAAGAT ATGCGAGGGT TTATCGGCCA CATCAAAGAC AGTACTTTCG GTATAGAGCA CAACCCGCTA TGGGGTGAAG TGTATAATGA TGTCATCCAG CTGATCGCAA AATCTGGTGT TTACTTAACA CCAACTTTGC AGGTGGCTTA TGGAACTGAG CTGGGAAGGA ACCATTTTTT AGAAAAATAT GCTCAGCCTG ATGCAAAAAT GAAGCGTTTT TATCCGGAAG AAGAAATAAA ACGCCGCCAG GAAGAGCTGA AAAAGCTGAA AATTTACGCA GAAGCACATC AGGAGCTGCC TTCATTTGTG AACCAAAGTA AAGTTGATGC TGCCATCCGT CATGCAGGCG GGAGGGTTAC TATGGGTAGC CATGGCAATG ACCCGGCATT GGGTGCTCAT TTTGAAATTT GGGCGCTGCA AATGGGGGGA CTGACGAACC TGGAAGCGAT ACAGGCCGCG ACGATTATGG CTGCGGGCGG CCTGGGGATG CAGGAAGATC TGGGTTCTAT AGAGCCAGGA AAGATTGCTG ACCTGATTAT TTTGAATAAA AATCCTTTAG ACAATATCAG GAACACCATG GAAATACAAA GCGTAATGAA AGACGGCGTT TTGTATGATG GCAATACACT GGATGAAATA TGGCCAAAGG CTAAGAAATT CCAAACGATT AAAAACTAA
|
Protein sequence | MMKLINFIAL LCLLFAGNAG LAQLPIQAER AVSFTTKEGS NMSVDLSPDG KTVVFCLLGD LYTVSSKGGI ATQITRGIAI NDLPVWSPDG KRIAYISDRS GDDRLTVRNV SGNAIQTFEG KLPGVPVWFG PNDWVTTSND FDRHYPLYHL TGGEVDDSKN ISNVVGFSSD YKFIYYMHRE ASNSLVIYQH AKSRGEEKIL IELQGIAAKA AKRIKVSQDC NWLSYLMTEG VWCSLRLVDL SSKKERVLAR WEQHVPGIGN SLPNYNFSGD SKKILIGYGG KIHMIEIRTG KDEIIPFTAN VKVDMGKPNT ATFKVSQDSL QVKYMRSACA SPEGRQLVFS ALNRIYIMDL PGGKPRILVK QPFSQFQPAW SADGQWITFV SWSDAEFGQV WKVDKNGDSL TQISHKAGVY HYPNWSPDGK SIAVTKGRKV WQGKPMLGDR DGPGIGQLIT LELQNGNQKV IADSVPLSNR TTFSANGEGL IYAPSRVGKS GVFPFLVSKD QEGKVNVLAT ARYEGIGSEL FLRQIIQSPD GRYFVYLNEE NLHLVPVDPS GAPTILYDTE KKNPVIRFAK GGFDPHWEKG GEVLSWSFAN QYFRIDPDMI VAAAIAAAGQ RKKMGLAESG ILDVEIVPDE SIDINLKVAQ QVANGMLALK NARIITAREN EVIENGTILI RDGRFVAAGK NAEVNIPPGT KVMDMLGKTI MPGLIDLHDH LRPPAEVFPQ QPWSFFAGLA YGVTTAREPS GSHDSFGYEE LLKTGQMTGP RFFNVGYAVR EDRYPNMNDL NEAYIIAQNR KRMGAIAVKQ YAQPTRLKRQ LLLLACEQAG LNMTNEVEKD MRGFIGHIKD STFGIEHNPL WGEVYNDVIQ LIAKSGVYLT PTLQVAYGTE LGRNHFLEKY AQPDAKMKRF YPEEEIKRRQ EELKKLKIYA EAHQELPSFV NQSKVDAAIR HAGGRVTMGS HGNDPALGAH FEIWALQMGG LTNLEAIQAA TIMAAGGLGM QEDLGSIEPG KIADLIILNK NPLDNIRNTM EIQSVMKDGV LYDGNTLDEI WPKAKKFQTI KN
|
| |