Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0356 |
Symbol | |
ID | 8251441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 420679 |
End bp | 421686 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644934004 |
Product | amidohydrolase 2 |
Protein accession | YP_003090642 |
Protein GI | 255530270 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0473488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTA AAAAATCAAA AACGAAACCA TCCGCATCCC TGTCCCGGCG CAAGTTTATT GCCGGGACAG GCCTGCTGCT GGCCGGAACT GCCTTAAAAC TACAGGCGAA CCCGCTCTCT ACGCTCTCTA CGCATTCTCC GCTCGATTCA CTTTTTGCTG AGCCGATTAT TGACATCCAC CAGCATACCG ATTACATGGG GCGTACTACC GAACAGCTGA TTGCGCACCA GCGGGCCATG GGGATTACCA AAACCATTCT TTTGCCTTCA GGCCGTCCGG TTAATTCGGC CAGTACACAC TTTGGCGTAG GCAACGGCTT GCAGGCCAAA GCTACCGGAA ACACCATTTG CTACGGACTG GCAAAAAAAT ATCCTAAAGA ATTTTTGTTT GGTGCCAATG AGGTGCCCGA CCTGCCGGAT GCCCTTCAAG AAATTGAAAA ATATTTAAAG CTGGGTGCTT CTGTTATAGG GGAGTCTAAA TTTGGGGTAG AATGTGATTC GCCGGAGATG CAGAAGATCT ATGAGCTGGC ACAGGCCTAT AATGTTCCGG TGCTGATGCA CTGGCAGTAT GAAATGTACA ACTATGGCCT GGAGCGCTTT TATAAAATGC TGGAAAAATA TCCTAAGGTG AACTTCATCG GGCACTCCCA AACCTGGTGG GCTAATATTG ACAGGAACCA CCTGGACCAG AAGGTGCTTT ATCCGAAAAC AAAGGTGACC CCCGGCGGAT TGACAGACCA GCTGCTGAGC AATTACGACA ATATTTACGG CGACTTGTCG GCAGGTTCAG GACTAGGTTC CATGACCAGG GATGAGGACC ATGCGCGTGC CTTTATAGAA AAACATCAGG ACAGGTTGCT TTTTGGCAGC GACTGTACCG ATATAACCGG GCATATAGAC AATTGTTTCG GGGCAAAAAT AATTGCCGAG GTAAGGAAAT TATCGCCCAC AAAAAAAATA GAAAGAAAGA TCTTATACCA CAATGCAAAA AAACTGTTCC GCCTGTAA
|
Protein sequence | MDFKKSKTKP SASLSRRKFI AGTGLLLAGT ALKLQANPLS TLSTHSPLDS LFAEPIIDIH QHTDYMGRTT EQLIAHQRAM GITKTILLPS GRPVNSASTH FGVGNGLQAK ATGNTICYGL AKKYPKEFLF GANEVPDLPD ALQEIEKYLK LGASVIGESK FGVECDSPEM QKIYELAQAY NVPVLMHWQY EMYNYGLERF YKMLEKYPKV NFIGHSQTWW ANIDRNHLDQ KVLYPKTKVT PGGLTDQLLS NYDNIYGDLS AGSGLGSMTR DEDHARAFIE KHQDRLLFGS DCTDITGHID NCFGAKIIAE VRKLSPTKKI ERKILYHNAK KLFRL
|
| |