Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2247 |
Symbol | |
ID | 8253353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2607917 |
End bp | 2609398 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644935896 |
Product | sulfatase |
Protein accession | YP_003092513 |
Protein GI | 255532141 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAG CTACAGTATT ATTTTTTACC CTTTCTGTCC TTATTTTATT TAGCAGTCAT AAATATGTTC CGGCACCTAC CGCAAAACCT TATAACGTGC TTTTTATTTT TGTTGACGAC CTTCGTCCCG ATCTGGGCTG TTATGGTAAC CGCATTATAA AATCTCCCCA TATAGATGCC TTGGCTGCAC AATCGGTTCT TTTTAAGCAA CAATTTGTAA CAGTACCTAC CTGTGGGGCT TCAAGGGCCA GTATACTTAC AGGTTTAAGG CCCCGTTCAG TAAATGATCT TTCCAATGAG GCTTTTGAAC TTAAACCAAA AAGCCAGAAT ATACCCGAAT CTTTTATTGC GTTACTTAGA CAGCAAGGAT ATTATACGGT AGGGATAGGT AAAATAAGTC ACTCACCCGA TGGTTATGTA TATAAATACC TGGAACCAAA AAGTTCACAA ATGGAACTGG AAAGAAGCTG GGATGAAATG CTTTTTAATG CAGGTAAATG GAAAACAGGG TGGAATGCCT TTTTTGGCTA TGCCGATGGC AACAACAGAA ATGAATTAAA AGGCGAAGTA AAACCTTATG AACATGCGCC TGTAAGTGAC AGCAACTACC CCGATGGCTT AACAGCCGAA ATGGCAGTCA GCAAGTTAAA AGAACTGAGT ACAAAAGAGA AACCTTTTTT TTTGGGTGTA GGTCTGTTTA AGCCCCATCT ACCATTTACT GCGCCGCAGA AGTATTGGGA TTTATATCAG GAGGCCGACA TCAGCTTAAC ACCATCACCA GATATACCAG TAGATGTTAA TCCTGTCAGT TTGCAGGAAA GCGGGGAGTT TAACGGGTAT AAAAAGGGGG AAGAAAGAGC CTCACTGGCC AAGCCTGTAT CTGATGCTTA TGCCCGTAAA CTTCGTCATG CCTATTATGC TGCAGTAAGT TATTCAGATG CCCAGATAGG TAAAATACTG GATGAACTGA AGCGAAGCGG AAAGGATAAA AATACCATTG TGGTATTATG GGGTGATCAT GGCTGGCACC TGGGCGACGA CCGCGTTTGG GGTAAACATA CGCTATCTGA ATGGGCCTTG CACAGTCCCC TGATCATAAA GGTACCTGGT TTGCCCCAGG CCATAAACAA TAATGTGGTG AGCGCTGTAG ACGTGTATCC TACTTTAATG GAACTCTGCG GAATAAAGAA GCCAGCGCAT ATTGACGGGA CAAGTCTGGT ACCTGCATTA AAAAATCCGC TTGCCAGTTC AGCGGGCGGT ATAGCCTACA GTTATTTTAA GAAAGGGATC AGTCTGCGTA CAGACCGTTA CCGTTTAACA AAATACTTCC GGGCCGCAAT GCCTGCAATT GAATTATACG ATCACCAGAC AGATCCTTAT GAAAATAAGA ACATAGCAGC ACAGCAGCCG GAACTGGTTA AACAATTGAT GGTTTTACTG GAAAAAGGTA ATACCGGTTT ATACAACAAG CCGGTAAATT GA
|
Protein sequence | MKRATVLFFT LSVLILFSSH KYVPAPTAKP YNVLFIFVDD LRPDLGCYGN RIIKSPHIDA LAAQSVLFKQ QFVTVPTCGA SRASILTGLR PRSVNDLSNE AFELKPKSQN IPESFIALLR QQGYYTVGIG KISHSPDGYV YKYLEPKSSQ MELERSWDEM LFNAGKWKTG WNAFFGYADG NNRNELKGEV KPYEHAPVSD SNYPDGLTAE MAVSKLKELS TKEKPFFLGV GLFKPHLPFT APQKYWDLYQ EADISLTPSP DIPVDVNPVS LQESGEFNGY KKGEERASLA KPVSDAYARK LRHAYYAAVS YSDAQIGKIL DELKRSGKDK NTIVVLWGDH GWHLGDDRVW GKHTLSEWAL HSPLIIKVPG LPQAINNNVV SAVDVYPTLM ELCGIKKPAH IDGTSLVPAL KNPLASSAGG IAYSYFKKGI SLRTDRYRLT KYFRAAMPAI ELYDHQTDPY ENKNIAAQQP ELVKQLMVLL EKGNTGLYNK PVN
|
| |