Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3852 |
Symbol | |
ID | 8254986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4622249 |
End bp | 4623655 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937516 |
Product | sulfatase |
Protein accession | YP_003094105 |
Protein GI | 255533733 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATAT ACCTTTTTAC CGTTTTATTG GTTTTAGCGA CAATCCATTT AAGTGCCCGC CAAAAGCCCA ATGTGATCTT TATTTTAGCA GATGACATGG GCTATGGCGA TTTAGGCTGT TACGGCCAAC AACTCATAGA AACGCCTAAC ATTGACAAGC TGGCTGCAAA TGGAATCCGA TTTACTCAAT TTTATGCTGG CACTTCGGTA TGTGCTCCAT CAAGGGCATC TTTAATGACC GGCTTACATA CCGGCCATAC GCCAATAAGG GGTAATCATG AAATTAAACC AGAAGGACAG CTACCCTTAC CCAAGGACAC CTATACCTTG GCCAGACTAT TTAAAGCTGC TGGTTATAAG ACCGAGGCAT TTGGTAAATG GGGACTGGGC TATCCAGGTT CTGAAGGCGA CCCGGTAAAA CAGGGCATAG ATCAGTTTTA TGGCTACAAT TGCCAGCGCC AGTCGCATAA CTTCTTTCCA GACCATTTAT GGGACAACGA AAAACGTGTT GAACTGGGCA ATACTTTAAG CCAGCAAACA CAATATGCCC CCGAACTGAT CCAGAAACAG GCTATGTCTT TTATGAAAGC AAATCAATCC AACCCTTTCT TTCTGTATCT GGCCTATACC CTGCCCCATG CAGCATTACA GTTACCCAAA AACGACCAGG CATTTGAATA CTATAAAAAG AAATTTAAGG AACAGCCCAA GCCTGTAAAA GAAAACTGGG ACGGCATTGC TTATCAACCG CAGCCTTACC CACACGCAGC TTACGCAGCT ATGGTAAGCA AACTGGACAA TTATGTAGGT GAAGTAGTAA AACAGCTCAA GGCACTTGAC CTGGAAAAAC AAACGCTGAT CGTTTTTACC AGTGACAATG GTCCGCACAA TGAAGGTGGA AATGAACCTG CTTTTTTTAA CAGCAGTGCT GGCTTTAAAG GAATAAAAAG ACAGCTTACG GAAGGTGGGA TTAGAGAACC AATGATTGTT AGCTGGCCAG GCAAGATCAA AGCCGGGCAG AGCTCGGCAC ATATTGGTGC ATTCTGGGAT TTTATGCCAA CTTTTGCAGA ACTGACGGCT CAACCCCTGC CTGTTAAAAC AGATGGATTA TCCATACTAC CTGTATTGCT AAATAAAGGC ACACAAAAAC AACATGATTT TTTATACTGG GAGTTTCATG AACAGGGCGG AAGACAAGCC CTGAGAATGG GTAAATGGAA GGCCATCCGG GAAAAGGTTA AACAGGATGC AAATGGCCCA ATTTTGTTAT ATGATCTGGA TATTGATCCA AAAGAGCGCA ACGACCTGGC TGCCAAACAT CCGGAAGTGG TAAAAAAAGC GGCATTGCTT ATGCAGCATG AGCATGTAGA AAACAAGGAT TTCCCACTAA TTTCAAATAA ATTTTAA
|
Protein sequence | MRIYLFTVLL VLATIHLSAR QKPNVIFILA DDMGYGDLGC YGQQLIETPN IDKLAANGIR FTQFYAGTSV CAPSRASLMT GLHTGHTPIR GNHEIKPEGQ LPLPKDTYTL ARLFKAAGYK TEAFGKWGLG YPGSEGDPVK QGIDQFYGYN CQRQSHNFFP DHLWDNEKRV ELGNTLSQQT QYAPELIQKQ AMSFMKANQS NPFFLYLAYT LPHAALQLPK NDQAFEYYKK KFKEQPKPVK ENWDGIAYQP QPYPHAAYAA MVSKLDNYVG EVVKQLKALD LEKQTLIVFT SDNGPHNEGG NEPAFFNSSA GFKGIKRQLT EGGIREPMIV SWPGKIKAGQ SSAHIGAFWD FMPTFAELTA QPLPVKTDGL SILPVLLNKG TQKQHDFLYW EFHEQGGRQA LRMGKWKAIR EKVKQDANGP ILLYDLDIDP KERNDLAAKH PEVVKKAALL MQHEHVENKD FPLISNKF
|
| |