Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3537 |
Symbol | |
ID | 8254658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4207326 |
End bp | 4209278 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937188 |
Product | sulfatase |
Protein accession | YP_003093790 |
Protein GI | 255533418 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACA AATTATTTAA AGGCAGGTAT AGCAGCTTGT TCTCTTTTCT ACTTGTCTTT ATTTTTAGCT CTTTCCTGAT CAGGACTGTA TTGCTCTTTA TCTCAATTGG AAAAGCAGAT TTTACTATTT TAGGCGTAAT TCAGATTTAC CTGCTGGGCT TTGTTTATGA CCTGGCTGTA GGTTTGTTTT TAACTGGCTT GTATAACCTG TACCTGCTTT TTCTGCCAGG TAAATGGGCC AATTCTATAG CAAATAAGGT CCTTACTTAT GCGGGGCTTT TTATCATCTT GCTCATTTCT TTCTTTTCTT TTTTTGCCGA ACTTACTTTT TGGCAGGAAT TTGAGAGCAG GTTTAATTTT ATTGCTGTAG ATTACCTCAT CTACACTACC GAAGTAATCA ATAACATCAT TGAATCGTAT CCTTTGCCAT TACTGATCAG CGGGATATTG CTGCTGGTGG TATTGGTGTT CTGGTTGTTT ACCAAAAAGA AGGTCTTCCG GTATACCTTT CAGTCGGCCA TGCCATTTAA ACAAAGAATA GCAATATCTG GTGTTTTATT GCTGGCAACC ATCGTTTATC CGCTGGTGCT CAGCAATTCT TTTGCAGAAT CTGGTACCAA TCGTTACCAG AACGAGCTTT CAAAAGCAGG TATCTATTCC ATTTTTGCAG CCTTTAAAAA CAATGAACTT AATTATAAGG ATTTTTATGC CTTGCTTCCT GACGATAAAG CTTTTGCCCT GATGCGAAAA CAGCTGCAGG ATCGGCACAG TGCATTTGTC AGTACAGGCC ATTCGATAAA GAGAACAGTT AAGAGTGACA AACCTTTGTA TAAACCCAAT GTGATCATGA TTACGGTGGA GAGCTTGAGT GCAGATTTTC TTGGGCATTT TGGCAATACA CAGCATTTAA CCCCGGTACT GGATTCGCTG TCGCAACACA ACCTGGTATT TAACAATATG TTTGCAACGG GCACCCGTAC CGTAAGGGGA ATGGAAGCCC TCTCGCTTGC TATTCCTCCA ACACCGGGAA GCAGTATTGT AAGGCGAAGT AAAAATGAGA ACCTGTGTAC TGTTGGTTAT ATTTTTCAAC AGGCAGGTTA TACCCGGACT TTCTATTATG GTGGCGATGG TTATTTCGAT AACATGAATG AGTATTTTGG CAGTAATGGT TTTGACATTA CTGACAGGGG CAGGAACATT AAGGTAGGCG AGAGTTACCT GACCAAAAGG ACCATCATTC AGGATAAACA GGTAACCTTT GAAAATGCAT GGGGAATATG TGATGGCGAT CTTTTTGATG CCGTGATAAG GGGTGCCGAC CAAGATTACC AGAATGGTAA GCCTTTCTAC AATTTTGTAA TGACCACCTC TAACCACAGG CCTTTTACTT TTCCTGATGG TAAAATTGAG GCCAAAGTGA AGAACAGGGA GGCTGCAGTG CGGTATACAG ATTTTGCTAT AGGCGACTTT TTGAAAAAGA TGCAGAAGAA GGCGTGGTTT AAAAATACAG TGGTGATTAT TGTGGCCGAC CATTGTGCGG CCAGTGCGGG AAAAAATGAG ATCGACATCA GTAAATATCA CATTCCCTGT ATTGTACTGA ACCTGCCGGT AAAAGGTAAA GTAGCAATTG ATCAACTTTG CTCGCAGATA GACCTGTACC CCACTTTGTT CGACCTGCTG GGCTGGAATT ACGAGAGTAA CCTGTATGGA CAGAATGTTT TAGAACCAGG CTACCAGCCC CGTGCTGTAT TAGGTACCTA CCAACAGCTG GGATATTTAA AGCAGGACAG CCTGGTTATA TTGGGGCCAC AGCAAAAAAC CGAGACCTTT ATTTATCACC GGGAGAACAA TGAACAGGTA CCCAATCCGT TGTCCAGAAC GGTTATTGAG CAGGCCATGG CCAATTATCA AACGGCTTAC GACCTGTTTA AAAACGGTGG CCTGCACCAG TAA
|
Protein sequence | MLNKLFKGRY SSLFSFLLVF IFSSFLIRTV LLFISIGKAD FTILGVIQIY LLGFVYDLAV GLFLTGLYNL YLLFLPGKWA NSIANKVLTY AGLFIILLIS FFSFFAELTF WQEFESRFNF IAVDYLIYTT EVINNIIESY PLPLLISGIL LLVVLVFWLF TKKKVFRYTF QSAMPFKQRI AISGVLLLAT IVYPLVLSNS FAESGTNRYQ NELSKAGIYS IFAAFKNNEL NYKDFYALLP DDKAFALMRK QLQDRHSAFV STGHSIKRTV KSDKPLYKPN VIMITVESLS ADFLGHFGNT QHLTPVLDSL SQHNLVFNNM FATGTRTVRG MEALSLAIPP TPGSSIVRRS KNENLCTVGY IFQQAGYTRT FYYGGDGYFD NMNEYFGSNG FDITDRGRNI KVGESYLTKR TIIQDKQVTF ENAWGICDGD LFDAVIRGAD QDYQNGKPFY NFVMTTSNHR PFTFPDGKIE AKVKNREAAV RYTDFAIGDF LKKMQKKAWF KNTVVIIVAD HCAASAGKNE IDISKYHIPC IVLNLPVKGK VAIDQLCSQI DLYPTLFDLL GWNYESNLYG QNVLEPGYQP RAVLGTYQQL GYLKQDSLVI LGPQQKTETF IYHRENNEQV PNPLSRTVIE QAMANYQTAY DLFKNGGLHQ
|
| |