Gene Information |
Locus tag | Phep_3489 |
Symbol | |
ID | 8254609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4152384 |
End bp | 4153991 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937139 |
Product | sulfatase |
Protein accession | YP_003093742 |
Protein GI | 255533370 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
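The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4152384
end_bp = 4153991

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1608 535
```

Under these assumptions the computed values agree with the reported Gene Length (1608 bp) and Protein Length (535 aa).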
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTAA GAACTCAGAC CATTCAGATC ATCTCTTTTT TAGGGCTGGT TACCATGGGT TTTCAGGTAC TTGCTCAAAA GCAGGAGAAG CCAAACGTAT TGTTTATCGC TGTTGATGAC CTGAAGCCTA TTTTAGGTTG TTATGGCGAT CGGTTGATTA AAACACCCAA TATAGACCGG TTGGCAAAAA TGGGTACGGT TTTTAAAAGC AACTATTGCC AGCAGGCAGT TTGCGGTCCA ACCAGAGCGA GTATCATGAC CGGAATGCGC CCTGATATAA CCAAAGTATG GGATTTGAAA ACAAAGATGA GGGATATGAA TCCTGATATT CTGACCATCC CGCAATACTT TGCCAGTCAG GGATACTCCA CGCAGGCTAT CGGTAAGATA TATGATCCAA GATGTGTGGA TGAGGATTTA GATAAACCAA GCTGGACCGT TCCACATTAC AGAACAGATA AAAAATATTA TGCTGCCTCT ACCGGACAGC CTGTTTTAAA TTATTATCAG GGAAAAGAGA TTAAATCACT GGTTGAAAAA CGCAGGGCTG AGGCTAAAGG AAAGATCATA ACCGATCAGG AATTGTTGGC TACGATCAAA CCATCGGTAG AATGTGTGGA TGTACCCGAT CAGGCATATA TTGACGGAGC CAACATCCTG CAGGCAAAGG ATATTTTAAC AACACTCCAA AAGAAAAGCC AACCCTTCTT TTTTGCCGTA GGCTTTGCCA AACCTCATTT GCCCTTTAAT GCACCGAAGA AATACTGGGA CCTGTATCAG CGGGAGGATA TGCCGGTTGC AGCGTTTCAG GAAAAATCTA AAAATGCAGT GGATGTAGCT TACCACAATT CGGGGGAACT CAGGGCTTAT TCAGATATTC CGGATTTATT ATCTTTTACT GATCAGAAAA GCTATGGGCT AACTTTACCC ATAGCTAAAC AAAAAGAACT GATACATGGA TACTATGCAG CGGTTTCTTA TGTAGATGCA CAGGTAGGCA TCTTATTAAA TGCCCTGGAC TCACTGGGTT TAAGTAAAAA CACGGTCATT GTACTTTGGG GCGACCACGG ATGGCATTTA GGCGATCATA ACCTTTGGTG CAAACATTCC GATTTTGAAC AGGCCACCCG TAGCCCTTTG ATCTTTTCAG CTCCAGGTAT TAAATCCTCC GCCACTACTT CCCTTTCAGA ATTTGTAGAT GTTTTTCCTA CGCTTTGCAA TTTAGCCGGT ATTCCGGTGC CCCAGCATTT AGAGGGTACC AGTCTGGTTC CATTGATGCG AAATCCTGCC TCTTCGATAA AGGAATTTGC GATCAGCCAG TATCCCCGAA GTTCAAATGC TGTGGAAACA CAACGAATGA CAGACGCTTC AGCGAAGGTT ATGGGTTATT CACTTCGCAC AAAAAGATAT CGTTACACGA TATGGATGGA GAATTTCAGG AGTAACCAGG CATTTAAGGC TACCGCTGTT GTTGGTGATG AATTGTATGA TTATCAGAAG GACCCGCTTG AAAAAATAAA TGTAGTGAAG GATAGAAATT ATGCACTGAT CGCCAAAAGT TTAAAGGATA AAATGATCAG GTATTTTCAT AGTAAAGAAA AGCCGTAA
|
Protein sequence | MILRTQTIQI ISFLGLVTMG FQVLAQKQEK PNVLFIAVDD LKPILGCYGD RLIKTPNIDR LAKMGTVFKS NYCQQAVCGP TRASIMTGMR PDITKVWDLK TKMRDMNPDI LTIPQYFASQ GYSTQAIGKI YDPRCVDEDL DKPSWTVPHY RTDKKYYAAS TGQPVLNYYQ GKEIKSLVEK RRAEAKGKII TDQELLATIK PSVECVDVPD QAYIDGANIL QAKDILTTLQ KKSQPFFFAV GFAKPHLPFN APKKYWDLYQ REDMPVAAFQ EKSKNAVDVA YHNSGELRAY SDIPDLLSFT DQKSYGLTLP IAKQKELIHG YYAAVSYVDA QVGILLNALD SLGLSKNTVI VLWGDHGWHL GDHNLWCKHS DFEQATRSPL IFSAPGIKSS ATTSLSEFVD VFPTLCNLAG IPVPQHLEGT SLVPLMRNPA SSIKEFAISQ YPRSSNAVET QRMTDASAKV MGYSLRTKRY RYTIWMENFR SNQAFKATAV VGDELYDYQK DPLEKINVVK DRNYALIAKS LKDKMIRYFH SKEKP
|
| |
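The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate codons, using only the first 30 bases of the gene sequence as input. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and written for this illustration only.

```python
# Codon table built in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
gene_prefix = "ATGATTTTAA GAACTCAGAC CATTCAGATC"

print(translate(gene_prefix))  # MILRTQTIQI
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content, which the record rounds to a whole percent.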