Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2831 |
Symbol | |
ID | 8253939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3366547 |
End bp | 3367869 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936477 |
Product | sulfatase |
Protein accession | YP_003093092 |
Protein GI | 255532720 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0404074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTA TACCTTTTCT CTCCATTATC TTAACTGCCT TAAGTTTAAT TACCCATGTA ACATTTGGTC AAAAAAGACC AAATGTAATT ATTGTGCTCA CCGATGATAT GGGTTACGGT GATCTGGCCT GTTACGGGAA CCCTTTATTC AAAACACCAT TTCTTGATAA AATGGCCAGT AATGGCGTAA TGGCAACAAA TTTTGTAACC ACTTCTCCTA CCTGCTCCCC ATCAAGGGTA TCAACCCTTA CCGGGCGGTA TTGCAGCCGC TCTAAAATGC CACGTGTTAT AGGCCCTGGT GATAAAACAG CAATTCCTGA TGAAGAGGTT ACCATTGCCG AAATGCTGAA AACTTCAGCT TACCGTACAG CCTGTATAGG TAAATGGCAT ATTGGCGATT ATGGTACCGG ATTGCCCAAC AAACAAGGTT TCGATTTATT TTACGGGATG TTGTACAGTC ATGACTTCAG GGCACCTTAT GTAAAAACAG ATACAGTGAT TAAAATATTC AGGAACCAAA AGCCCGAAAT ATACCGTCCT AATGATACCA TACTCACAAA AGCCTATACC AGGGAAGCCA TCGGTTTTGT AAAAGAATCG ACAGCAAAAA AACAACCTTT TTTTTTATAT CTGGCCTACA ATATGCCACA TCTTCCAGTA GCCAGCGCAG TAAGAAAAGA CAGCAATAAA TCGGCCGGAG GCGAACTGGG CAGTGTGATA GAAGAAATGG ATACGGAAAT GGCTAAGCTA TGGAAAACAG TGCAGGACAG TGGCGAAGCT GACAATACCA TTTTTATATT TACCAGCGAT AACGGCCCAT GGTTAAATGC CCCTCAGCGC ATGTACGATG ACGGCATTAC CAAGCCATAT CACGTGGGCA CAGCTGGTAT TTTCAGGGGA TCGAAGGCAA CTTCTTTAGA AGGCGGACAC CGCGTGCCTT TTATAGTTTA TTACAAAAAC CATACAGCCC AACAAGTTGT GCGCAGCCCG ATATCCAACC TGGATATTTT GCCCACCCTG GCCGACTGGA CCGGTACTGC CCTACCAAAA CGGGTGCTGG ACGGAGAATC TGTGGTTAAG CTGCTGTCAC AAAAAGACTA TCAGATTCCC CACAAGCCAA TTTATTATTA CAACTATGTC CTGGAAGGTG TAAAGGATGG TGACTGGAAG CTGAGGATCA CTAAAAAGGA TGATAAAACA ATAGAAGAAA TGTTCCATCT GGGCTGGGAC CCTACAGAGC GCTACAATTT ATACAACGAC CCAAAATATG CTAAGGAACA ACAACATTTA CTGCAGTTAT ACAGGGATTA CCCGGATCAG TAA
|
Protein sequence | MKRIPFLSII LTALSLITHV TFGQKRPNVI IVLTDDMGYG DLACYGNPLF KTPFLDKMAS NGVMATNFVT TSPTCSPSRV STLTGRYCSR SKMPRVIGPG DKTAIPDEEV TIAEMLKTSA YRTACIGKWH IGDYGTGLPN KQGFDLFYGM LYSHDFRAPY VKTDTVIKIF RNQKPEIYRP NDTILTKAYT REAIGFVKES TAKKQPFFLY LAYNMPHLPV ASAVRKDSNK SAGGELGSVI EEMDTEMAKL WKTVQDSGEA DNTIFIFTSD NGPWLNAPQR MYDDGITKPY HVGTAGIFRG SKATSLEGGH RVPFIVYYKN HTAQQVVRSP ISNLDILPTL ADWTGTALPK RVLDGESVVK LLSQKDYQIP HKPIYYYNYV LEGVKDGDWK LRITKKDDKT IEEMFHLGWD PTERYNLYND PKYAKEQQHL LQLYRDYPDQ
|
| |