Gene Information |
Locus tag | Phep_2826 |
Symbol | |
ID | 8253934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3356938 |
End bp | 3358440 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936472 |
Product | sulfatase |
Protein accession | YP_003093087 |
Protein GI | 255532715 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
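The coordinate and length fields in this record are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. Both assumptions fit the values listed here, but the record does not state them. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3356938, 3358440)
print(length)                  # 1503
print(protein_length(length))  # 500
```

Both results match the Gene Length and Protein Length fields listed in the record.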
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.250394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTA ACAAATTGAA ATATTTCCCT GCAGCACTTT CAATGGTGCT GATATGGGCT TCCTGCACTT CGCCGGAAAA AAAAACGGAT CGTCCGAATA TCCTGATGAT CATGTCCGAT AACCAATCCT GGAACCACGT AGGGAGCTAT GGTGATCAAA CGGTACGCAC GCCCAATATG GACCGGATTG CGAAAGAAGG GGTACGTTTT ACCAATGCTT TTTGCAGTTC ACCTTCCTGT ACGCCCGCAA GGGCTGGAAT GCTGACCGGA CAGGATATAT GGAGGTTAGA AGATGGGGGC AATTTATGGG GTGTTTTACC GGTTAAATAT AAAGTATATC CGGATTTGCT GGAAGAAGCT GGCTATGCCA TAGGTTTTCA GGGAAAAGGC TGGGGCCCGG GAAGCTTTGA GGCCAATAAA CGCCCAAGAA ATCCTGCAGG GAATGAGTTT AAAAGTTTTG GCGCATTTTT AAAAGATAAA AAAGAAGGTC CCTGGTGTTA TTGGATCAGT AGTCATGAAC CTCACCGTCC TTATGTGGAA GGTTCCGGCG AAAAAGCTGG TATCGATCCA AATAAAGTAA AAGTTCCTGC CTATTTGCCA GATCATATCA GTATAAGAAA AGACATTGCA GATTACTACG CTGCGGTTGA AACCTTTGAT CGTGAACTGG GCGAGGCCCT TGACCAGTTG AAAGCAAGTG GTGAGCTGGA CAATACGGTA ATTGTGGTAT GCAGTGACAA CGGCTGGCAA ATGCCGCGTG GACTGGCCAA CTTGTACGAT TTTGGTACAC ATGTGCCCCT GATCATTTCA TGGCCAGGTA AGTTTAAACA GGATGTAGTT GCCGATAACC TGGTCACACT GAATGACCTT GCCCCAACAT TCTTACAACT GGGTAAGGTA CCTGTACCGG CCGATATGAC GGGTAAAAGT TTATTGCCCA TTGTTGAGGC AGGTAAAAAA GATGAAAAAC CCCGGGATTA TGTAGTACTG GGAAGAGAGC GTCATGCATT CGTTCGTCGG CATGGCCTTG GCTATCCTGG CAGGGCAATT CGTACTAAAG ATTATCTTTA CATTAAAAAT TATGAACCAA ATAGATGGCC GGCAGGTGAT CCGCCGTTTT ATGGAGACAT TGATCCCTAC ATGTTCAACT GGCCGGGTGA AACCAAATAT TACCTGATAG AACATAAAGA TGATCCGAAA GTAAAGTCTT TCTTTGAACT GGGAATGGGC AAACGTCCGG CAGAAGAATT ATTTGATATC AATAAAGATC CGGATGAATT ACACAATCTG GCAGCACTTC CTGAATATCA AAAAATAAAA CAGGAGCTTG TTGCTAAATT GCGTAATTAT TTGGTAGCAA CGAAAGATCC GAGAGAAACT AATGGTAATA TACAGATCTG GGATACTGCT GCTTATTTTA GTGAAATAGA TAAAACGCCA AAACCAAGTA AAGAGATGCA AAAGCGTTTT AAATTAGATT CCAGTTACAA TTATTTGAAG TAA
|
Protein sequence | MKFNKLKYFP AALSMVLIWA SCTSPEKKTD RPNILMIMSD NQSWNHVGSY GDQTVRTPNM DRIAKEGVRF TNAFCSSPSC TPARAGMLTG QDIWRLEDGG NLWGVLPVKY KVYPDLLEEA GYAIGFQGKG WGPGSFEANK RPRNPAGNEF KSFGAFLKDK KEGPWCYWIS SHEPHRPYVE GSGEKAGIDP NKVKVPAYLP DHISIRKDIA DYYAAVETFD RELGEALDQL KASGELDNTV IVVCSDNGWQ MPRGLANLYD FGTHVPLIIS WPGKFKQDVV ADNLVTLNDL APTFLQLGKV PVPADMTGKS LLPIVEAGKK DEKPRDYVVL GRERHAFVRR HGLGYPGRAI RTKDYLYIKN YEPNRWPAGD PPFYGDIDPY MFNWPGETKY YLIEHKDDPK VKSFFELGMG KRPAEELFDI NKDPDELHNL AALPEYQKIK QELVAKLRNY LVATKDPRET NGNIQIWDTA AYFSEIDKTP KPSKEMQKRF KLDSSYNYLK
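The sequences in this record are printed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field. The helper names are hypothetical, and the short input is just the first two blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence, used as a small example.
seq = clean_sequence("ATGAAATTTA ACAAATTGAA")
print(len(seq))         # 20
print(gc_percent(seq))  # 15.0
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the Gene Length field, and the rounded `gc_percent` can be checked against the GC content field.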