Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3390 |
Symbol | |
ID | 8254509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4030364 |
End bp | 4031905 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937042 |
Product | sulfatase |
Protein accession | YP_003093646 |
Protein GI | 255533274 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0663726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT ATAAACTTCT GATAATAGCC TTACTGACCG TCAAAACCAC TCTTTTTGCA CAAAATCAAA AACCAATCCG CCCGAACATC ATTGTTTTTA TGGTTGATGA TATGGGCTGG ATGGATACTT CAGTGCCTTT TTATGACAGC ATTATGCCCC TGAACAAGCG TTACCATACA CCCAATATGG AGCGGCTGGC ACAGGCAGGA ATGAAATTTA CAGATGCTTA TGCGCAACCG GTCTGCACCC CTTCCAGAGT GAGTTTTATG ACAGGCATGA ATGCCTCCAG GAGCCATGTG ACCAATTGGA CATCACCCCT TAAAAATAAT GATGCTGATG AAAAGGATGA GCAGTTTGAG CCCCTGGAAT GGAACATGAA CGGTTTAAGC AATAATGCAG AAACGGAGCG CACAGTTTTT GCAACACCCT TTCCACAATT GTTAAAAGAT GCCGGTTATT ATACCATTCA TATTGGCAAG GCCCATTGGG CTGCAATAGG TACCCCAGGT GCAAGCCCTT ACAACTTAGG GTTTATGGTT AACATTGCCG GGCATTCGGG CGGACATCCG CAAAGCTATT TATCGGAGCA AAATTATGGC AATATGCCTG GTAAGACACA GGTACAGGCG GTACCCGATT TAGAGGCGTA TTTTAAAACA GGTACTTTTC TTTCAGAAGC CCTTACACAG GAAGCCCTGA AAACAATGGA AACACCAATA GCCAGAAAGG AGCCTTTTTA TCTGAATATG GCCCATTATG CTGTTCATAC CCCTATAATG GCCGATCCGC GCTTTGTACA AAAGTATTAT GATGCGGGGC TTGACAGTAC CGAGGCCAGA TATGCGAGTC TGGTTGAAGG TATGGACAAA AGCCTTGGTG ATATTATGGA TTATTTAAAA AAGAAAGGTG TGGACAAAAA TACCATCATT ATTTTTATGA GCGATAACGG AGGGCTAGAC CATCACCAAA GGGGAGGTGC CCTAAATACA CACAACTACC CCCTCAGGTC GGGCAAAGGC TCGGTATATG AAGGTGGGAT AAGGGAACCC ATGATTGTAA GATGGCCAGG TGTAACCAGC GCAGGTTCAG TTTATAAAAA TCCGGTGATC ATTGAAGATT TTTTCCCTTC CATATTGGAA ATGGCAGGGG TTAAGCCAGC TAAGATACTC CAAAAAACGG ATGGGCAAAG TTTTGTGAAA TACCTCAAAA ACCCTCATTT AAAAGCGCCG GACCGTCCTT TGGTATTTCA TTATCCCAAC AAATGGATTA ACCTGACCGC GAATGAAAAA CTGGGTATCA ATTATTTTAC TGCGCTTCGT TTAGGGAACT GGAAGCTGTT ATACAATATG CGGAACCGGG AGTTTGAACT GTATGATCTG GCCCAGGATA TCCGGGAGGC AAATAACCTG GCTGGTAAAT ATCCGGGAAT GGTAAAAAAA CTGGCCCTGG TATTGGGCAA AACACTTAAA GAACGGAATG CACAGCTTCC ACGTGAAAAA GCAAGTGGAA AAGTGATTCC GTTCCCTGAT GAAGTACAAT AG
|
Protein sequence | MKKYKLLIIA LLTVKTTLFA QNQKPIRPNI IVFMVDDMGW MDTSVPFYDS IMPLNKRYHT PNMERLAQAG MKFTDAYAQP VCTPSRVSFM TGMNASRSHV TNWTSPLKNN DADEKDEQFE PLEWNMNGLS NNAETERTVF ATPFPQLLKD AGYYTIHIGK AHWAAIGTPG ASPYNLGFMV NIAGHSGGHP QSYLSEQNYG NMPGKTQVQA VPDLEAYFKT GTFLSEALTQ EALKTMETPI ARKEPFYLNM AHYAVHTPIM ADPRFVQKYY DAGLDSTEAR YASLVEGMDK SLGDIMDYLK KKGVDKNTII IFMSDNGGLD HHQRGGALNT HNYPLRSGKG SVYEGGIREP MIVRWPGVTS AGSVYKNPVI IEDFFPSILE MAGVKPAKIL QKTDGQSFVK YLKNPHLKAP DRPLVFHYPN KWINLTANEK LGINYFTALR LGNWKLLYNM RNREFELYDL AQDIREANNL AGKYPGMVKK LALVLGKTLK ERNAQLPREK ASGKVIPFPD EVQ
|
| |