Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3047 |
Symbol | |
ID | 8254163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3643027 |
End bp | 3644538 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936700 |
Product | sulfatase |
Protein accession | YP_003093307 |
Protein GI | 255532935 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0209579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA CAACAATTCT GATTTTAGCA TTCCTTCTTT TTGGCACAAC GATTCATCTA TCAGCCCAAA CGGCAAAACC AAATGTTATT TTTATTTATG CAGATGATTT GGGTTATGGC GACCTGAGCT GCTATGGTGC TACTAAAATA AATACTCCAA ACCTTGACAA ACTTGCAAAA AATGGCATCC GCTTTACCAA CGGGCATTCT ACTTCAGCCA CCTGTACCCC ATCCCGCTTC GCGGTTATGA CAGGGCAATA CCCATGGCGC CAAAAAGGAA CAGAAATTTT ACCTGGCGAC GCGGCACTGA TTGTTCCAAC AAATACGACT ACCCTGCCTA AAGTATTTAA AAAAGCTGGT TATCAAACCG CAGTTGTAGG TAAATGGCAC CTCGGGTTGG GTAAACAAGT AGAAAAAAAC TGGAATACTG AAGTAACACC CGGGCCAAAT GAAGTGGGCT TCGATTATTC ATTTATTTTT CCGGCAACAG CCGATAGGGT CCCAACTGTT TTTATGGAAA ATCATAAAAT ACTGGCTTTG GATCAGACAG ACCCCATCAG AGTAGATTAT AAAAATCCGG TCGGCAACGA TCCGACAGGA AAAGAGCATC CGGAATTGCT AAAACTCAAA TCTTCAGCAG GACAGGGACA CAACAATACC ATTGTAAACA GTATTGGCAG GATTGGGTAC ATGGAGGGCG GTCACAAAGC CAGGTGGACA GATGAAGAGG TATCCACAAC ATTTCTGACC AAGGCCCGGG ATTTTATAGA AAAGAATAAA AATCAACCGT TTTTCCTTTA TTTTACACTA ACCGAACCGC ATGTGCCCCG TATGCCTGCC ACCATGTTTA AAGGGAAAAG TGAACTTGGA TTAAGGGGTG ATGCGATATT GCAGCTGGAC TGGACAGTAG GCCAGATCAT GAAACAACTG GAATATCTGA AACTGGATAA AAATACCCTC ATTATTTTTT CGAGTGATAA CGGCCCGGTA CTGGACGATG GCTATGAAGA TGAGGCTGTA AAAAAAAGTG TCGGACATCA ACCTTCCGGA ATTTTCAGGG GCGGCAAATA CAGTTCATTC GAAGCCGGAA CAAGGGTTCC ATGGATCATG AGTTGGCCAG GCACCATATT GCCAAAGGTA TCTGAAGCTA TGGTTTGCCA AATGGACTTA CTCGCCTCTT TTTCCTACCT TTTAAACGTC CCACTGCCTG CTGATGAAAC TACCGACAGC GAAAATGTCT TACCTGCGTT GCTAGGCAAA TCCACAAAAG GCCGTACCAC ATTAATTACA CAGGGAGGCC CACTGGCCAT CATCAAAAAC AACTGGAAAT ACATTGAACC AGGTAAGGGC ATTGCTTACG ATAAATTAAC AGGTATAGAG ATGGGTGTAT CGGCAAGTGG TCAATTGTAT AACCTGAACA CCGATCCTGG AGAAACTGAA AATGTTCTCT ACAAATACGC GAGCAAAGCA AAAGAGCTGG CTACACTGCT CAATGCTATA AAAGCTAAAT AA
|
Protein sequence | MKRTTILILA FLLFGTTIHL SAQTAKPNVI FIYADDLGYG DLSCYGATKI NTPNLDKLAK NGIRFTNGHS TSATCTPSRF AVMTGQYPWR QKGTEILPGD AALIVPTNTT TLPKVFKKAG YQTAVVGKWH LGLGKQVEKN WNTEVTPGPN EVGFDYSFIF PATADRVPTV FMENHKILAL DQTDPIRVDY KNPVGNDPTG KEHPELLKLK SSAGQGHNNT IVNSIGRIGY MEGGHKARWT DEEVSTTFLT KARDFIEKNK NQPFFLYFTL TEPHVPRMPA TMFKGKSELG LRGDAILQLD WTVGQIMKQL EYLKLDKNTL IIFSSDNGPV LDDGYEDEAV KKSVGHQPSG IFRGGKYSSF EAGTRVPWIM SWPGTILPKV SEAMVCQMDL LASFSYLLNV PLPADETTDS ENVLPALLGK STKGRTTLIT QGGPLAIIKN NWKYIEPGKG IAYDKLTGIE MGVSASGQLY NLNTDPGETE NVLYKYASKA KELATLLNAI KAK
|
| |