Gene Information |
Locus tag | Phep_2825 |
Symbol | |
ID | 8253933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3355536 |
End bp | 3356930 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644936471 |
Product | sulfatase |
Protein accession | YP_003093086 |
Protein GI | 255532714 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
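The coordinate and length fields above are related by simple arithmetic, sketched below as a consistency check. The sketch assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(3355536, 3356930)
print(length, protein_length_aa(length))  # prints "1395 464"
```

Both results agree with the Gene Length (1395 bp) and Protein Length (464 aa) fields.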
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.424556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.232256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATGT ACAAATCGAA AGGCTGGTTG ATAGCCATGC TTATACTTGC AGGTTTTGGA GATGCAGGGG CGCAAACCTC AAAAGTAGCA GCTTCCAGGC CTAACATCAT TATCATCATG ACAGATCAGC AAACAGCTGA TGCCATGAGC AATGCTGGTA ATAAGGACCT GCATACACCT GCAATGGATG TTTTGGCTGC AAACGGTACC CGTTTTACAC GTGCCTATTG TGCCCAGCCG CTCTGTACAC CTTCACGCTC CGCGATATTT AGCGGAAAAA TGCCACATGA AACCGGCTTT ACGGGGAATA CACCGGAAAA GGACGGACAG TGGCCCGATT CTGTGCTGAT GATGGGCAAA ATATTTAAGG CAGGAGGCTA TAAAACCGGC TACGTCGGAA AATGGCACCT GCCTGTTCCT GTTACTAAAG TAGCACAACA TGGATTTGAG ACTATTGAGA ATACAGGTAT GGGCGATTAT ACCGATGCAG TTACCCCATC GCAATGCGCC AACTTCATTA AAAAGAATAA AGACAACCCA TTTTTACTGG TAGCATCCTT TTTGAACCCA CACGATATTT GTGAATGGGC AAGGGGTGAT AATTTGAAAA TGGATGTTCT GGATGCAGCG CCGGATACAG CATTTTGTCC GAAATTACCT GCCAACTGGC CAATTCCGGC TTTTGAGCCT GCCATTGTAA GGGAACAGCA AAAGGTGAAC CCGCGTACTT ATCCTTCGGT AGGCTGGAAC GAAAGCCAGT GGCGCAAATA CCGCTGGGCC TATAACCGCC TGGTAGAGAA GGTAGACAAT TATATGGCCA TGGTATTGGG TTCGTTAAAA AAATATGGTA TAGAAGACAA TACCATCATC ATCTTTACCA GCGATCATGG TGATGGTTAT GCGGCACATG AGTGGAACCA GAAGCAGATT TTGTATGAGG AGGCTGCCAG GATACCTTTT ATCATCTCGA AGATCGGACA ATGGAAAGCC AGAACCGATG ATCAGCTGGT TTGCAATGGC ATCGATATTA TCCCCACCAT ATGTGGCTTT GCCGGAATTG CTAAACCTGT TGGTTTAAAA GGCCTGGATT TAAGTAAACG TATTGCCAAC CCTTCGGTTA AACTACGGGA TACTTTAGTG ATAGAAACCG ATTTTGCTGA TAACGAACTG TTGCTGGGTA TTAAGGGCAG GGCAGTGATT ACCAAAGATT TTAAATACAT TGTTTATGAC AAGGGGGAGA TCCGGGAACA ATTGTTTGAC CTGGAAAAAG ACGCAGGAGA AATGGATAAC CTGGCTGTTA AACCCGCCTA TAAAAAGAAA TTGAATGAAA TGCGCGCTTA CCTGAAACTA TGGTGTAAAC AGCACCAGGA TTCGTTTTAT GCATTAAAAA AATAA
|
Protein sequence | MKMYKSKGWL IAMLILAGFG DAGAQTSKVA ASRPNIIIIM TDQQTADAMS NAGNKDLHTP AMDVLAANGT RFTRAYCAQP LCTPSRSAIF SGKMPHETGF TGNTPEKDGQ WPDSVLMMGK IFKAGGYKTG YVGKWHLPVP VTKVAQHGFE TIENTGMGDY TDAVTPSQCA NFIKKNKDNP FLLVASFLNP HDICEWARGD NLKMDVLDAA PDTAFCPKLP ANWPIPAFEP AIVREQQKVN PRTYPSVGWN ESQWRKYRWA YNRLVEKVDN YMAMVLGSLK KYGIEDNTII IFTSDHGDGY AAHEWNQKQI LYEEAARIPF IISKIGQWKA RTDDQLVCNG IDIIPTICGF AGIAKPVGLK GLDLSKRIAN PSVKLRDTLV IETDFADNEL LLGIKGRAVI TKDFKYIVYD KGEIREQLFD LEKDAGEMDN LAVKPAYKKK LNEMRAYLKL WCKQHQDSFY ALKK
|
| |
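As a rough illustration of how the two sequence fields relate, the sketch below strips the 10-character block spacing, translates the nucleotide sequence codon by codon, and computes a GC fraction. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA, although the record lists the strand as "-"). The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, which this sketch does not model. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAAGATGT ACAAATCGAA AGGCTGGTTG"
print(translate(first_blocks))  # prints "MKMYKSKGWL"
```

The output matches the first block of the protein sequence field. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.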