Gene Information |
Locus tag | Phep_2827 |
Symbol | |
ID | 8253935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3358452 |
End bp | 3360089 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644936473 |
Product | sulfatase |
Protein accession | YP_003093088 |
Protein GI | 255532716 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
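
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is part of the gene."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values copied from the record for Phep_2827.
length = gene_length_bp(3358452, 3360089)
print(length)                              # 1638
print(expected_protein_length_aa(length))  # 545
```

Both results match the Gene Length (1638 bp) and Protein Length (545 aa) fields.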
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.115962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAT TGAAATTAAT TTTACCGGTT TTGTTTGCCG GTGCCACCTT AATGTCTTGC CAGCAGCCTA AACCTGCTGA AAGTGCCAAA AGGCCCAATA TTGTGTTCAT CATGACAGAT GACCATACCA TTCAGGCCAT AAGCGCTTAT GGCAGCAAAT TGGTAAAAAC GCCCAACCTG GACAGAATTG CCAACGAGGG TATGTTGTTT AACAACTGTT TTGTAACCAA TGCAGTTTGC GGGCCATCCA GGGCTACTAT CCTGACCGGA AAATATAGCC ACCTGAATGG TTTAACAGAC AATTCAAAGG TATTTGACAG TACTCAGGTT ATTTATCCGC AGTTGTTAAA GAAAGCAGGG TACCAGACCG CAATGATTGG CAAGTGGCAC CTGGGCTCAA CACCAATGGG CTTTGACTAT TACAGTATTT TGCCCAACCA GGGACAATAT TATCAGCCTG AATTTATAGA AAACGGGCAT CTGGTTAAAG AAAAAGGATA TGTAACAGAC CTCATCACCG ATAAGGCCAT CGGCTTCCTT GAAAAAAGGG ACCATGATAA ACCCTTTCTG ATGATTTACC AGCACAAAGC ACCGCACCGC AACTGGTTGC CGGCACCAAG ACACCTGGGG ATGTTTGACG ATACGGTTTT TCCTGAACCT GCCAATTTAC TGGATGATTT TAAGGGCAGG GGCAGGGCAG CAAAGGAGCA GCTGATGAAC ATTTCTACCG ATATGTGGCC TGCATGGGAC CTTAAAATGC TTTCTACAGC CCAGCTTGAT TCTATGGCGA AACTACCTGT TTCCCCTAAG TTTAAAGATG CCAAGGGTGA TGATTATCAA CAGGCCAATG ATCCTTCACT GGATAAAGCC CGTTTTTTTG AAGTGTACAA CCGCATGACA GATGCTGAAA AGGTACAATG GAGAAAAGTA TATGACAAAC GCGTAGCCGA ATTTAAAAGG CTGAACCCGA AAGGGGCCGA CCTGGTGCGA TGGAAATACC AGCAGTATAT GCGCGATTAT CTGGCCTGCG TGGTTTCGGT AGATGAAAAT GTAGGCAGGC TGATGGATTA CCTGAAAAAG ATAGGGGAGC TGGACAATAC CATTATTGTC TATACTTCCG ATCAGGGCTT TTATTTGGGT GAGCATGGGT ATTTCGACAA ACGTTTTATG TACGATGAAT CTTTCCGTAC ACCTTTAATG GTGAGGTATC CGCCTTCGGT TAAAGCCGGT TCAGTAAGTA ATGCCTTTGC CATGAACCTC GATTTTGCAC CAACTTTACT GGATTATGCA GGGGTAAAAA TACCAGCCGA TATGCAGGGC CTGTCGTTAC GTCCGGTATT GGATAACGCA GGAAAATCGC CGGAAAACTG GCGCAAGGCT GTATATTATC ATTATTATGA ATTTCCAAGC TGGCACATGG TTAAAAGGCA CTATGGCATC AGAACGGAGC GCTATAAACT GATCCATTTT TACAATGACA TTGATGAATG GGAATTATAC GATATGCAGA AAGATCCGCA TGAGATGCAA AACCTGTATA ACGATAAGGC CTATGAGCCG ATTATTAAAG ACCTGAAAGT GCAAATGAAA AAGCTGCAGG TACAATATAA AGATACGAAT CCAACTGAAG CTTTATAA
|
Protein sequence | MGKLKLILPV LFAGATLMSC QQPKPAESAK RPNIVFIMTD DHTIQAISAY GSKLVKTPNL DRIANEGMLF NNCFVTNAVC GPSRATILTG KYSHLNGLTD NSKVFDSTQV IYPQLLKKAG YQTAMIGKWH LGSTPMGFDY YSILPNQGQY YQPEFIENGH LVKEKGYVTD LITDKAIGFL EKRDHDKPFL MIYQHKAPHR NWLPAPRHLG MFDDTVFPEP ANLLDDFKGR GRAAKEQLMN ISTDMWPAWD LKMLSTAQLD SMAKLPVSPK FKDAKGDDYQ QANDPSLDKA RFFEVYNRMT DAEKVQWRKV YDKRVAEFKR LNPKGADLVR WKYQQYMRDY LACVVSVDEN VGRLMDYLKK IGELDNTIIV YTSDQGFYLG EHGYFDKRFM YDESFRTPLM VRYPPSVKAG SVSNAFAMNL DFAPTLLDYA GVKIPADMQG LSLRPVLDNA GKSPENWRKA VYYHYYEFPS WHMVKRHYGI RTERYKLIHF YNDIDEWELY DMQKDPHEMQ NLYNDKAYEP IIKDLKVQMK KLQVQYKDTN PTEAL
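
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes GC content, and translates the DNA. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs from it only in the permitted start codons. The gene sequence as listed begins with ATG and ends with TAA, so it is read in the coding orientation as given even though the gene lies on the minus strand. The helper names are hypothetical, and only the first three blocks of the gene sequence are used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon; a trailing partial codon is ignored."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGTAAAT TGAAATTAAT TTTACCGGTT"
print(translate(prefix))  # MGKLKLILPV
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.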