Gene Information |
Locus tag | Phep_3373 |
Symbol | |
ID | 8254492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4011476 |
End bp | 4012921 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937025 |
Product | sulfatase |
Protein accession | YP_003093629 |
Protein GI | 255533257 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
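The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 4011476
END_BP = 4012921
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are consistent with the start and end coordinates.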
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAT TATTGCTGAT TTCAGTATTC TTATTATTAA GTCAACGGCT TGCTGCACAA AACGTAATTC TGATCTATGC AGATGACCTG GGCTACGCTG AACTTGGAAG CTATGGGCAA AAAAAGATCA AAACCCCGCA CCTTGACCAA CTGGCGGCTC AGGGATTACG GCTAACCCAA TTTTATACGG GTACACCTGT ATGCGCTCCA TCCAGGGCCA ACCTGATGAC AGGCCTCCAT GCTGGTCATG CGCAGATCAG AGACAATTAC GGGCTCCTTC CCTACCAGGA AAACGTAAAT GAACCGGGCT CCTTTCCGCT AAAAGCAGGC ACAGCTACTT TAGGTTCTCT ATTTAAAACA GCAGGATATG CTACAGCTGC AATTGGAAAG TGGGGACTAG GGAATCATGA CAACTCTGGA GACCCGCAGA AGTTAGGATT TGATTACTTT TATGGTTACT ATGACCAGCG GCAGGCACAT AACTATTATC CTACCCACCT CTGGGAAAAT GGCAAATGGG ATACTTTGAG AAACCATCCC ATGGAAGTTC ATCCAAAAGA TAAAACAGTT TCGGAGTCCG GGGCCTATCG TGGCAAAGAC TACGCCATTG ATAAAATGAC GGAAAAAGCA GTTCGTTTCA TTCAATCCAA TAAAGACCGG CCTTTCTTTC TTTATTTCCC TATCACCCTG CCACACGGTG TTTTGCAGGA GCCAACAAGT GGAATTGATG CTTATGTAAA ACTATTTAAT GAAAAGCCTT CAGGCAAAGA CCCGATCACA CCATACCCTA AAGCCTCATA TGCGGCTATG GTCTCCTATA TGGACCAGCA GGTCGGTGTA ATCCAGAATC TGTTAAAAGA GCTTCGGCTG GATCAGAATA CCATTGTTAT TTTTACCAGT GACAACGGGA CAGCCGCAAA TGTTGACCGT GACTTTTTCA ATAGTACAGG AGGTTTAAGG GGCGTTAAAC AAGATGTTTA TGAAGGGGGC ATAAGAGAAC CCTTTATCAT CAAATGGCCG GGTAAAATAG CCCAGGGAAA AACCAGCGAT TACCCTGTTG TTACCTATGA CCTGATGGCA ACTTTTGCCG ATCTGCTCCA GGTAAAAGCA CCTAAAAACG ATGGAATTTC AGTACTTGAT TTGTTTAAGG GCAGCCTACC TGTTGCTAAG CGTGGATTTT TATATTGGGA ATACCCTTCA AAAGGTGGAC AGCTGGCCAT CAGAATAGGG AACCTTAAGG GTGTAAAAAC CAATATCCAG AAAAATAAGG CTGCTGCCTG GCAAATATAC GACCTTTCAA AAGATCCGGG AGAGTCCAAT GATATTGCAT CCAGTCATCC GGAGTTGCCC CATGCATTTG ATGCTATTGT AAAAAAAGAA CATACCTCGC CATTACGTCC CGAATGGGAT ATTTTTAAAT CAAAAAAGGA ATCAACTGAA AATTAA
|
Protein sequence | MNRLLLISVF LLLSQRLAAQ NVILIYADDL GYAELGSYGQ KKIKTPHLDQ LAAQGLRLTQ FYTGTPVCAP SRANLMTGLH AGHAQIRDNY GLLPYQENVN EPGSFPLKAG TATLGSLFKT AGYATAAIGK WGLGNHDNSG DPQKLGFDYF YGYYDQRQAH NYYPTHLWEN GKWDTLRNHP MEVHPKDKTV SESGAYRGKD YAIDKMTEKA VRFIQSNKDR PFFLYFPITL PHGVLQEPTS GIDAYVKLFN EKPSGKDPIT PYPKASYAAM VSYMDQQVGV IQNLLKELRL DQNTIVIFTS DNGTAANVDR DFFNSTGGLR GVKQDVYEGG IREPFIIKWP GKIAQGKTSD YPVVTYDLMA TFADLLQVKA PKNDGISVLD LFKGSLPVAK RGFLYWEYPS KGGQLAIRIG NLKGVKTNIQ KNKAAAWQIY DLSKDPGESN DIASSHPELP HAFDAIVKKE HTSPLRPEWD IFKSKKESTE N
|
| |
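The sequences above are laid out in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below is one possible way to do that, with hypothetical helper names: it strips the block spacing, computes a GC fraction that can be compared with the record's GC content field, and applies a simplified completeness check. The check accepts only ATG as a start codon, which is a simplifying assumption (translation table 11 also permits alternative starts), and uses the three stop codons TAA, TAG and TGA.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Illustration using only the first two blocks of the gene sequence above.
first_blocks = clean_sequence("ATGAACAGAT TATTGCTGAT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full "Gene sequence" value through `clean_sequence` and then `gc_fraction` gives a figure to compare with the GC content field, and `len(clean_sequence(...))` can be compared with the Gene Length field.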