Gene Phep_3373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3373 
Symbol 
ID8254492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4011476 
End bp4012921 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content44% 
IMG OID644937025 
Productsulfatase 
Protein accessionYP_003093629 
Protein GI255533257 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAT TATTGCTGAT TTCAGTATTC TTATTATTAA GTCAACGGCT TGCTGCACAA 
AACGTAATTC TGATCTATGC AGATGACCTG GGCTACGCTG AACTTGGAAG CTATGGGCAA
AAAAAGATCA AAACCCCGCA CCTTGACCAA CTGGCGGCTC AGGGATTACG GCTAACCCAA
TTTTATACGG GTACACCTGT ATGCGCTCCA TCCAGGGCCA ACCTGATGAC AGGCCTCCAT
GCTGGTCATG CGCAGATCAG AGACAATTAC GGGCTCCTTC CCTACCAGGA AAACGTAAAT
GAACCGGGCT CCTTTCCGCT AAAAGCAGGC ACAGCTACTT TAGGTTCTCT ATTTAAAACA
GCAGGATATG CTACAGCTGC AATTGGAAAG TGGGGACTAG GGAATCATGA CAACTCTGGA
GACCCGCAGA AGTTAGGATT TGATTACTTT TATGGTTACT ATGACCAGCG GCAGGCACAT
AACTATTATC CTACCCACCT CTGGGAAAAT GGCAAATGGG ATACTTTGAG AAACCATCCC
ATGGAAGTTC ATCCAAAAGA TAAAACAGTT TCGGAGTCCG GGGCCTATCG TGGCAAAGAC
TACGCCATTG ATAAAATGAC GGAAAAAGCA GTTCGTTTCA TTCAATCCAA TAAAGACCGG
CCTTTCTTTC TTTATTTCCC TATCACCCTG CCACACGGTG TTTTGCAGGA GCCAACAAGT
GGAATTGATG CTTATGTAAA ACTATTTAAT GAAAAGCCTT CAGGCAAAGA CCCGATCACA
CCATACCCTA AAGCCTCATA TGCGGCTATG GTCTCCTATA TGGACCAGCA GGTCGGTGTA
ATCCAGAATC TGTTAAAAGA GCTTCGGCTG GATCAGAATA CCATTGTTAT TTTTACCAGT
GACAACGGGA CAGCCGCAAA TGTTGACCGT GACTTTTTCA ATAGTACAGG AGGTTTAAGG
GGCGTTAAAC AAGATGTTTA TGAAGGGGGC ATAAGAGAAC CCTTTATCAT CAAATGGCCG
GGTAAAATAG CCCAGGGAAA AACCAGCGAT TACCCTGTTG TTACCTATGA CCTGATGGCA
ACTTTTGCCG ATCTGCTCCA GGTAAAAGCA CCTAAAAACG ATGGAATTTC AGTACTTGAT
TTGTTTAAGG GCAGCCTACC TGTTGCTAAG CGTGGATTTT TATATTGGGA ATACCCTTCA
AAAGGTGGAC AGCTGGCCAT CAGAATAGGG AACCTTAAGG GTGTAAAAAC CAATATCCAG
AAAAATAAGG CTGCTGCCTG GCAAATATAC GACCTTTCAA AAGATCCGGG AGAGTCCAAT
GATATTGCAT CCAGTCATCC GGAGTTGCCC CATGCATTTG ATGCTATTGT AAAAAAAGAA
CATACCTCGC CATTACGTCC CGAATGGGAT ATTTTTAAAT CAAAAAAGGA ATCAACTGAA
AATTAA
 
Protein sequence
MNRLLLISVF LLLSQRLAAQ NVILIYADDL GYAELGSYGQ KKIKTPHLDQ LAAQGLRLTQ 
FYTGTPVCAP SRANLMTGLH AGHAQIRDNY GLLPYQENVN EPGSFPLKAG TATLGSLFKT
AGYATAAIGK WGLGNHDNSG DPQKLGFDYF YGYYDQRQAH NYYPTHLWEN GKWDTLRNHP
MEVHPKDKTV SESGAYRGKD YAIDKMTEKA VRFIQSNKDR PFFLYFPITL PHGVLQEPTS
GIDAYVKLFN EKPSGKDPIT PYPKASYAAM VSYMDQQVGV IQNLLKELRL DQNTIVIFTS
DNGTAANVDR DFFNSTGGLR GVKQDVYEGG IREPFIIKWP GKIAQGKTSD YPVVTYDLMA
TFADLLQVKA PKNDGISVLD LFKGSLPVAK RGFLYWEYPS KGGQLAIRIG NLKGVKTNIQ
KNKAAAWQIY DLSKDPGESN DIASSHPELP HAFDAIVKKE HTSPLRPEWD IFKSKKESTE
N