Gene Phep_2826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2826 
Symbol 
ID8253934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3356938 
End bp3358440 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content43% 
IMG OID644936472 
Productsulfatase 
Protein accessionYP_003093087 
Protein GI255532715 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.250394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA ACAAATTGAA ATATTTCCCT GCAGCACTTT CAATGGTGCT GATATGGGCT 
TCCTGCACTT CGCCGGAAAA AAAAACGGAT CGTCCGAATA TCCTGATGAT CATGTCCGAT
AACCAATCCT GGAACCACGT AGGGAGCTAT GGTGATCAAA CGGTACGCAC GCCCAATATG
GACCGGATTG CGAAAGAAGG GGTACGTTTT ACCAATGCTT TTTGCAGTTC ACCTTCCTGT
ACGCCCGCAA GGGCTGGAAT GCTGACCGGA CAGGATATAT GGAGGTTAGA AGATGGGGGC
AATTTATGGG GTGTTTTACC GGTTAAATAT AAAGTATATC CGGATTTGCT GGAAGAAGCT
GGCTATGCCA TAGGTTTTCA GGGAAAAGGC TGGGGCCCGG GAAGCTTTGA GGCCAATAAA
CGCCCAAGAA ATCCTGCAGG GAATGAGTTT AAAAGTTTTG GCGCATTTTT AAAAGATAAA
AAAGAAGGTC CCTGGTGTTA TTGGATCAGT AGTCATGAAC CTCACCGTCC TTATGTGGAA
GGTTCCGGCG AAAAAGCTGG TATCGATCCA AATAAAGTAA AAGTTCCTGC CTATTTGCCA
GATCATATCA GTATAAGAAA AGACATTGCA GATTACTACG CTGCGGTTGA AACCTTTGAT
CGTGAACTGG GCGAGGCCCT TGACCAGTTG AAAGCAAGTG GTGAGCTGGA CAATACGGTA
ATTGTGGTAT GCAGTGACAA CGGCTGGCAA ATGCCGCGTG GACTGGCCAA CTTGTACGAT
TTTGGTACAC ATGTGCCCCT GATCATTTCA TGGCCAGGTA AGTTTAAACA GGATGTAGTT
GCCGATAACC TGGTCACACT GAATGACCTT GCCCCAACAT TCTTACAACT GGGTAAGGTA
CCTGTACCGG CCGATATGAC GGGTAAAAGT TTATTGCCCA TTGTTGAGGC AGGTAAAAAA
GATGAAAAAC CCCGGGATTA TGTAGTACTG GGAAGAGAGC GTCATGCATT CGTTCGTCGG
CATGGCCTTG GCTATCCTGG CAGGGCAATT CGTACTAAAG ATTATCTTTA CATTAAAAAT
TATGAACCAA ATAGATGGCC GGCAGGTGAT CCGCCGTTTT ATGGAGACAT TGATCCCTAC
ATGTTCAACT GGCCGGGTGA AACCAAATAT TACCTGATAG AACATAAAGA TGATCCGAAA
GTAAAGTCTT TCTTTGAACT GGGAATGGGC AAACGTCCGG CAGAAGAATT ATTTGATATC
AATAAAGATC CGGATGAATT ACACAATCTG GCAGCACTTC CTGAATATCA AAAAATAAAA
CAGGAGCTTG TTGCTAAATT GCGTAATTAT TTGGTAGCAA CGAAAGATCC GAGAGAAACT
AATGGTAATA TACAGATCTG GGATACTGCT GCTTATTTTA GTGAAATAGA TAAAACGCCA
AAACCAAGTA AAGAGATGCA AAAGCGTTTT AAATTAGATT CCAGTTACAA TTATTTGAAG
TAA
 
Protein sequence
MKFNKLKYFP AALSMVLIWA SCTSPEKKTD RPNILMIMSD NQSWNHVGSY GDQTVRTPNM 
DRIAKEGVRF TNAFCSSPSC TPARAGMLTG QDIWRLEDGG NLWGVLPVKY KVYPDLLEEA
GYAIGFQGKG WGPGSFEANK RPRNPAGNEF KSFGAFLKDK KEGPWCYWIS SHEPHRPYVE
GSGEKAGIDP NKVKVPAYLP DHISIRKDIA DYYAAVETFD RELGEALDQL KASGELDNTV
IVVCSDNGWQ MPRGLANLYD FGTHVPLIIS WPGKFKQDVV ADNLVTLNDL APTFLQLGKV
PVPADMTGKS LLPIVEAGKK DEKPRDYVVL GRERHAFVRR HGLGYPGRAI RTKDYLYIKN
YEPNRWPAGD PPFYGDIDPY MFNWPGETKY YLIEHKDDPK VKSFFELGMG KRPAEELFDI
NKDPDELHNL AALPEYQKIK QELVAKLRNY LVATKDPRET NGNIQIWDTA AYFSEIDKTP
KPSKEMQKRF KLDSSYNYLK