Gene Phep_3489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3489 
Symbol 
ID8254609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4152384 
End bp4153991 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content42% 
IMG OID644937139 
Productsulfatase 
Protein accessionYP_003093742 
Protein GI255533370 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTAA GAACTCAGAC CATTCAGATC ATCTCTTTTT TAGGGCTGGT TACCATGGGT 
TTTCAGGTAC TTGCTCAAAA GCAGGAGAAG CCAAACGTAT TGTTTATCGC TGTTGATGAC
CTGAAGCCTA TTTTAGGTTG TTATGGCGAT CGGTTGATTA AAACACCCAA TATAGACCGG
TTGGCAAAAA TGGGTACGGT TTTTAAAAGC AACTATTGCC AGCAGGCAGT TTGCGGTCCA
ACCAGAGCGA GTATCATGAC CGGAATGCGC CCTGATATAA CCAAAGTATG GGATTTGAAA
ACAAAGATGA GGGATATGAA TCCTGATATT CTGACCATCC CGCAATACTT TGCCAGTCAG
GGATACTCCA CGCAGGCTAT CGGTAAGATA TATGATCCAA GATGTGTGGA TGAGGATTTA
GATAAACCAA GCTGGACCGT TCCACATTAC AGAACAGATA AAAAATATTA TGCTGCCTCT
ACCGGACAGC CTGTTTTAAA TTATTATCAG GGAAAAGAGA TTAAATCACT GGTTGAAAAA
CGCAGGGCTG AGGCTAAAGG AAAGATCATA ACCGATCAGG AATTGTTGGC TACGATCAAA
CCATCGGTAG AATGTGTGGA TGTACCCGAT CAGGCATATA TTGACGGAGC CAACATCCTG
CAGGCAAAGG ATATTTTAAC AACACTCCAA AAGAAAAGCC AACCCTTCTT TTTTGCCGTA
GGCTTTGCCA AACCTCATTT GCCCTTTAAT GCACCGAAGA AATACTGGGA CCTGTATCAG
CGGGAGGATA TGCCGGTTGC AGCGTTTCAG GAAAAATCTA AAAATGCAGT GGATGTAGCT
TACCACAATT CGGGGGAACT CAGGGCTTAT TCAGATATTC CGGATTTATT ATCTTTTACT
GATCAGAAAA GCTATGGGCT AACTTTACCC ATAGCTAAAC AAAAAGAACT GATACATGGA
TACTATGCAG CGGTTTCTTA TGTAGATGCA CAGGTAGGCA TCTTATTAAA TGCCCTGGAC
TCACTGGGTT TAAGTAAAAA CACGGTCATT GTACTTTGGG GCGACCACGG ATGGCATTTA
GGCGATCATA ACCTTTGGTG CAAACATTCC GATTTTGAAC AGGCCACCCG TAGCCCTTTG
ATCTTTTCAG CTCCAGGTAT TAAATCCTCC GCCACTACTT CCCTTTCAGA ATTTGTAGAT
GTTTTTCCTA CGCTTTGCAA TTTAGCCGGT ATTCCGGTGC CCCAGCATTT AGAGGGTACC
AGTCTGGTTC CATTGATGCG AAATCCTGCC TCTTCGATAA AGGAATTTGC GATCAGCCAG
TATCCCCGAA GTTCAAATGC TGTGGAAACA CAACGAATGA CAGACGCTTC AGCGAAGGTT
ATGGGTTATT CACTTCGCAC AAAAAGATAT CGTTACACGA TATGGATGGA GAATTTCAGG
AGTAACCAGG CATTTAAGGC TACCGCTGTT GTTGGTGATG AATTGTATGA TTATCAGAAG
GACCCGCTTG AAAAAATAAA TGTAGTGAAG GATAGAAATT ATGCACTGAT CGCCAAAAGT
TTAAAGGATA AAATGATCAG GTATTTTCAT AGTAAAGAAA AGCCGTAA
 
Protein sequence
MILRTQTIQI ISFLGLVTMG FQVLAQKQEK PNVLFIAVDD LKPILGCYGD RLIKTPNIDR 
LAKMGTVFKS NYCQQAVCGP TRASIMTGMR PDITKVWDLK TKMRDMNPDI LTIPQYFASQ
GYSTQAIGKI YDPRCVDEDL DKPSWTVPHY RTDKKYYAAS TGQPVLNYYQ GKEIKSLVEK
RRAEAKGKII TDQELLATIK PSVECVDVPD QAYIDGANIL QAKDILTTLQ KKSQPFFFAV
GFAKPHLPFN APKKYWDLYQ REDMPVAAFQ EKSKNAVDVA YHNSGELRAY SDIPDLLSFT
DQKSYGLTLP IAKQKELIHG YYAAVSYVDA QVGILLNALD SLGLSKNTVI VLWGDHGWHL
GDHNLWCKHS DFEQATRSPL IFSAPGIKSS ATTSLSEFVD VFPTLCNLAG IPVPQHLEGT
SLVPLMRNPA SSIKEFAISQ YPRSSNAVET QRMTDASAKV MGYSLRTKRY RYTIWMENFR
SNQAFKATAV VGDELYDYQK DPLEKINVVK DRNYALIAKS LKDKMIRYFH SKEKP