Gene Phep_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3852 
Symbol 
ID8254986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4622249 
End bp4623655 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content43% 
IMG OID644937516 
Productsulfatase 
Protein accessionYP_003094105 
Protein GI255533733 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATAT ACCTTTTTAC CGTTTTATTG GTTTTAGCGA CAATCCATTT AAGTGCCCGC 
CAAAAGCCCA ATGTGATCTT TATTTTAGCA GATGACATGG GCTATGGCGA TTTAGGCTGT
TACGGCCAAC AACTCATAGA AACGCCTAAC ATTGACAAGC TGGCTGCAAA TGGAATCCGA
TTTACTCAAT TTTATGCTGG CACTTCGGTA TGTGCTCCAT CAAGGGCATC TTTAATGACC
GGCTTACATA CCGGCCATAC GCCAATAAGG GGTAATCATG AAATTAAACC AGAAGGACAG
CTACCCTTAC CCAAGGACAC CTATACCTTG GCCAGACTAT TTAAAGCTGC TGGTTATAAG
ACCGAGGCAT TTGGTAAATG GGGACTGGGC TATCCAGGTT CTGAAGGCGA CCCGGTAAAA
CAGGGCATAG ATCAGTTTTA TGGCTACAAT TGCCAGCGCC AGTCGCATAA CTTCTTTCCA
GACCATTTAT GGGACAACGA AAAACGTGTT GAACTGGGCA ATACTTTAAG CCAGCAAACA
CAATATGCCC CCGAACTGAT CCAGAAACAG GCTATGTCTT TTATGAAAGC AAATCAATCC
AACCCTTTCT TTCTGTATCT GGCCTATACC CTGCCCCATG CAGCATTACA GTTACCCAAA
AACGACCAGG CATTTGAATA CTATAAAAAG AAATTTAAGG AACAGCCCAA GCCTGTAAAA
GAAAACTGGG ACGGCATTGC TTATCAACCG CAGCCTTACC CACACGCAGC TTACGCAGCT
ATGGTAAGCA AACTGGACAA TTATGTAGGT GAAGTAGTAA AACAGCTCAA GGCACTTGAC
CTGGAAAAAC AAACGCTGAT CGTTTTTACC AGTGACAATG GTCCGCACAA TGAAGGTGGA
AATGAACCTG CTTTTTTTAA CAGCAGTGCT GGCTTTAAAG GAATAAAAAG ACAGCTTACG
GAAGGTGGGA TTAGAGAACC AATGATTGTT AGCTGGCCAG GCAAGATCAA AGCCGGGCAG
AGCTCGGCAC ATATTGGTGC ATTCTGGGAT TTTATGCCAA CTTTTGCAGA ACTGACGGCT
CAACCCCTGC CTGTTAAAAC AGATGGATTA TCCATACTAC CTGTATTGCT AAATAAAGGC
ACACAAAAAC AACATGATTT TTTATACTGG GAGTTTCATG AACAGGGCGG AAGACAAGCC
CTGAGAATGG GTAAATGGAA GGCCATCCGG GAAAAGGTTA AACAGGATGC AAATGGCCCA
ATTTTGTTAT ATGATCTGGA TATTGATCCA AAAGAGCGCA ACGACCTGGC TGCCAAACAT
CCGGAAGTGG TAAAAAAAGC GGCATTGCTT ATGCAGCATG AGCATGTAGA AAACAAGGAT
TTCCCACTAA TTTCAAATAA ATTTTAA
 
Protein sequence
MRIYLFTVLL VLATIHLSAR QKPNVIFILA DDMGYGDLGC YGQQLIETPN IDKLAANGIR 
FTQFYAGTSV CAPSRASLMT GLHTGHTPIR GNHEIKPEGQ LPLPKDTYTL ARLFKAAGYK
TEAFGKWGLG YPGSEGDPVK QGIDQFYGYN CQRQSHNFFP DHLWDNEKRV ELGNTLSQQT
QYAPELIQKQ AMSFMKANQS NPFFLYLAYT LPHAALQLPK NDQAFEYYKK KFKEQPKPVK
ENWDGIAYQP QPYPHAAYAA MVSKLDNYVG EVVKQLKALD LEKQTLIVFT SDNGPHNEGG
NEPAFFNSSA GFKGIKRQLT EGGIREPMIV SWPGKIKAGQ SSAHIGAFWD FMPTFAELTA
QPLPVKTDGL SILPVLLNKG TQKQHDFLYW EFHEQGGRQA LRMGKWKAIR EKVKQDANGP
ILLYDLDIDP KERNDLAAKH PEVVKKAALL MQHEHVENKD FPLISNKF