Gene Phep_2247 details


Gene Information

Locus tag: Phep_2247
Symbol:
ID: 8253353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2607917
End bp: 2609398
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 43%
IMG OID: 644935896
Product: sulfatase
Protein accession: YP_003092513
Protein GI: 255532141
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
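
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2607917
END_BP = 2609398


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values in this record, the computed lengths agree with the listed Gene Length (1482 bp) and Protein Length (493 aa).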


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAG CTACAGTATT ATTTTTTACC CTTTCTGTCC TTATTTTATT TAGCAGTCAT 
AAATATGTTC CGGCACCTAC CGCAAAACCT TATAACGTGC TTTTTATTTT TGTTGACGAC
CTTCGTCCCG ATCTGGGCTG TTATGGTAAC CGCATTATAA AATCTCCCCA TATAGATGCC
TTGGCTGCAC AATCGGTTCT TTTTAAGCAA CAATTTGTAA CAGTACCTAC CTGTGGGGCT
TCAAGGGCCA GTATACTTAC AGGTTTAAGG CCCCGTTCAG TAAATGATCT TTCCAATGAG
GCTTTTGAAC TTAAACCAAA AAGCCAGAAT ATACCCGAAT CTTTTATTGC GTTACTTAGA
CAGCAAGGAT ATTATACGGT AGGGATAGGT AAAATAAGTC ACTCACCCGA TGGTTATGTA
TATAAATACC TGGAACCAAA AAGTTCACAA ATGGAACTGG AAAGAAGCTG GGATGAAATG
CTTTTTAATG CAGGTAAATG GAAAACAGGG TGGAATGCCT TTTTTGGCTA TGCCGATGGC
AACAACAGAA ATGAATTAAA AGGCGAAGTA AAACCTTATG AACATGCGCC TGTAAGTGAC
AGCAACTACC CCGATGGCTT AACAGCCGAA ATGGCAGTCA GCAAGTTAAA AGAACTGAGT
ACAAAAGAGA AACCTTTTTT TTTGGGTGTA GGTCTGTTTA AGCCCCATCT ACCATTTACT
GCGCCGCAGA AGTATTGGGA TTTATATCAG GAGGCCGACA TCAGCTTAAC ACCATCACCA
GATATACCAG TAGATGTTAA TCCTGTCAGT TTGCAGGAAA GCGGGGAGTT TAACGGGTAT
AAAAAGGGGG AAGAAAGAGC CTCACTGGCC AAGCCTGTAT CTGATGCTTA TGCCCGTAAA
CTTCGTCATG CCTATTATGC TGCAGTAAGT TATTCAGATG CCCAGATAGG TAAAATACTG
GATGAACTGA AGCGAAGCGG AAAGGATAAA AATACCATTG TGGTATTATG GGGTGATCAT
GGCTGGCACC TGGGCGACGA CCGCGTTTGG GGTAAACATA CGCTATCTGA ATGGGCCTTG
CACAGTCCCC TGATCATAAA GGTACCTGGT TTGCCCCAGG CCATAAACAA TAATGTGGTG
AGCGCTGTAG ACGTGTATCC TACTTTAATG GAACTCTGCG GAATAAAGAA GCCAGCGCAT
ATTGACGGGA CAAGTCTGGT ACCTGCATTA AAAAATCCGC TTGCCAGTTC AGCGGGCGGT
ATAGCCTACA GTTATTTTAA GAAAGGGATC AGTCTGCGTA CAGACCGTTA CCGTTTAACA
AAATACTTCC GGGCCGCAAT GCCTGCAATT GAATTATACG ATCACCAGAC AGATCCTTAT
GAAAATAAGA ACATAGCAGC ACAGCAGCCG GAACTGGTTA AACAATTGAT GGTTTTACTG
GAAAAAGGTA ATACCGGTTT ATACAACAAG CCGGTAAATT GA
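
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. As a minimal sketch, the GC content field could be recomputed along the following lines. The helper names are hypothetical, and only the first line of the sequence is used here for brevity; passing the complete sequence would give the whole-gene value, which the record lists as 43%.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, kept in its block-of-ten layout.
first_line = "ATGAAAAGAG CTACAGTATT ATTTTTTACC CTTTCTGTCC TTATTTTATT TAGCAGTCAT "
seq = clean_sequence(first_line)
print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC in this fragment")
```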
 
Protein sequence
MKRATVLFFT LSVLILFSSH KYVPAPTAKP YNVLFIFVDD LRPDLGCYGN RIIKSPHIDA 
LAAQSVLFKQ QFVTVPTCGA SRASILTGLR PRSVNDLSNE AFELKPKSQN IPESFIALLR
QQGYYTVGIG KISHSPDGYV YKYLEPKSSQ MELERSWDEM LFNAGKWKTG WNAFFGYADG
NNRNELKGEV KPYEHAPVSD SNYPDGLTAE MAVSKLKELS TKEKPFFLGV GLFKPHLPFT
APQKYWDLYQ EADISLTPSP DIPVDVNPVS LQESGEFNGY KKGEERASLA KPVSDAYARK
LRHAYYAAVS YSDAQIGKIL DELKRSGKDK NTIVVLWGDH GWHLGDDRVW GKHTLSEWAL
HSPLIIKVPG LPQAINNNVV SAVDVYPTLM ELCGIKKPAH IDGTSLVPAL KNPLASSAGG
IAYSYFKKGI SLRTDRYRLT KYFRAAMPAI ELYDHQTDPY ENKNIAAQQP ELVKQLMVLL
EKGNTGLYNK PVN
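
The protein sequence follows from the gene sequence by translation with table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a plain-Python illustration, not the pipeline that produced this record. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The function and variable names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# For elongation, these are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence above (both in frame).
start_of_gene = "ATGAAAAGAG CTACAGTATT ATTTTTTACC"
end_of_gene = "CCGGTAAATT GA"
print(translate(start_of_gene))  # MKRATVLFFT
print(translate(end_of_gene))    # PVN
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above; the final TGA codon is the stop and contributes no residue.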