Gene Phep_2825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2825 
Symbol 
ID8253933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3355536 
End bp3356930 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content45% 
IMG OID644936471 
Productsulfatase 
Protein accessionYP_003093086 
Protein GI255532714 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.424556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.232256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGT ACAAATCGAA AGGCTGGTTG ATAGCCATGC TTATACTTGC AGGTTTTGGA 
GATGCAGGGG CGCAAACCTC AAAAGTAGCA GCTTCCAGGC CTAACATCAT TATCATCATG
ACAGATCAGC AAACAGCTGA TGCCATGAGC AATGCTGGTA ATAAGGACCT GCATACACCT
GCAATGGATG TTTTGGCTGC AAACGGTACC CGTTTTACAC GTGCCTATTG TGCCCAGCCG
CTCTGTACAC CTTCACGCTC CGCGATATTT AGCGGAAAAA TGCCACATGA AACCGGCTTT
ACGGGGAATA CACCGGAAAA GGACGGACAG TGGCCCGATT CTGTGCTGAT GATGGGCAAA
ATATTTAAGG CAGGAGGCTA TAAAACCGGC TACGTCGGAA AATGGCACCT GCCTGTTCCT
GTTACTAAAG TAGCACAACA TGGATTTGAG ACTATTGAGA ATACAGGTAT GGGCGATTAT
ACCGATGCAG TTACCCCATC GCAATGCGCC AACTTCATTA AAAAGAATAA AGACAACCCA
TTTTTACTGG TAGCATCCTT TTTGAACCCA CACGATATTT GTGAATGGGC AAGGGGTGAT
AATTTGAAAA TGGATGTTCT GGATGCAGCG CCGGATACAG CATTTTGTCC GAAATTACCT
GCCAACTGGC CAATTCCGGC TTTTGAGCCT GCCATTGTAA GGGAACAGCA AAAGGTGAAC
CCGCGTACTT ATCCTTCGGT AGGCTGGAAC GAAAGCCAGT GGCGCAAATA CCGCTGGGCC
TATAACCGCC TGGTAGAGAA GGTAGACAAT TATATGGCCA TGGTATTGGG TTCGTTAAAA
AAATATGGTA TAGAAGACAA TACCATCATC ATCTTTACCA GCGATCATGG TGATGGTTAT
GCGGCACATG AGTGGAACCA GAAGCAGATT TTGTATGAGG AGGCTGCCAG GATACCTTTT
ATCATCTCGA AGATCGGACA ATGGAAAGCC AGAACCGATG ATCAGCTGGT TTGCAATGGC
ATCGATATTA TCCCCACCAT ATGTGGCTTT GCCGGAATTG CTAAACCTGT TGGTTTAAAA
GGCCTGGATT TAAGTAAACG TATTGCCAAC CCTTCGGTTA AACTACGGGA TACTTTAGTG
ATAGAAACCG ATTTTGCTGA TAACGAACTG TTGCTGGGTA TTAAGGGCAG GGCAGTGATT
ACCAAAGATT TTAAATACAT TGTTTATGAC AAGGGGGAGA TCCGGGAACA ATTGTTTGAC
CTGGAAAAAG ACGCAGGAGA AATGGATAAC CTGGCTGTTA AACCCGCCTA TAAAAAGAAA
TTGAATGAAA TGCGCGCTTA CCTGAAACTA TGGTGTAAAC AGCACCAGGA TTCGTTTTAT
GCATTAAAAA AATAA
 
Protein sequence
MKMYKSKGWL IAMLILAGFG DAGAQTSKVA ASRPNIIIIM TDQQTADAMS NAGNKDLHTP 
AMDVLAANGT RFTRAYCAQP LCTPSRSAIF SGKMPHETGF TGNTPEKDGQ WPDSVLMMGK
IFKAGGYKTG YVGKWHLPVP VTKVAQHGFE TIENTGMGDY TDAVTPSQCA NFIKKNKDNP
FLLVASFLNP HDICEWARGD NLKMDVLDAA PDTAFCPKLP ANWPIPAFEP AIVREQQKVN
PRTYPSVGWN ESQWRKYRWA YNRLVEKVDN YMAMVLGSLK KYGIEDNTII IFTSDHGDGY
AAHEWNQKQI LYEEAARIPF IISKIGQWKA RTDDQLVCNG IDIIPTICGF AGIAKPVGLK
GLDLSKRIAN PSVKLRDTLV IETDFADNEL LLGIKGRAVI TKDFKYIVYD KGEIREQLFD
LEKDAGEMDN LAVKPAYKKK LNEMRAYLKL WCKQHQDSFY ALKK