Gene Phep_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3271 
Symbol 
ID8254390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3882807 
End bp3884735 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content40% 
IMG OID644936924 
Productsulfatase 
Protein accessionYP_003093528 
Protein GI255533156 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.269035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAGAG AAAATCCAAT AACCTTAAAC CTGTATGTGG CCCTGGCCTA CAGGTTCCTG 
ATTTTACTGG TTTTATATAC TTTATGCAGA TTGGGCTTCT TCTTTTTTAA CCATAGCCTG
TTCCAGCACA TTACCTTACC CAAATACCTG TACATGCTAT GGGGCGGACT AAAGTTTGAC
GTTTCGGCAC TCATCTATAT CAATGCCATC TTCCTTTTAA TGCAGCTGGT ACCTGCCCCT
TTTAAGTACA AAGATGGCTA TCAGCGCTTT TGCAAATGGC TTTTTATCAT TAGCAACAGC
ATTGGCATTA TGGCCAACTT TGCCGATTTT GCCTACTATA AATATACACT TAAAAGAACT
ACAGCAACTG TTTTTAGCCA GTTTAGCCAT GAACAGAACA AGTTTAAACT GTTTATAGAT
TTTTTAACAG ATTACTGGTA CCTGTTTCTG CTTTATGCCC TGTTTATATG GGGCTTTGTA
AAGCTTTACC AGCTGGTAGG TGTTAAAAAA GTAAAAACTT TTAAATGGCC TGCATACCTG
TTGCAAACCG TATTGCTGTT TGCTATTGCA CTGGTTTGCC TTACAGGTGT ACGTGGTGGC
TGGGGCTATG GCACCAGGCC CATTACGCTG AGCAATGCAG GAGAATTTGT GGATACACCA
GATCAGATGA GCCTGGTGCT GAACACACCT TTCTGTATAT TCAGGACTTT AAAAGTGTCT
AAACTAAAGC CTGTAAACTA TTACGATGAG CAGACATTGA ACAGCATTTA CAATCCCATA
CACCTGCCCA AAGATACGGT TGCCTTTAAA AAGCTGAATG TCGTTTTCCT GATCATAGAA
AGTTTGGGTA AGGAACATAT AGGCGGCTTA AATAAAGACC TGATGGGTGG AAAGTACAAA
GGTTTTACCC CTTTTATTGA TTCGCTGATT GAACAGAGCT ATACCTTTAC CCATACTTAT
GCCAATGGCC GTAAATCTAT AGATGCGCTA CCTTCTGTGA TTTCAGGTAT TCCTTCTATC
CGTGAGCCTT TTGTACTTTC GGTATACTCA GGGAATAAGA CCACCAGCAT TGCCAAGCTT
TTAGGTGATA AGGGGTATGA AACTGCCTTT TTTCATGGGG CACCAAATGG TTCGATGGGT
TTTTCCTCTT ATACCCATCT TGCAGGAATC AAACATTATT TCGGACAGAA CGAATATAAA
AAAACAGGAG ATTATGACGG TACCTGGGGC ATTTGGGACA ATCCTTTTAT GCAATATATG
GCCCAAACCA TGAATACGCT GAAGCAACCT TTTTTCTCAG CATTTTTCTC GCTTTCTTCT
CATCATCCTT TCAAGCTGCC CGATGAATAT GCAGGTAAAT TCCCTAAAGG TCATTTGCCT
GTACAGGAAG TACTGGGCTA TACAGACATG GCACTGCGTA ATTTTTTCAG AACGGCATCT
GCTATGCCCT GGTATAAAAA CACACTGTTT GTATTGTGTG CGGATCATGC CACAGTATCT
TACTTCCCTG AATATCAAAC CACTCCCGGA TATTTTTCTA TTCCGATTGT TTTTTATTAT
CCTGGCGGGG ATTTAAAAGG GAAAGCGGAT AAAAACGTAC AACAGATAGA TATTATGCCC
ACTGTTTTGA ATTATCTGCA TTATGACAAA CCTTATTTTG CGCTTGGCTT TGATGCTTTT
GACAAAAGAC AGGATAATTT TGTGGTAAAC AATAACGATG GTACCTTTAG CTTTTACCAG
GGCGATTATT TACTGATCAA TGATGGCAAG ATCAACCTTT CATTATACAA TTTAAAAACC
GACCGTCTTA CTCAAAACAA CATATTAGAT AAAGAACCGT TAATTGCACA ACAAATGGAA
AAATACCTGA AAGCTTTTGT GCAACAGTAC AACAACCGGA TGATTGAAAA CAAATTAACG
GCGAATTAG
 
Protein sequence
MKRENPITLN LYVALAYRFL ILLVLYTLCR LGFFFFNHSL FQHITLPKYL YMLWGGLKFD 
VSALIYINAI FLLMQLVPAP FKYKDGYQRF CKWLFIISNS IGIMANFADF AYYKYTLKRT
TATVFSQFSH EQNKFKLFID FLTDYWYLFL LYALFIWGFV KLYQLVGVKK VKTFKWPAYL
LQTVLLFAIA LVCLTGVRGG WGYGTRPITL SNAGEFVDTP DQMSLVLNTP FCIFRTLKVS
KLKPVNYYDE QTLNSIYNPI HLPKDTVAFK KLNVVFLIIE SLGKEHIGGL NKDLMGGKYK
GFTPFIDSLI EQSYTFTHTY ANGRKSIDAL PSVISGIPSI REPFVLSVYS GNKTTSIAKL
LGDKGYETAF FHGAPNGSMG FSSYTHLAGI KHYFGQNEYK KTGDYDGTWG IWDNPFMQYM
AQTMNTLKQP FFSAFFSLSS HHPFKLPDEY AGKFPKGHLP VQEVLGYTDM ALRNFFRTAS
AMPWYKNTLF VLCADHATVS YFPEYQTTPG YFSIPIVFYY PGGDLKGKAD KNVQQIDIMP
TVLNYLHYDK PYFALGFDAF DKRQDNFVVN NNDGTFSFYQ GDYLLINDGK INLSLYNLKT
DRLTQNNILD KEPLIAQQME KYLKAFVQQY NNRMIENKLT AN