Gene Phep_3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3537 
Symbol 
ID8254658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4207326 
End bp4209278 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content42% 
IMG OID644937188 
Productsulfatase 
Protein accessionYP_003093790 
Protein GI255533418 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACA AATTATTTAA AGGCAGGTAT AGCAGCTTGT TCTCTTTTCT ACTTGTCTTT 
ATTTTTAGCT CTTTCCTGAT CAGGACTGTA TTGCTCTTTA TCTCAATTGG AAAAGCAGAT
TTTACTATTT TAGGCGTAAT TCAGATTTAC CTGCTGGGCT TTGTTTATGA CCTGGCTGTA
GGTTTGTTTT TAACTGGCTT GTATAACCTG TACCTGCTTT TTCTGCCAGG TAAATGGGCC
AATTCTATAG CAAATAAGGT CCTTACTTAT GCGGGGCTTT TTATCATCTT GCTCATTTCT
TTCTTTTCTT TTTTTGCCGA ACTTACTTTT TGGCAGGAAT TTGAGAGCAG GTTTAATTTT
ATTGCTGTAG ATTACCTCAT CTACACTACC GAAGTAATCA ATAACATCAT TGAATCGTAT
CCTTTGCCAT TACTGATCAG CGGGATATTG CTGCTGGTGG TATTGGTGTT CTGGTTGTTT
ACCAAAAAGA AGGTCTTCCG GTATACCTTT CAGTCGGCCA TGCCATTTAA ACAAAGAATA
GCAATATCTG GTGTTTTATT GCTGGCAACC ATCGTTTATC CGCTGGTGCT CAGCAATTCT
TTTGCAGAAT CTGGTACCAA TCGTTACCAG AACGAGCTTT CAAAAGCAGG TATCTATTCC
ATTTTTGCAG CCTTTAAAAA CAATGAACTT AATTATAAGG ATTTTTATGC CTTGCTTCCT
GACGATAAAG CTTTTGCCCT GATGCGAAAA CAGCTGCAGG ATCGGCACAG TGCATTTGTC
AGTACAGGCC ATTCGATAAA GAGAACAGTT AAGAGTGACA AACCTTTGTA TAAACCCAAT
GTGATCATGA TTACGGTGGA GAGCTTGAGT GCAGATTTTC TTGGGCATTT TGGCAATACA
CAGCATTTAA CCCCGGTACT GGATTCGCTG TCGCAACACA ACCTGGTATT TAACAATATG
TTTGCAACGG GCACCCGTAC CGTAAGGGGA ATGGAAGCCC TCTCGCTTGC TATTCCTCCA
ACACCGGGAA GCAGTATTGT AAGGCGAAGT AAAAATGAGA ACCTGTGTAC TGTTGGTTAT
ATTTTTCAAC AGGCAGGTTA TACCCGGACT TTCTATTATG GTGGCGATGG TTATTTCGAT
AACATGAATG AGTATTTTGG CAGTAATGGT TTTGACATTA CTGACAGGGG CAGGAACATT
AAGGTAGGCG AGAGTTACCT GACCAAAAGG ACCATCATTC AGGATAAACA GGTAACCTTT
GAAAATGCAT GGGGAATATG TGATGGCGAT CTTTTTGATG CCGTGATAAG GGGTGCCGAC
CAAGATTACC AGAATGGTAA GCCTTTCTAC AATTTTGTAA TGACCACCTC TAACCACAGG
CCTTTTACTT TTCCTGATGG TAAAATTGAG GCCAAAGTGA AGAACAGGGA GGCTGCAGTG
CGGTATACAG ATTTTGCTAT AGGCGACTTT TTGAAAAAGA TGCAGAAGAA GGCGTGGTTT
AAAAATACAG TGGTGATTAT TGTGGCCGAC CATTGTGCGG CCAGTGCGGG AAAAAATGAG
ATCGACATCA GTAAATATCA CATTCCCTGT ATTGTACTGA ACCTGCCGGT AAAAGGTAAA
GTAGCAATTG ATCAACTTTG CTCGCAGATA GACCTGTACC CCACTTTGTT CGACCTGCTG
GGCTGGAATT ACGAGAGTAA CCTGTATGGA CAGAATGTTT TAGAACCAGG CTACCAGCCC
CGTGCTGTAT TAGGTACCTA CCAACAGCTG GGATATTTAA AGCAGGACAG CCTGGTTATA
TTGGGGCCAC AGCAAAAAAC CGAGACCTTT ATTTATCACC GGGAGAACAA TGAACAGGTA
CCCAATCCGT TGTCCAGAAC GGTTATTGAG CAGGCCATGG CCAATTATCA AACGGCTTAC
GACCTGTTTA AAAACGGTGG CCTGCACCAG TAA
 
Protein sequence
MLNKLFKGRY SSLFSFLLVF IFSSFLIRTV LLFISIGKAD FTILGVIQIY LLGFVYDLAV 
GLFLTGLYNL YLLFLPGKWA NSIANKVLTY AGLFIILLIS FFSFFAELTF WQEFESRFNF
IAVDYLIYTT EVINNIIESY PLPLLISGIL LLVVLVFWLF TKKKVFRYTF QSAMPFKQRI
AISGVLLLAT IVYPLVLSNS FAESGTNRYQ NELSKAGIYS IFAAFKNNEL NYKDFYALLP
DDKAFALMRK QLQDRHSAFV STGHSIKRTV KSDKPLYKPN VIMITVESLS ADFLGHFGNT
QHLTPVLDSL SQHNLVFNNM FATGTRTVRG MEALSLAIPP TPGSSIVRRS KNENLCTVGY
IFQQAGYTRT FYYGGDGYFD NMNEYFGSNG FDITDRGRNI KVGESYLTKR TIIQDKQVTF
ENAWGICDGD LFDAVIRGAD QDYQNGKPFY NFVMTTSNHR PFTFPDGKIE AKVKNREAAV
RYTDFAIGDF LKKMQKKAWF KNTVVIIVAD HCAASAGKNE IDISKYHIPC IVLNLPVKGK
VAIDQLCSQI DLYPTLFDLL GWNYESNLYG QNVLEPGYQP RAVLGTYQQL GYLKQDSLVI
LGPQQKTETF IYHRENNEQV PNPLSRTVIE QAMANYQTAY DLFKNGGLHQ