Gene Phep_3390 details


Gene Information

Locus tag: Phep_3390
Symbol:
ID: 8254509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4030364
End bp: 4031905
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 44%
IMG OID: 644937042
Product: sulfatase
Protein accession: YP_003093646
Protein GI: 255533274
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
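
The length fields above are consistent with the coordinates, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4030364
end_bp = 4031905

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the listed Gene Length and Protein Length.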


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0663726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT ATAAACTTCT GATAATAGCC TTACTGACCG TCAAAACCAC TCTTTTTGCA 
CAAAATCAAA AACCAATCCG CCCGAACATC ATTGTTTTTA TGGTTGATGA TATGGGCTGG
ATGGATACTT CAGTGCCTTT TTATGACAGC ATTATGCCCC TGAACAAGCG TTACCATACA
CCCAATATGG AGCGGCTGGC ACAGGCAGGA ATGAAATTTA CAGATGCTTA TGCGCAACCG
GTCTGCACCC CTTCCAGAGT GAGTTTTATG ACAGGCATGA ATGCCTCCAG GAGCCATGTG
ACCAATTGGA CATCACCCCT TAAAAATAAT GATGCTGATG AAAAGGATGA GCAGTTTGAG
CCCCTGGAAT GGAACATGAA CGGTTTAAGC AATAATGCAG AAACGGAGCG CACAGTTTTT
GCAACACCCT TTCCACAATT GTTAAAAGAT GCCGGTTATT ATACCATTCA TATTGGCAAG
GCCCATTGGG CTGCAATAGG TACCCCAGGT GCAAGCCCTT ACAACTTAGG GTTTATGGTT
AACATTGCCG GGCATTCGGG CGGACATCCG CAAAGCTATT TATCGGAGCA AAATTATGGC
AATATGCCTG GTAAGACACA GGTACAGGCG GTACCCGATT TAGAGGCGTA TTTTAAAACA
GGTACTTTTC TTTCAGAAGC CCTTACACAG GAAGCCCTGA AAACAATGGA AACACCAATA
GCCAGAAAGG AGCCTTTTTA TCTGAATATG GCCCATTATG CTGTTCATAC CCCTATAATG
GCCGATCCGC GCTTTGTACA AAAGTATTAT GATGCGGGGC TTGACAGTAC CGAGGCCAGA
TATGCGAGTC TGGTTGAAGG TATGGACAAA AGCCTTGGTG ATATTATGGA TTATTTAAAA
AAGAAAGGTG TGGACAAAAA TACCATCATT ATTTTTATGA GCGATAACGG AGGGCTAGAC
CATCACCAAA GGGGAGGTGC CCTAAATACA CACAACTACC CCCTCAGGTC GGGCAAAGGC
TCGGTATATG AAGGTGGGAT AAGGGAACCC ATGATTGTAA GATGGCCAGG TGTAACCAGC
GCAGGTTCAG TTTATAAAAA TCCGGTGATC ATTGAAGATT TTTTCCCTTC CATATTGGAA
ATGGCAGGGG TTAAGCCAGC TAAGATACTC CAAAAAACGG ATGGGCAAAG TTTTGTGAAA
TACCTCAAAA ACCCTCATTT AAAAGCGCCG GACCGTCCTT TGGTATTTCA TTATCCCAAC
AAATGGATTA ACCTGACCGC GAATGAAAAA CTGGGTATCA ATTATTTTAC TGCGCTTCGT
TTAGGGAACT GGAAGCTGTT ATACAATATG CGGAACCGGG AGTTTGAACT GTATGATCTG
GCCCAGGATA TCCGGGAGGC AAATAACCTG GCTGGTAAAT ATCCGGGAAT GGTAAAAAAA
CTGGCCCTGG TATTGGGCAA AACACTTAAA GAACGGAATG CACAGCTTCC ACGTGAAAAA
GCAAGTGGAA AAGTGATTCC GTTCCCTGAT GAAGTACAAT AG
 
Protein sequence
MKKYKLLIIA LLTVKTTLFA QNQKPIRPNI IVFMVDDMGW MDTSVPFYDS IMPLNKRYHT 
PNMERLAQAG MKFTDAYAQP VCTPSRVSFM TGMNASRSHV TNWTSPLKNN DADEKDEQFE
PLEWNMNGLS NNAETERTVF ATPFPQLLKD AGYYTIHIGK AHWAAIGTPG ASPYNLGFMV
NIAGHSGGHP QSYLSEQNYG NMPGKTQVQA VPDLEAYFKT GTFLSEALTQ EALKTMETPI
ARKEPFYLNM AHYAVHTPIM ADPRFVQKYY DAGLDSTEAR YASLVEGMDK SLGDIMDYLK
KKGVDKNTII IFMSDNGGLD HHQRGGALNT HNYPLRSGKG SVYEGGIREP MIVRWPGVTS
AGSVYKNPVI IEDFFPSILE MAGVKPAKIL QKTDGQSFVK YLKNPHLKAP DRPLVFHYPN
KWINLTANEK LGINYFTALR LGNWKLLYNM RNREFELYDL AQDIREANNL AGKYPGMVKK
LALVLGKTLK ERNAQLPREK ASGKVIPFPD EVQ
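
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. A minimal sketch of that cleanup follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAAAT ATAAACTTCT GATAATAGCC TTACTGACCG TCAAAACCAC TCTTTTTGCA"
seq = clean_sequence(first_line)

print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same helpers would allow the listed Gene Length and GC content to be compared against the sequence itself.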