Gene Phep_0785 details

Gene Information

Locus tag: Phep_0785
Symbol:
ID: 8251874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 926663
End bp: 928081
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 45%
IMG OID: 644934435
Product: sulfatase
Protein accession: YP_003091069
Protein GI: 255530697
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
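
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Phep_0785
print(cds_lengths(926663, 928081))  # (1419, 472)
```

Both results agree with the Gene Length and Protein Length fields of the record.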


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.288009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAA TAAAAACGAT AAGTACTTTG TTGCTGGCCC TTTGGACAGG CATTAGTGCT 
GCACAGGTAA AAACTGCGGC CAAGCCCAAC GTGATTGTCA TTGTTAGCGA TGATGCCGGA
TATGTAGATT TTGGTTGTTA TGGTGGTAAA CAGATCCCCA CACCCAATAT TGATGCCATT
GCCAAACAGG GTACGCGGTT TACTGATGCA TATGTTTCGG CTTCAGTATG TGCCCCCTCA
AGGGCCGGAA TTTTAACCGG ACGTTACCAG CAGCGCTTTG GCTTTGAGCA CAATACATCA
AATGTTTTGG CCCCGGGGTA TAAAATAACT GATGTAGGAA TGGATCCTTC GGAACAGACC
ATTGGAAATG AAATGCAGGC AAATGGGTAT AAAACCATTG CAATTGGTAA ATGGCACCAG
GGTGATGAAC CTAAACATTT TCCGCTAAAC AGGGGCTTTA ACGAATTTTA TGGTTTTACA
GGGGGGCACC GTGATTTTTT TGCCTATAAA GGCAAAAGAA CCAATGAACA TGCTTTGTAC
AACAATAAAG AGATCGTTCC GGAAAATGAA ATTACCTATC TGACGGATAT GTTTACCGAT
AAGGCTACGT CTTTTATTAC AGCAAATAAA GACAAGCCCT TTTTTATGTA CCTTTCTTAC
AATGCAGTAC ACACGCCGAT GAATGCGAAA AAAGACCTGA TGGAGCGTTA TGCAAGTATA
GCCGATACCG GGCGCAGGGC CTATGCAGCC ATGATGACCT CATTGGATGA TGGAATTGGT
AAGGTAATGG CCACACTTAA GGCAAATCAG CTGGATAAAA ATACACTGAT CATTTTTATC
AACGACAATG GTGGCGCTAC AGTAAACTCT TCTGATAACG GGCCGTTAAG GGGTATGAAA
GGGTCAAAAT GGGAAGGTGG CATCCGTGTG GCCATGATGA TGAAATGGCC TGGACATATT
GCTGCAAATA AAACAGATAG CCGTCCGGTA AGCTCATTAG ATATCCTGCC TACGGCCATT
GGTGCCGGAA AAGGTAAACA AAAGGGTACA AAAAAGCTGG ATGGGGTAAA CTTACTTCCT
TATTTAAGTG CGGGTAATAA AAAGACACCC CACGAGGCGC TATATTGGCG AAGAGGCGTA
GCCGCAGCCA TGAGAGAAGG GAACTGGAAG CTGATCCGGG TTAAGGAAAG CCCCACCGTA
CAGAATGTAT TGTTGTTTGA CCTGAGTAAG GACCTTTCAG AGACTAAAAA CCTGTCGGAA
AAATATCCTG CCAAAGTAAA AGAGCTGCTT GTCAAACTTG CTGAATGGGA AAAAGGACTG
GACCAGCCGC ACTGGTATAG TTCTTACGGC GACCAGAACC AGATCATGAA GCACCGTATG
GAAACTACAG GCCGCGAGAT GGAGAGAATG TACCCTTAA
 
Protein sequence
MKGIKTISTL LLALWTGISA AQVKTAAKPN VIVIVSDDAG YVDFGCYGGK QIPTPNIDAI 
AKQGTRFTDA YVSASVCAPS RAGILTGRYQ QRFGFEHNTS NVLAPGYKIT DVGMDPSEQT
IGNEMQANGY KTIAIGKWHQ GDEPKHFPLN RGFNEFYGFT GGHRDFFAYK GKRTNEHALY
NNKEIVPENE ITYLTDMFTD KATSFITANK DKPFFMYLSY NAVHTPMNAK KDLMERYASI
ADTGRRAYAA MMTSLDDGIG KVMATLKANQ LDKNTLIIFI NDNGGATVNS SDNGPLRGMK
GSKWEGGIRV AMMMKWPGHI AANKTDSRPV SSLDILPTAI GAGKGKQKGT KKLDGVNLLP
YLSAGNKKTP HEALYWRRGV AAAMREGNWK LIRVKESPTV QNVLLFDLSK DLSETKNLSE
KYPAKVKELL VKLAEWEKGL DQPHWYSSYG DQNQIMKHRM ETTGREMERM YP
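
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the sequence is pasted as shown (10-base blocks separated by whitespace) and uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical; a library such as Biopython could be used instead.

```python
BASES = "TCAG"
# Amino acids in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base up to the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above
print(translate("ATGAAAGGAA TAAAAACGAT AAGTACTTTG"))  # MKGIKTISTL
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_percent` can be compared against the GC content field.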