Gene Phep_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3047 
Symbol 
ID8254163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3643027 
End bp3644538 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content43% 
IMG OID644936700 
Productsulfatase 
Protein accessionYP_003093307 
Protein GI255532935 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0209579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA CAACAATTCT GATTTTAGCA TTCCTTCTTT TTGGCACAAC GATTCATCTA 
TCAGCCCAAA CGGCAAAACC AAATGTTATT TTTATTTATG CAGATGATTT GGGTTATGGC
GACCTGAGCT GCTATGGTGC TACTAAAATA AATACTCCAA ACCTTGACAA ACTTGCAAAA
AATGGCATCC GCTTTACCAA CGGGCATTCT ACTTCAGCCA CCTGTACCCC ATCCCGCTTC
GCGGTTATGA CAGGGCAATA CCCATGGCGC CAAAAAGGAA CAGAAATTTT ACCTGGCGAC
GCGGCACTGA TTGTTCCAAC AAATACGACT ACCCTGCCTA AAGTATTTAA AAAAGCTGGT
TATCAAACCG CAGTTGTAGG TAAATGGCAC CTCGGGTTGG GTAAACAAGT AGAAAAAAAC
TGGAATACTG AAGTAACACC CGGGCCAAAT GAAGTGGGCT TCGATTATTC ATTTATTTTT
CCGGCAACAG CCGATAGGGT CCCAACTGTT TTTATGGAAA ATCATAAAAT ACTGGCTTTG
GATCAGACAG ACCCCATCAG AGTAGATTAT AAAAATCCGG TCGGCAACGA TCCGACAGGA
AAAGAGCATC CGGAATTGCT AAAACTCAAA TCTTCAGCAG GACAGGGACA CAACAATACC
ATTGTAAACA GTATTGGCAG GATTGGGTAC ATGGAGGGCG GTCACAAAGC CAGGTGGACA
GATGAAGAGG TATCCACAAC ATTTCTGACC AAGGCCCGGG ATTTTATAGA AAAGAATAAA
AATCAACCGT TTTTCCTTTA TTTTACACTA ACCGAACCGC ATGTGCCCCG TATGCCTGCC
ACCATGTTTA AAGGGAAAAG TGAACTTGGA TTAAGGGGTG ATGCGATATT GCAGCTGGAC
TGGACAGTAG GCCAGATCAT GAAACAACTG GAATATCTGA AACTGGATAA AAATACCCTC
ATTATTTTTT CGAGTGATAA CGGCCCGGTA CTGGACGATG GCTATGAAGA TGAGGCTGTA
AAAAAAAGTG TCGGACATCA ACCTTCCGGA ATTTTCAGGG GCGGCAAATA CAGTTCATTC
GAAGCCGGAA CAAGGGTTCC ATGGATCATG AGTTGGCCAG GCACCATATT GCCAAAGGTA
TCTGAAGCTA TGGTTTGCCA AATGGACTTA CTCGCCTCTT TTTCCTACCT TTTAAACGTC
CCACTGCCTG CTGATGAAAC TACCGACAGC GAAAATGTCT TACCTGCGTT GCTAGGCAAA
TCCACAAAAG GCCGTACCAC ATTAATTACA CAGGGAGGCC CACTGGCCAT CATCAAAAAC
AACTGGAAAT ACATTGAACC AGGTAAGGGC ATTGCTTACG ATAAATTAAC AGGTATAGAG
ATGGGTGTAT CGGCAAGTGG TCAATTGTAT AACCTGAACA CCGATCCTGG AGAAACTGAA
AATGTTCTCT ACAAATACGC GAGCAAAGCA AAAGAGCTGG CTACACTGCT CAATGCTATA
AAAGCTAAAT AA
 
Protein sequence
MKRTTILILA FLLFGTTIHL SAQTAKPNVI FIYADDLGYG DLSCYGATKI NTPNLDKLAK 
NGIRFTNGHS TSATCTPSRF AVMTGQYPWR QKGTEILPGD AALIVPTNTT TLPKVFKKAG
YQTAVVGKWH LGLGKQVEKN WNTEVTPGPN EVGFDYSFIF PATADRVPTV FMENHKILAL
DQTDPIRVDY KNPVGNDPTG KEHPELLKLK SSAGQGHNNT IVNSIGRIGY MEGGHKARWT
DEEVSTTFLT KARDFIEKNK NQPFFLYFTL TEPHVPRMPA TMFKGKSELG LRGDAILQLD
WTVGQIMKQL EYLKLDKNTL IIFSSDNGPV LDDGYEDEAV KKSVGHQPSG IFRGGKYSSF
EAGTRVPWIM SWPGTILPKV SEAMVCQMDL LASFSYLLNV PLPADETTDS ENVLPALLGK
STKGRTTLIT QGGPLAIIKN NWKYIEPGKG IAYDKLTGIE MGVSASGQLY NLNTDPGETE
NVLYKYASKA KELATLLNAI KAK