Gene Phep_2827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2827 
Symbol 
ID8253935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3358452 
End bp3360089 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content44% 
IMG OID644936473 
Productsulfatase 
Protein accessionYP_003093088 
Protein GI255532716 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.115962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAT TGAAATTAAT TTTACCGGTT TTGTTTGCCG GTGCCACCTT AATGTCTTGC 
CAGCAGCCTA AACCTGCTGA AAGTGCCAAA AGGCCCAATA TTGTGTTCAT CATGACAGAT
GACCATACCA TTCAGGCCAT AAGCGCTTAT GGCAGCAAAT TGGTAAAAAC GCCCAACCTG
GACAGAATTG CCAACGAGGG TATGTTGTTT AACAACTGTT TTGTAACCAA TGCAGTTTGC
GGGCCATCCA GGGCTACTAT CCTGACCGGA AAATATAGCC ACCTGAATGG TTTAACAGAC
AATTCAAAGG TATTTGACAG TACTCAGGTT ATTTATCCGC AGTTGTTAAA GAAAGCAGGG
TACCAGACCG CAATGATTGG CAAGTGGCAC CTGGGCTCAA CACCAATGGG CTTTGACTAT
TACAGTATTT TGCCCAACCA GGGACAATAT TATCAGCCTG AATTTATAGA AAACGGGCAT
CTGGTTAAAG AAAAAGGATA TGTAACAGAC CTCATCACCG ATAAGGCCAT CGGCTTCCTT
GAAAAAAGGG ACCATGATAA ACCCTTTCTG ATGATTTACC AGCACAAAGC ACCGCACCGC
AACTGGTTGC CGGCACCAAG ACACCTGGGG ATGTTTGACG ATACGGTTTT TCCTGAACCT
GCCAATTTAC TGGATGATTT TAAGGGCAGG GGCAGGGCAG CAAAGGAGCA GCTGATGAAC
ATTTCTACCG ATATGTGGCC TGCATGGGAC CTTAAAATGC TTTCTACAGC CCAGCTTGAT
TCTATGGCGA AACTACCTGT TTCCCCTAAG TTTAAAGATG CCAAGGGTGA TGATTATCAA
CAGGCCAATG ATCCTTCACT GGATAAAGCC CGTTTTTTTG AAGTGTACAA CCGCATGACA
GATGCTGAAA AGGTACAATG GAGAAAAGTA TATGACAAAC GCGTAGCCGA ATTTAAAAGG
CTGAACCCGA AAGGGGCCGA CCTGGTGCGA TGGAAATACC AGCAGTATAT GCGCGATTAT
CTGGCCTGCG TGGTTTCGGT AGATGAAAAT GTAGGCAGGC TGATGGATTA CCTGAAAAAG
ATAGGGGAGC TGGACAATAC CATTATTGTC TATACTTCCG ATCAGGGCTT TTATTTGGGT
GAGCATGGGT ATTTCGACAA ACGTTTTATG TACGATGAAT CTTTCCGTAC ACCTTTAATG
GTGAGGTATC CGCCTTCGGT TAAAGCCGGT TCAGTAAGTA ATGCCTTTGC CATGAACCTC
GATTTTGCAC CAACTTTACT GGATTATGCA GGGGTAAAAA TACCAGCCGA TATGCAGGGC
CTGTCGTTAC GTCCGGTATT GGATAACGCA GGAAAATCGC CGGAAAACTG GCGCAAGGCT
GTATATTATC ATTATTATGA ATTTCCAAGC TGGCACATGG TTAAAAGGCA CTATGGCATC
AGAACGGAGC GCTATAAACT GATCCATTTT TACAATGACA TTGATGAATG GGAATTATAC
GATATGCAGA AAGATCCGCA TGAGATGCAA AACCTGTATA ACGATAAGGC CTATGAGCCG
ATTATTAAAG ACCTGAAAGT GCAAATGAAA AAGCTGCAGG TACAATATAA AGATACGAAT
CCAACTGAAG CTTTATAA
 
Protein sequence
MGKLKLILPV LFAGATLMSC QQPKPAESAK RPNIVFIMTD DHTIQAISAY GSKLVKTPNL 
DRIANEGMLF NNCFVTNAVC GPSRATILTG KYSHLNGLTD NSKVFDSTQV IYPQLLKKAG
YQTAMIGKWH LGSTPMGFDY YSILPNQGQY YQPEFIENGH LVKEKGYVTD LITDKAIGFL
EKRDHDKPFL MIYQHKAPHR NWLPAPRHLG MFDDTVFPEP ANLLDDFKGR GRAAKEQLMN
ISTDMWPAWD LKMLSTAQLD SMAKLPVSPK FKDAKGDDYQ QANDPSLDKA RFFEVYNRMT
DAEKVQWRKV YDKRVAEFKR LNPKGADLVR WKYQQYMRDY LACVVSVDEN VGRLMDYLKK
IGELDNTIIV YTSDQGFYLG EHGYFDKRFM YDESFRTPLM VRYPPSVKAG SVSNAFAMNL
DFAPTLLDYA GVKIPADMQG LSLRPVLDNA GKSPENWRKA VYYHYYEFPS WHMVKRHYGI
RTERYKLIHF YNDIDEWELY DMQKDPHEMQ NLYNDKAYEP IIKDLKVQMK KLQVQYKDTN
PTEAL