Gene Phep_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2831 
Symbol 
ID8253939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3366547 
End bp3367869 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content43% 
IMG OID644936477 
Productsulfatase 
Protein accessionYP_003093092 
Protein GI255532720 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0404074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTA TACCTTTTCT CTCCATTATC TTAACTGCCT TAAGTTTAAT TACCCATGTA 
ACATTTGGTC AAAAAAGACC AAATGTAATT ATTGTGCTCA CCGATGATAT GGGTTACGGT
GATCTGGCCT GTTACGGGAA CCCTTTATTC AAAACACCAT TTCTTGATAA AATGGCCAGT
AATGGCGTAA TGGCAACAAA TTTTGTAACC ACTTCTCCTA CCTGCTCCCC ATCAAGGGTA
TCAACCCTTA CCGGGCGGTA TTGCAGCCGC TCTAAAATGC CACGTGTTAT AGGCCCTGGT
GATAAAACAG CAATTCCTGA TGAAGAGGTT ACCATTGCCG AAATGCTGAA AACTTCAGCT
TACCGTACAG CCTGTATAGG TAAATGGCAT ATTGGCGATT ATGGTACCGG ATTGCCCAAC
AAACAAGGTT TCGATTTATT TTACGGGATG TTGTACAGTC ATGACTTCAG GGCACCTTAT
GTAAAAACAG ATACAGTGAT TAAAATATTC AGGAACCAAA AGCCCGAAAT ATACCGTCCT
AATGATACCA TACTCACAAA AGCCTATACC AGGGAAGCCA TCGGTTTTGT AAAAGAATCG
ACAGCAAAAA AACAACCTTT TTTTTTATAT CTGGCCTACA ATATGCCACA TCTTCCAGTA
GCCAGCGCAG TAAGAAAAGA CAGCAATAAA TCGGCCGGAG GCGAACTGGG CAGTGTGATA
GAAGAAATGG ATACGGAAAT GGCTAAGCTA TGGAAAACAG TGCAGGACAG TGGCGAAGCT
GACAATACCA TTTTTATATT TACCAGCGAT AACGGCCCAT GGTTAAATGC CCCTCAGCGC
ATGTACGATG ACGGCATTAC CAAGCCATAT CACGTGGGCA CAGCTGGTAT TTTCAGGGGA
TCGAAGGCAA CTTCTTTAGA AGGCGGACAC CGCGTGCCTT TTATAGTTTA TTACAAAAAC
CATACAGCCC AACAAGTTGT GCGCAGCCCG ATATCCAACC TGGATATTTT GCCCACCCTG
GCCGACTGGA CCGGTACTGC CCTACCAAAA CGGGTGCTGG ACGGAGAATC TGTGGTTAAG
CTGCTGTCAC AAAAAGACTA TCAGATTCCC CACAAGCCAA TTTATTATTA CAACTATGTC
CTGGAAGGTG TAAAGGATGG TGACTGGAAG CTGAGGATCA CTAAAAAGGA TGATAAAACA
ATAGAAGAAA TGTTCCATCT GGGCTGGGAC CCTACAGAGC GCTACAATTT ATACAACGAC
CCAAAATATG CTAAGGAACA ACAACATTTA CTGCAGTTAT ACAGGGATTA CCCGGATCAG
TAA
 
Protein sequence
MKRIPFLSII LTALSLITHV TFGQKRPNVI IVLTDDMGYG DLACYGNPLF KTPFLDKMAS 
NGVMATNFVT TSPTCSPSRV STLTGRYCSR SKMPRVIGPG DKTAIPDEEV TIAEMLKTSA
YRTACIGKWH IGDYGTGLPN KQGFDLFYGM LYSHDFRAPY VKTDTVIKIF RNQKPEIYRP
NDTILTKAYT REAIGFVKES TAKKQPFFLY LAYNMPHLPV ASAVRKDSNK SAGGELGSVI
EEMDTEMAKL WKTVQDSGEA DNTIFIFTSD NGPWLNAPQR MYDDGITKPY HVGTAGIFRG
SKATSLEGGH RVPFIVYYKN HTAQQVVRSP ISNLDILPTL ADWTGTALPK RVLDGESVVK
LLSQKDYQIP HKPIYYYNYV LEGVKDGDWK LRITKKDDKT IEEMFHLGWD PTERYNLYND
PKYAKEQQHL LQLYRDYPDQ