Gene Phep_3374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3374 
Symbol 
ID8254493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4012932 
End bp4014455 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content46% 
IMG OID644937026 
Productsulfatase 
Protein accessionYP_003093630 
Protein GI255533258 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA ATATCAACAA AGTACCCGGC AGCATGATGC TGCTCATTGC AATAACACTG 
GCATTTTCAT CAAACGCCCA AACAACAAAG ACCAGTAAAC CAAATATTGT GATCATCTAT
GCAGACGATT TGGGATATGG CGATATCAGT GCCTATGGTG GCGACGTGAA AACACCTAAC
ATAGACCGGC TGGCTAGCCA GGGCCTGAGC TTTACGAATG GGCATTCCAC ATCAGCAACC
TGCACACCTT CGCGCTATTC CCTGCTTACC GGTAAATATG CCTGGAGAAA ACAGGGCACC
GGCGTTGCAC CTGGCAATGC TCCGCTTATT CTAGATCCCG AAAAAAATAC CATAGCAGAT
GTATTGGGTA AAGCCGGTTA TAAAAGTGCT GTGGTTGGTA AATGGCATTT AGGCCTTGGC
CCAAAAGAGG GGGCAGATTG GAATGGTGAC ATCAAACCAG GGCCTCTGGA GCTGGGTTTC
AACTATTCCT ATATTTTACC GGCAACAGGC GATCGTGTTC CATGCGTTTA TGTAGAGAAC
CATAGAATTG TTAACCTTGA TCCCAAAGAT CCCGTTCACG TTTCTTATCT GGCTCCTATA
GCAAATGAAC CAACCGGACT CAATAATCCG GAACTATTAC GTGTTCAATC TTCGCATGGA
CACAATCAGG CCATAGTAAA TGGAATTGGC CGCATTGGCT ACATGACCGG AGGAAAATCA
GCATTATGGA CAGACGAGGA CATTGCCGCT GTATTGGCTT TAAAAGCAAG CAAATTTATT
GAAAACAATA AAAATCAGCC TTTTTTCCTA TACCTGGCCA CTCATGACAT CCATGTACCA
AGGGTACCAA ACTCAAAGTT CCTTGGTAAA AGTGGACTTG GTGTGCGTGG CGATGCGATA
CTCCAGTTGG ACTGGACCGT AGGGCAGGTT ACAAAAACAT TGGATAGCCT TGGTTTAAGC
AAAAATACCC TTGTGATATT TAGCAGCGAT AACGGGCCGG TTCTGGATGA TGGTTACGTA
GATGAGGCCA TAGAAAAACT AGGCACGCAT AAACCTGCAG GGCCACTTAG AGGGGGTAAA
TACAGTTTGT TTGATGGTGG AACCCGTGTG CCACTGATTG TGAAATGGCC TGCAGCGATC
GCCGCAGGTA GCAGCTCTGA TGCCCTGATT AGTCAGGTCG ATTTCTTTGC CTCACTGGCC
GCATTAACCG GCCAAAAACC AGGCGCCGGA GATGCCCCAG ATAGTCAGAA CGTCATTAAT
GCCTTAACCG GAAAATCCAA ATCAGGGAGG TCATGGCTTA TCGCACATGC AGGTACACTG
TCTATTACAA AAGGTGACTG GAAATATATT GAACCCGCGA AAGGAAATGC AGCAGCATCA
AGACACAAAG AACTCGGAAA ATCTGCTGTT GCTCAATTAT ATAACCTTAA AAACGATCTC
GCTGAAACCA GGAACCTTGC AGATGAAAAT CCGGAATTGG TAAAAACACT GGCAGCCGAA
CTGGAAAAAG TTAAGTCGCT ATAG
 
Protein sequence
MKTNINKVPG SMMLLIAITL AFSSNAQTTK TSKPNIVIIY ADDLGYGDIS AYGGDVKTPN 
IDRLASQGLS FTNGHSTSAT CTPSRYSLLT GKYAWRKQGT GVAPGNAPLI LDPEKNTIAD
VLGKAGYKSA VVGKWHLGLG PKEGADWNGD IKPGPLELGF NYSYILPATG DRVPCVYVEN
HRIVNLDPKD PVHVSYLAPI ANEPTGLNNP ELLRVQSSHG HNQAIVNGIG RIGYMTGGKS
ALWTDEDIAA VLALKASKFI ENNKNQPFFL YLATHDIHVP RVPNSKFLGK SGLGVRGDAI
LQLDWTVGQV TKTLDSLGLS KNTLVIFSSD NGPVLDDGYV DEAIEKLGTH KPAGPLRGGK
YSLFDGGTRV PLIVKWPAAI AAGSSSDALI SQVDFFASLA ALTGQKPGAG DAPDSQNVIN
ALTGKSKSGR SWLIAHAGTL SITKGDWKYI EPAKGNAAAS RHKELGKSAV AQLYNLKNDL
AETRNLADEN PELVKTLAAE LEKVKSL