Gene Information |
Locus tag | Phep_2941 |
Symbol | |
ID | 8254052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3505659 |
End bp | 3507101 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644936589 |
Product | sulfatase |
Protein accession | YP_003093201 |
Protein GI | 255532829 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
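
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3505659, 3507101)
print(length)                     # 1443
print(protein_length_aa(length))  # 480
```

Both results agree with the Gene Length (1443 bp) and Protein Length (480 aa) fields.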
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.515365 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAACCG GGCTTTTTAT ACTATCATTT TGCTGTTTTT TTGCTGCTGG CAGGGCCCAG ACCACCAAAA CGCAACGTCC GAATGTGATC ATTATCAACA TGGACGATAT GGGGTACGGG GATACCGAAC CCTATGGGAT GACTGGAATA CCAACGCCTA ATTTTAATAA AGCGGCTAAG GAAGGCATGC GGTTTACACA CTTTAATGCT GCTCAGGCAA TTTGCAGCCC GTCCAGGGCA GCATTATTGA CCGGCTGTTA TCCAAACCGG ATCGGATTGC GGGGGGCATT GTCCCCCGAT TCAAAAATAG CTTTGGACAC TGCAGAAGAA ACCATCGCCT CCCTGCTCAA AAAGGCAGGG TATAAAACTG CCATGCTGGG TAAATGGCAT TTGGGCAGCA AGGCCCCCAA CCTTCCGCTT CATTATGGTT TCGACAGCTT CTACGGGCTT CCCTATTCGA ACGATATGTG GCCGGTGGAT TATGAAGGAA AGCCCCAGGC GGCTGTTGCC GGGAAAAAAA GCTATCCCGA ACTGCCCTTG CTGGATGGCG ACAAGCCTGC TGACTATGTG CGTACACCTG ATGATCAGGC CATGCTTACC GGAACATTTA CCCGTAAAGC GGTACGCTTT ATTGAAAATA ATAAAAGCGC ACCTTTTTTT CTTTACCTGG CCCATCCCAT GCCCCATGTG CCGCTGGCTG CCTCGGCAGC ATTCAGGGGT AAAAGTGAGC TGGGACTGTT TGGCGACGTT ATTATGGAAC TGGACTGGTC TGTAGGGGAG ATCATGAAAT CGCTGGACCG GAATAAAATT GCTTCAAATA CCATTCTTAT AATTATGAGC GACAATGGCC CCTGGTTAAG GTTCGGTAAC CATGCAGGCT CTTCAGGTGG GTTTCGCGGA GGAAAAATGA CCATATGGGA TGGAGGGACC CGTGTTCCCT GCATCATCAG ATGGCCGGGC AAAGTGGAAG CCGGAAGTGT AAACAGCAAC CTGATTACCA ATATGGATAT CCTGCCAACT TTGCTGCAGC TTAGCCATGC CGCCCCACCC GAAAAGAAGA TAGACGGGAT AAGTTTTGCA GATTTGCTGC TGGGCAGATC TGATAAAGCC CCCCGCCAGG TTTTTTATTA TTACTATAAT GAAAACAGCC TCAAGGCCGT AAGGTATAAA AACTGGAAAC TGGTGTTGCC ACATACTTCT GTATCCTATA CCAGCGACAT CCATGGCAAA GATGGTTTTC CGGGAGCCGC AACGCGGGCG GAAGTAAAAA TGGCTTTATA CGACCTGGCC CATGATCCCG GCGAGGCTTA TGATGTTCAG CAACAGTATC CCGAACTTGT ACAAAAAATG CTTGTTTTTG TGGAAGAGGC AAGAGCGGAC ATGGGAGATG ACCTTACCGG CAGAAAAGGT AAAAATCTGC GTCAGCCTGC CATTGTCAAT TAA
Protein sequence | MKTGLFILSF CCFFAAGRAQ TTKTQRPNVI IINMDDMGYG DTEPYGMTGI PTPNFNKAAK EGMRFTHFNA AQAICSPSRA ALLTGCYPNR IGLRGALSPD SKIALDTAEE TIASLLKKAG YKTAMLGKWH LGSKAPNLPL HYGFDSFYGL PYSNDMWPVD YEGKPQAAVA GKKSYPELPL LDGDKPADYV RTPDDQAMLT GTFTRKAVRF IENNKSAPFF LYLAHPMPHV PLAASAAFRG KSELGLFGDV IMELDWSVGE IMKSLDRNKI ASNTILIIMS DNGPWLRFGN HAGSSGGFRG GKMTIWDGGT RVPCIIRWPG KVEAGSVNSN LITNMDILPT LLQLSHAAPP EKKIDGISFA DLLLGRSDKA PRQVFYYYYN ENSLKAVRYK NWKLVLPHTS VSYTSDIHGK DGFPGAATRA EVKMALYDLA HDPGEAYDVQ QQYPELVQKM LVFVEEARAD MGDDLTGRKG KNLRQPAIVN
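
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the sketch does not handle alternative start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record (blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAACCG GGCTTTTTAT ACTATCATTT"))  # MKTGLFILSF
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.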