Gene Information |
Locus tag | Phep_2890 |
Symbol | |
ID | 8254000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3444745 |
End bp | 3446316 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644936537 |
Product | sulfatase |
Protein accession | YP_003093150 |
Protein GI | 255532778 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
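The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3444745
end_bp = 3446316

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1572 523
```

Under these assumptions both values agree with the Gene Length (1572 bp) and Protein Length (523 aa) fields.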
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TAACCTATTT GAAGCATAGT ACCTGGGTAA CCGCATTTGC TTTTTCAGTT TGCCTGGTTT TGCTCAGCAA AACAAAATTG CAGGCCCAGC AACAAAAGAA GCTGCCCAAT ATTGTATATA TTCTGGCTGA TGATCTGGGC TATGGCGATA TTAAAATTTA CAATGCAGGT GCAAAAGTAA ATACACCCCA CATTGATAAA CTGGCCGAGC AGGGCATGCG CTTTACAGAT GCGCATACCA CATCATCTGT ATGCACGCCT TCGCGCTATT CTATTCTGAC CGGTCGCTAC CCCTGGCGCA GCAGGCTTCC GGTAGGTGTA TTGAGAGGGT ATAGCAGAAC ATTGATAGAA GAAGGCCTGC CAACGGTAGC CGGTTTGTTG AAAACAAGCT CCTATCGCAC AGCGGTAATT GGGAAATGGC ATTTAGGATT GGATTGGATG CCAAAAGAAG CATTCAAAGA TTCTATTAAT CCCGCTTTTA ACAAAGACAG GCTGTACGGC ATTACCGATG AAATGAATCC GGATCAGATA GATTTTGGAA GAGCACCCGT TCGTGGCCCG CGTACACAAG GTTTTGATTA CTCCTATGTG CTGCCTGCCT CTCTGGACAT GCCGCCATAT GCCTATCTGG AGAATGATCA GCTAACAGAG CCGCTTACGG GCTATACACC AGGTAATAAA TTGGCCAGCG GCTATACCGG CCCCTTCTGG AGGGCCGGCT TAAAAAGTCC CTCATTTGAT TTTTACGGTG TATTGCCGGC CTTTACCAAT AAGGCCACCG ATTTTATTAA AAAAGAGGCT GCAACAAAAA ATCCTTTCTT CCTGTATTTC CCTATGCCGG CACCACATAC CCCATGGATG CCTACCGCCG AATACCGGGG TAAATCGCAG GCAGGGGAAT ATGGTGATTA CTTACAGGAA GTTGATGCTG CAGTAGGTAA AATATTGCAG GTATTGGACA GCCTGGGCTT ATCAAAAAAT ACCTTAGTGG TGTTTACCAG CGACAACGGG CCTTACTGGC GGGATGATTT TGTGCAGCAA TATGGCCACC ATGCTGCGGG GCCATTCAGG GGAATGAAAG GCGATGCTTA TGAAGGTGGC CACAGGGTTC CTTTTATTGT GCGTTACCCC GGTAAAGTAA AAGCAGGGAC CATTAGTAAT GTAACAACCA CCCTTGCCAA CCTGATGGCC ACCTGTGCCG ATTTAACTGG CAACCATGCT GTCCAGTTTG AAACGGAAGA TAGCTACTCC ATTTTACCGG TATTGCTTGG TAAAGCTGCC GGGATTGCAG AACAGCCGGC CATTGTCAAT ATTTCTTCCA AAGGGTTTTA TGATATCCGG AAAGGGCCCT GGAAATTGAT TACCGGTTTG GGTTCCGGAG GGTTTTCGGT ACCCTCAATA GTTAAGGCCC CTGAAGGGCA AGCTGCCGGG CAATTGTACA ACCTGGATAC AGACATTAAA GAAGAGACAA ATTTGTATAG CCGGTATCCT GAAAAAGTAA AAGAACTAAG CGCCTTATTG GAAAAAATAA AAGCAGCGCC CAAAGGAAAA CGTGCAAAAT AA
Protein sequence | MKKITYLKHS TWVTAFAFSV CLVLLSKTKL QAQQQKKLPN IVYILADDLG YGDIKIYNAG AKVNTPHIDK LAEQGMRFTD AHTTSSVCTP SRYSILTGRY PWRSRLPVGV LRGYSRTLIE EGLPTVAGLL KTSSYRTAVI GKWHLGLDWM PKEAFKDSIN PAFNKDRLYG ITDEMNPDQI DFGRAPVRGP RTQGFDYSYV LPASLDMPPY AYLENDQLTE PLTGYTPGNK LASGYTGPFW RAGLKSPSFD FYGVLPAFTN KATDFIKKEA ATKNPFFLYF PMPAPHTPWM PTAEYRGKSQ AGEYGDYLQE VDAAVGKILQ VLDSLGLSKN TLVVFTSDNG PYWRDDFVQQ YGHHAAGPFR GMKGDAYEGG HRVPFIVRYP GKVKAGTISN VTTTLANLMA TCADLTGNHA VQFETEDSYS ILPVLLGKAA GIAEQPAIVN ISSKGFYDIR KGPWKLITGL GSGGFSVPSI VKAPEGQAAG QLYNLDTDIK EETNLYSRYP EKVKELSALL EKIKAAPKGK RAK
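As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first three 10-base blocks and the final blocks of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted). The `translate` helper and the `CODON_TABLE` name are hypothetical and not taken from any library.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base display blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGAAAAAAA TAACCTATTT GAAGCATAGT"))  # MKKITYLKHS

# Last 12 bases of the gene sequence, including the TAA stop codon.
print(translate("CGTGCAAAAT AA"))  # RAK
```

The two results match the first ten and last three residues of the protein sequence shown above.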