Gene Information |
Locus tag | Phep_0785 |
Symbol | |
ID | 8251874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 926663 |
End bp | 928081 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644934435 |
Product | sulfatase |
Protein accession | YP_003091069 |
Protein GI | 255530697 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
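The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 926663
end_bp = 928081

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon, which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the Gene Length (1419 bp) and Protein Length (472 aa) fields.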
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.288009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAGGAA TAAAAACGAT AAGTACTTTG TTGCTGGCCC TTTGGACAGG CATTAGTGCT GCACAGGTAA AAACTGCGGC CAAGCCCAAC GTGATTGTCA TTGTTAGCGA TGATGCCGGA TATGTAGATT TTGGTTGTTA TGGTGGTAAA CAGATCCCCA CACCCAATAT TGATGCCATT GCCAAACAGG GTACGCGGTT TACTGATGCA TATGTTTCGG CTTCAGTATG TGCCCCCTCA AGGGCCGGAA TTTTAACCGG ACGTTACCAG CAGCGCTTTG GCTTTGAGCA CAATACATCA AATGTTTTGG CCCCGGGGTA TAAAATAACT GATGTAGGAA TGGATCCTTC GGAACAGACC ATTGGAAATG AAATGCAGGC AAATGGGTAT AAAACCATTG CAATTGGTAA ATGGCACCAG GGTGATGAAC CTAAACATTT TCCGCTAAAC AGGGGCTTTA ACGAATTTTA TGGTTTTACA GGGGGGCACC GTGATTTTTT TGCCTATAAA GGCAAAAGAA CCAATGAACA TGCTTTGTAC AACAATAAAG AGATCGTTCC GGAAAATGAA ATTACCTATC TGACGGATAT GTTTACCGAT AAGGCTACGT CTTTTATTAC AGCAAATAAA GACAAGCCCT TTTTTATGTA CCTTTCTTAC AATGCAGTAC ACACGCCGAT GAATGCGAAA AAAGACCTGA TGGAGCGTTA TGCAAGTATA GCCGATACCG GGCGCAGGGC CTATGCAGCC ATGATGACCT CATTGGATGA TGGAATTGGT AAGGTAATGG CCACACTTAA GGCAAATCAG CTGGATAAAA ATACACTGAT CATTTTTATC AACGACAATG GTGGCGCTAC AGTAAACTCT TCTGATAACG GGCCGTTAAG GGGTATGAAA GGGTCAAAAT GGGAAGGTGG CATCCGTGTG GCCATGATGA TGAAATGGCC TGGACATATT GCTGCAAATA AAACAGATAG CCGTCCGGTA AGCTCATTAG ATATCCTGCC TACGGCCATT GGTGCCGGAA AAGGTAAACA AAAGGGTACA AAAAAGCTGG ATGGGGTAAA CTTACTTCCT TATTTAAGTG CGGGTAATAA AAAGACACCC CACGAGGCGC TATATTGGCG AAGAGGCGTA GCCGCAGCCA TGAGAGAAGG GAACTGGAAG CTGATCCGGG TTAAGGAAAG CCCCACCGTA CAGAATGTAT TGTTGTTTGA CCTGAGTAAG GACCTTTCAG AGACTAAAAA CCTGTCGGAA AAATATCCTG CCAAAGTAAA AGAGCTGCTT GTCAAACTTG CTGAATGGGA AAAAGGACTG GACCAGCCGC ACTGGTATAG TTCTTACGGC GACCAGAACC AGATCATGAA GCACCGTATG GAAACTACAG GCCGCGAGAT GGAGAGAATG TACCCTTAA
Protein sequence | MKGIKTISTL LLALWTGISA AQVKTAAKPN VIVIVSDDAG YVDFGCYGGK QIPTPNIDAI AKQGTRFTDA YVSASVCAPS RAGILTGRYQ QRFGFEHNTS NVLAPGYKIT DVGMDPSEQT IGNEMQANGY KTIAIGKWHQ GDEPKHFPLN RGFNEFYGFT GGHRDFFAYK GKRTNEHALY NNKEIVPENE ITYLTDMFTD KATSFITANK DKPFFMYLSY NAVHTPMNAK KDLMERYASI ADTGRRAYAA MMTSLDDGIG KVMATLKANQ LDKNTLIIFI NDNGGATVNS SDNGPLRGMK GSKWEGGIRV AMMMKWPGHI AANKTDSRPV SSLDILPTAI GAGKGKQKGT KKLDGVNLLP YLSAGNKKTP HEALYWRRGV AAAMREGNWK LIRVKESPTV QNVLLFDLSK DLSETKNLSE KYPAKVKELL VKLAEWEKGL DQPHWYSSYG DQNQIMKHRM ETTGREMERM YP