Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3374 |
Symbol | |
ID | 8254493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4012932 |
End bp | 4014455 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937026 |
Product | sulfatase |
Protein accession | YP_003093630 |
Protein GI | 255533258 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA ATATCAACAA AGTACCCGGC AGCATGATGC TGCTCATTGC AATAACACTG GCATTTTCAT CAAACGCCCA AACAACAAAG ACCAGTAAAC CAAATATTGT GATCATCTAT GCAGACGATT TGGGATATGG CGATATCAGT GCCTATGGTG GCGACGTGAA AACACCTAAC ATAGACCGGC TGGCTAGCCA GGGCCTGAGC TTTACGAATG GGCATTCCAC ATCAGCAACC TGCACACCTT CGCGCTATTC CCTGCTTACC GGTAAATATG CCTGGAGAAA ACAGGGCACC GGCGTTGCAC CTGGCAATGC TCCGCTTATT CTAGATCCCG AAAAAAATAC CATAGCAGAT GTATTGGGTA AAGCCGGTTA TAAAAGTGCT GTGGTTGGTA AATGGCATTT AGGCCTTGGC CCAAAAGAGG GGGCAGATTG GAATGGTGAC ATCAAACCAG GGCCTCTGGA GCTGGGTTTC AACTATTCCT ATATTTTACC GGCAACAGGC GATCGTGTTC CATGCGTTTA TGTAGAGAAC CATAGAATTG TTAACCTTGA TCCCAAAGAT CCCGTTCACG TTTCTTATCT GGCTCCTATA GCAAATGAAC CAACCGGACT CAATAATCCG GAACTATTAC GTGTTCAATC TTCGCATGGA CACAATCAGG CCATAGTAAA TGGAATTGGC CGCATTGGCT ACATGACCGG AGGAAAATCA GCATTATGGA CAGACGAGGA CATTGCCGCT GTATTGGCTT TAAAAGCAAG CAAATTTATT GAAAACAATA AAAATCAGCC TTTTTTCCTA TACCTGGCCA CTCATGACAT CCATGTACCA AGGGTACCAA ACTCAAAGTT CCTTGGTAAA AGTGGACTTG GTGTGCGTGG CGATGCGATA CTCCAGTTGG ACTGGACCGT AGGGCAGGTT ACAAAAACAT TGGATAGCCT TGGTTTAAGC AAAAATACCC TTGTGATATT TAGCAGCGAT AACGGGCCGG TTCTGGATGA TGGTTACGTA GATGAGGCCA TAGAAAAACT AGGCACGCAT AAACCTGCAG GGCCACTTAG AGGGGGTAAA TACAGTTTGT TTGATGGTGG AACCCGTGTG CCACTGATTG TGAAATGGCC TGCAGCGATC GCCGCAGGTA GCAGCTCTGA TGCCCTGATT AGTCAGGTCG ATTTCTTTGC CTCACTGGCC GCATTAACCG GCCAAAAACC AGGCGCCGGA GATGCCCCAG ATAGTCAGAA CGTCATTAAT GCCTTAACCG GAAAATCCAA ATCAGGGAGG TCATGGCTTA TCGCACATGC AGGTACACTG TCTATTACAA AAGGTGACTG GAAATATATT GAACCCGCGA AAGGAAATGC AGCAGCATCA AGACACAAAG AACTCGGAAA ATCTGCTGTT GCTCAATTAT ATAACCTTAA AAACGATCTC GCTGAAACCA GGAACCTTGC AGATGAAAAT CCGGAATTGG TAAAAACACT GGCAGCCGAA CTGGAAAAAG TTAAGTCGCT ATAG
|
Protein sequence | MKTNINKVPG SMMLLIAITL AFSSNAQTTK TSKPNIVIIY ADDLGYGDIS AYGGDVKTPN IDRLASQGLS FTNGHSTSAT CTPSRYSLLT GKYAWRKQGT GVAPGNAPLI LDPEKNTIAD VLGKAGYKSA VVGKWHLGLG PKEGADWNGD IKPGPLELGF NYSYILPATG DRVPCVYVEN HRIVNLDPKD PVHVSYLAPI ANEPTGLNNP ELLRVQSSHG HNQAIVNGIG RIGYMTGGKS ALWTDEDIAA VLALKASKFI ENNKNQPFFL YLATHDIHVP RVPNSKFLGK SGLGVRGDAI LQLDWTVGQV TKTLDSLGLS KNTLVIFSSD NGPVLDDGYV DEAIEKLGTH KPAGPLRGGK YSLFDGGTRV PLIVKWPAAI AAGSSSDALI SQVDFFASLA ALTGQKPGAG DAPDSQNVIN ALTGKSKSGR SWLIAHAGTL SITKGDWKYI EPAKGNAAAS RHKELGKSAV AQLYNLKNDL AETRNLADEN PELVKTLAAE LEKVKSL
|
| |