Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4004 |
Symbol | |
ID | 8255138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4835562 |
End bp | 4836620 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644937668 |
Product | glycosidase PH1107-related |
Protein accession | YP_003094257 |
Protein GI | 255533885 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00339526 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATC AAGCGCAAAG ATTTATACAA AACCCTTTAT TATCGCCAAA AGAATTGAAG CCAAGCAGGC CTGGACTGGA GATCACCTGT CTGCTGAACC CCGGTGTTTT TACTTTTGAA GGCAAAACCT GGTTGCTGGT ACGTGTGGCG GAGAGACCGA AACAACAGGA GGGTTTTATT TCTTTTCCGG TACTGAAAGG AGGGGGGATT GAAATTATTG AGATCGCGTC GGGTGATCCG TACCTGAATG CGGATGACCC GCGGGTCATC AGGTATAAAG GGCAGGATTA CCTGACCACC CTATCGCATT TACGCCTGCT TTGCAGTGAC GATGGTGTTC ATTTTTATGA GCCGGAAGGC TATCCCCTTT TGCAGGGGGA GACTTTACAG GAAGCTTTTG GGGTAGAGGA TTGCAGGGTA GCATTGATTG AGGGGATGTA TTACCTGACT TATACCGCTG TTTCCGGGCA GGGTGTAGGG GTTGGCCTGC GTAAGACCAA GGACTGGAAA ACTTTTGTTT CCGAAGGAAT GATCATTCCA CCGCACAATA AGGACTGTGC CATTTTTGAA GAAAAGATCA ATGGTAAGTT TTATGCGCTG CATCGACCGA GCAGTGTTGA TATCGGTGGA AACTACATCT GGATTGCTCA ATCGCCTGAT GGCATACATT GGGGAGGGCA TAAATGTATT GTTACCACCA GAAAAGATAG CTGGGACAGT GCAAGGGTAG GCGCCGGGGC TGCGCCCATA AAGACGTCAC TGGGCTGGCT GGAAATTTAT CATGGTGCAG ATACCGCCCA CCGGTATTGT CTGGGTGCTT TTTTGCTGGA TCTTGATGAC CCTTCTCTTG TGCTGGCACG CAGTACAGAA CCTATTATGG TACCTACGGC TACTTATGAG CTGACTGGCT TTTTCGGACA TGTGGTGTTT ACGAACGGCC ATGTAGTGCA GGGCGATGAG CTGACCATTT ATTATGGTGC TGCAGATGAG TTTGTTTGCG GGGCTAAATT CTCTATTAAT GAAATATTAA CCTCCCTAAC TTACTATCAT GATTCATAA
|
Protein sequence | MKDQAQRFIQ NPLLSPKELK PSRPGLEITC LLNPGVFTFE GKTWLLVRVA ERPKQQEGFI SFPVLKGGGI EIIEIASGDP YLNADDPRVI RYKGQDYLTT LSHLRLLCSD DGVHFYEPEG YPLLQGETLQ EAFGVEDCRV ALIEGMYYLT YTAVSGQGVG VGLRKTKDWK TFVSEGMIIP PHNKDCAIFE EKINGKFYAL HRPSSVDIGG NYIWIAQSPD GIHWGGHKCI VTTRKDSWDS ARVGAGAAPI KTSLGWLEIY HGADTAHRYC LGAFLLDLDD PSLVLARSTE PIMVPTATYE LTGFFGHVVF TNGHVVQGDE LTIYYGAADE FVCGAKFSIN EILTSLTYYH DS
|
| |