Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4128 |
Symbol | |
ID | 8255263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4989467 |
End bp | 4990453 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644937793 |
Product | hypothetical protein |
Protein accession | YP_003094381 |
Protein GI | 255534009 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0321104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.774041 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TATTTAATTT AAAAAATGTT AAGGTTGAAG TGAAAGTTTT CACTTTTAAA TGCACGTTAT TGGTATGGGC ATTAACAACT GTGCTTTACG GGTGTAAAAA AGATAAAAAA GAAGATGGGG GTACAGATGT ACCAGAAGAA GTAGTGGATA ATGTGATTGA ACCTCTTACC AAGAAGATTA TGGACAATAC CACGGTAATT GGAACATTCA TTTCGGATGA GACAGGTTCT GTTACTGCTG GAATAAATAT TACCAGGCTG GCTTTTCTCA GGAAAGATAA ATTGCCTGTA AGAATCTTTA TTATGGAAGT TGATATGAAA ACGCCAAAAC TTGAAATTCA GGCAATGGCG CCTTATAATG ACTACATTAA TGGTTTGCAG AGGCTTTCCG AAATGTGCAG GGACAATGAA CTTCCGGGAA CAAATATTGT TGCTGCTGTT AATGGTGATA CCTTTAGTAC AACAGGTGCC CCAACCAGTT TGTTTTATAT AAATAATCGG GTTTACTATG GTACTGTTGC TACAGGGAGA ACCTTTTTTG CTGCAATGAA GGATGGGACA ATAGTTATTG GGGGAAAGGA TACAAAGGGG GTAGAAAGAC CTGTTGATAA AGCCCAGATT AAGAATGCAG TTGGGGGGAA TCAGTGGCTG GTAGACAACA ATATAAAAGC CACTTTGACT GACGCTACGA TTAGTGCCCG GACAGCAATT GGTTATAATG CCAATAAGGT AATTTATGCA ATTGTAGTGG ATGGATCACA AGCTACTTAT TCAAATGGTT TAACGCTTGT TGACTTAAGA GATATTATGG CTGCGCTTGG TACAAAAGAC GCAGTCAACC TTGACGGAGC TTCGTCCTCA ACTTTAGTGG CTAAGGATTT GACTAAAGGA ACGTGGAATG TTTTAAATAA GCCTGCATTG GCACTTAATG CAGAAAGGTT AATTGGAAAC GGGCTTGGCT TTATCCTTAA AAACTAA
|
Protein sequence | MKKLFNLKNV KVEVKVFTFK CTLLVWALTT VLYGCKKDKK EDGGTDVPEE VVDNVIEPLT KKIMDNTTVI GTFISDETGS VTAGINITRL AFLRKDKLPV RIFIMEVDMK TPKLEIQAMA PYNDYINGLQ RLSEMCRDNE LPGTNIVAAV NGDTFSTTGA PTSLFYINNR VYYGTVATGR TFFAAMKDGT IVIGGKDTKG VERPVDKAQI KNAVGGNQWL VDNNIKATLT DATISARTAI GYNANKVIYA IVVDGSQATY SNGLTLVDLR DIMAALGTKD AVNLDGASSS TLVAKDLTKG TWNVLNKPAL ALNAERLIGN GLGFILKN
|
| |