Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4258 |
Symbol | |
ID | 8255394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 5134074 |
End bp | 5135939 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644937924 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003094511 |
Protein GI | 255534139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.105749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGCCA AGATTGGGGG CGATGAGGAC CATTGTGGGT GGAAATATAA ATTTAAGCTG ATGAGAACTA ATGTAATCAA GTTTTGTGCA GTATACGCTG GCCTGTTATT GCTTTCCTTT CGGGCCGACA TTAAGTCGAC TGAAACCAGG GCTGCCTGGC TGCGGATCAA TCTGCTGGGC TATCAGCCGC AAGCCATAAA AGTGGCTGTT TGGGTAAGTC AGGATGAAAC TGAGCTGCCG CAAAAGTTTG AAATTATAGA AAAGGAAAGC GGAAAAGTAG TCTATACCTC TGGAAACATT AAGCTTTTTG GTGCTTACGG ACCTTTTAAA AGATCGGCCA GGTTAAACTT TAGTGATTTT ACCCTTCCGG GACGCTACTT TATCAAAGCA GGAAGTGTGC AATCTCCTGA ACTGATCATT GACAAGCATG TTTATAACCA TACAGCCGAT TTTGCCCTGA GGTATATGCG CCAGCAGCGA AGCGGGTATA ACCCCTATCT GAAAGACAGC TGCCATACCC GGGATGGTTA TACGATGTAC GGGCCAATGC CCGACAGTAC ACATATAGAT GTTAGCGGAG GCTGGCATGA TGCCTCTGAT TACCTGCAAT ATGCCACCAC ATCGGCCAAT GCTACTTATC ATTTGCTGGC CGCTTATCGT GATTTTCCAG GTGTTTTTAC AGACCAGTAC CTTGCCAATG GCCTTGAAGG TAAAAATGGC CGGGCAGACA TACTGGATGA AGCCAGCTGG GGATTGCAGT GGTTACTGAA AATGCACCCC CGTAAGGATT GGATGTTTAA TCAGATTGCA GACGACAGGG ACCACCAGGG CTTGCGGCTG CCTACAAAAG ACAATGTAGA TTATGGGAAA GGAAGCGAGA GACCGGTTTA TTTTGCCAAT GGGAAACCGC AGGGCCTGGG TAAATATAAG AACAGGGCAA CCGGAACGGC TTCAGTTGCC GGTAAATTTA GCAGTGCCTT TGCCTTGGGA CAGCGACTAT TTAGGGGAAC AGACGCAGCA TATGCCAAAT TGCTGGGTGA AAAAGCCAGA TCTGCCTATG CTTTTGGCTT AAAACAACCC GGGGTGCAGC AAACAGCGCC AAACCGGGCC CCATATTTTT ATGAAGAGGA CAATTGGACG GATGATATGG AGCTGGCTGC GGCAGAACTG TACCAGACTT TTGGTGGGAA AAAATATTTC CGGCAGGCAG CGGTTTATGC CAGGAAGGAG CCGGTGATCC CATGGATGGG TGCCGACACT GCCAGGCATT ATCAATGGTA CCCTTTTCAT AATTTTGGAC ATTATGAAAT TGCGAAAGCA AAACAGCAGG GAATTTCGGA ACAGGCAACC GGTTATTACA GGGCGGGCCT GGAAAAGGTT TGGGACAAGG CCAGGAACAA TGCTTTTTAC AGAGGTGTTC CTTTTATCTG GTGTTCCAAC AATTTGACGG CTTCTTTTGC CATTCAGGGA TATCTGTATG GAAAGCTGAG CGGCGACCAG CAGTTTGAGG AGCTGGTGCA GGCCAATTTT GACTGGCTTT TTGGTTGCAA CCCATGGGGT ACCAGTATGG TATACGGCTT ACCTGCCTGG GGCGATACCC CCAAAGACCC CCATTCGGCA CTAACGCACC TGTATCAATA TCCTGTTGAC GGAGGTCTGG TTGATGGCCC CGTGTATGGG AGCATTTATA AAAACCTGAT TGGTATTAAA CTGGTACAGC CGGATGAATA TGCTGGATTT CAATCCGGGT TGGTGGTTTA CCATGATGAT TTCGGTGATT ACAGTACAAA TGAGCCCACA ATGGACGGTA CAGCTTCGCT GATCTATTTA CTGGCTGCTT TTGATAGCAG GACGAGCGGG AAATAA
|
Protein sequence | MCAKIGGDED HCGWKYKFKL MRTNVIKFCA VYAGLLLLSF RADIKSTETR AAWLRINLLG YQPQAIKVAV WVSQDETELP QKFEIIEKES GKVVYTSGNI KLFGAYGPFK RSARLNFSDF TLPGRYFIKA GSVQSPELII DKHVYNHTAD FALRYMRQQR SGYNPYLKDS CHTRDGYTMY GPMPDSTHID VSGGWHDASD YLQYATTSAN ATYHLLAAYR DFPGVFTDQY LANGLEGKNG RADILDEASW GLQWLLKMHP RKDWMFNQIA DDRDHQGLRL PTKDNVDYGK GSERPVYFAN GKPQGLGKYK NRATGTASVA GKFSSAFALG QRLFRGTDAA YAKLLGEKAR SAYAFGLKQP GVQQTAPNRA PYFYEEDNWT DDMELAAAEL YQTFGGKKYF RQAAVYARKE PVIPWMGADT ARHYQWYPFH NFGHYEIAKA KQQGISEQAT GYYRAGLEKV WDKARNNAFY RGVPFIWCSN NLTASFAIQG YLYGKLSGDQ QFEELVQANF DWLFGCNPWG TSMVYGLPAW GDTPKDPHSA LTHLYQYPVD GGLVDGPVYG SIYKNLIGIK LVQPDEYAGF QSGLVVYHDD FGDYSTNEPT MDGTASLIYL LAAFDSRTSG K
|
| |