Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2749 |
Symbol | |
ID | 8253857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3247888 |
End bp | 3251307 |
Gene Length | 3420 bp |
Protein Length | 1139 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644936397 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003093012 |
Protein GI | 255532640 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0806564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGA ACTTAAGTAA AACCATAGCA TTCGTTTTAT TGACTTGCAG CCTGCAGCCT GGCGCAGCTA TGGCGCAGGC AAATGAGTCT GCTAAGCCCT GGGTATTATG GCACTGGATT AAAGGCGGGG TTTCAAAACC AGGTATTACC GCCGATCTGG AAGCCATGAA GTCAGCGGGC ATCGGAGGTG CATATCTTTT ATCGATTAAA GATGTACCCA ATCCCCCTTT ATTTAACCCA TCTGTACGGC AGCTTACCCC ACAATGGTGG GACATGGTAA CCTTTGCCAT GCAGGAGGCC AGGCGTTTAA ACCTGAAACT GGGGATGCAT GTAAGTGATG GCTTTGCACT TGCAGGAGGG CCCTGGATTA CACCTGAACT TTCTATGCAG AAAGTGGTAT CTACCCAATT GAATATCAAG GGTGAAACAG CTGAAAAAAT AAAACTGCAG CAACCGGAAA CCCTGGAAGG TTATTATAAA GACATTGCGG TATACGCCTA TCCCTCAGCT GAAGGTGCCG GTATTTCCAC ACAGACTGTG ATTCCCGAAA TTACGACCAG TAATGCTGCA GATGCAAGCG GACTGATCAG ACCGGGTAAT ACGAAAAATT TTGGAAGTAA TGAAGCCTGC TGGATTCAGT ATGCTTTTAA ACAGCCTTTT ACCACCCGCA CCATCCGCAT AAGTACTGGC AGCAATAACT ACCAGGCACA ACGGCTGGAA ATACAGGTTA GTGATGATGG TGAAAATTTC CGTTCGGTGG GGCGCCTGGA GCCGCCAAGG CATGGCTGGC AGGACACTGA TGCCGATGTG ACACATAGTA TTGTACCCGT TACCGCAAGA TATTACCGCT TCGTATACGA TAAAAAAGGA TCGGAGCCCG GTTCAGAAGA TCTGGACGCC GCCAAATGGA AGCCTTCTTT AAAACTGAGG AACCTGGAGC TTTCTGCGGA AGCCAGGATC AACCAGTTTG AGGGAAAAGC TGGCTTGGTA TGGCGCATCA GCAAAGCCAG CACAAAAGAG ACGCTGGAAG ACCATTTATG TGTACCGATA GATAAGATCA TCAACCTGAG CGATAAAATT AAAGCAGACG GCAGCTTAGA CTGGAAGGCG CCTAAAGGAA ACTGGACCAT ATTGAGGATT GGCCATACTT CCACCGGTCA TAAGAATGCC ACTGCCGGAG CAGGTATGGG GCTGGAATGT GATAAATTTA ATCCGGCTGC AGTAAAATTA CAATTTGACA GCTGGTACGG GGCAGCCTTA AAACATGGAG GACCTGAAAT CGCCTCAAAG GTTTTGAATG AATTGTTTGT TGACAGCTGG GAGTGTGGCA GCCAGAACTG GTCGCCACTT TTTGCAGCGG AATTTAAGAA ACGCAGGGGC TACGACCTGA TGCGTTATCT GCCGGTTATG GTAGGCATAC CGCTTGGAAG TGTTGAGCTG TCTGAAGCCT TTTTACATGA TGTGCGAAAA ACCATTGCTG AACTGGTTGC AGACCAGTTT TACTACACCT TATCCAGCCT GACCAAAGAG AAAGGTGTCA CTTTTGCCGC TGAGAATGTT GCACCGACCA TGCTGAGCGA TGGCCTGCTG CATTATAAAA ATGTAGACAT GCCGATGGGT GAGTTCTGGT TGAACAGTCC GACACACGAT AAACTGAACG ACATGCTGGA TGCGGTTTCC GGCGCGCACA TTTATGGCAA AAATATCATA CAGGCAGAAG CTTTCACCAC CATTAGGATG GATTGGAATG AACATCCCGG CAATATGAAG ACCTTACAGG ATAGAAATTA TGCGCTGGGC ATCAATAAAC TGGTATATCA TGTTTTTGCC CATAATCCAT GGCTGGACCA AAAGCCAGGG ATGACACTGG ATGGGATAGG TTTATATTTT CAGCGTGACC AGACCTGGTG GAAACTTGGA AAGGCCTGGG TAGATTATGC CACACGTTGT CAGGCACTTT TACAACAGGG TAAGCCTGTA GCCGATGTCG CGGTATTTAT TGGTGAAGAA TTGCCCAGCC GCTCGGTATT ACCTGATCGG TTGGTTAACC TGCTGCCTGG CCTTTTTGGT GCTGAAGTGG TGGCAGCTGA GGCAAAAAGA CTGGCCAATA CCGGCGAGCC CTTAAGACAA AAGCCTGCAG GGGTAACACA TTCGGCAAAC ATGGCCGATC CGGAAGACTG GGTAAATCCA TTAAGAGGCT ATGCCTACGA TTCTTTTAAT CCTGATGTAT TGCTTGACGC AAGTGTAAAA GACGGGAGAG TGGTGTTTGC CAGCGGTGCA AGTTATGCGG TACTGGTTAT CCCGGGTAAA ATGCTCCTGA ACCCAAATTA TCAGTATATG AGTAAGGAAG TGGCTGCAAA GCTGAACGAG CTGGCGAAAG CAGGTGCAAG CCTTGTGCTC GGTGAACGTC CGAAATTCCA GCTGGGTATG CCTAAAACAG GCAGTGATCA GGACTTTGAG ACTTTACTGA CCGAGCTTTG GGATGGTAAT TTTAAAACAG CAGGACAGGG GCAACAGCAG CTTTCCCTGA AAACACTGGG TAAGGGAAGG ATCATTAAGG GTGCTTATCA GGCAGAAAGT TTTGACCTTA TTGGCCTGGA AAGGGATCTG CTGGTAATGG AAGGAAATGG AGATTATGCG AAAAAGGTAG CTTATACCCA TAGGATTGCA GCCGATAAAG AGTTTTATTT TGTTGCTAAC CAGGAAAATA AAAAGCGCTT GCTGGAGTTT TCTTTTAGAA CAGCCGGTAA AATACCTGAA CTTTATGATG CTGTAACTAA TGAAACTATT GCCCTGAAAG CCTGGAAAAC TGAGGCTGGT CGTACAAGAA TGCAGCTAAA GCTGGCCCCA AATGCCTCCG CCTTTGTCAT CTTTAAAGAT AGAGCGGCTA CTACTGCCTC TGCTACAGGC AAAAACTGGA AAGAACAGCA ATTGGTAAAC ACGATAAATG GGAGCTGGAA AGTACGTTTT GACCCTGCTT ATGGCGGTCC GGAAAAGCCG GTAACTTTTG CTGCGCTGAG CGATTGGAGT AAACATCCGG ACAGCCTGAT CAGATATTAC TCAGGTTCAG CGGAGTATAG TAATAATTTT AAAGTTAAAA AACAGGCCGG GCAGCAATAC TTGCTGGATT TGGGAACGGT AGGCTGTATC GCAGAGGTTC GGGTAAACGG AATTTCTTGT GGTGTGGTCT GGACTGCGCC TTACCAGGCC GACATTACAG CTGCTCTAAA AAATGGGGAC AATGTGCTGG ACATAACGGT AACCAATACC TGGGCCAACA GGATCATCGG CGATCAGCGT TTGCCGGAAG ACAGGAGAAT CACAAAAACA AATGCGCCAT ACCGTTTGGA AGGCAAGCCA CTGAATGAGG CTGGCTTGCT TGGTCCGGTA AGGCTATTTA AACAGGAACA ATCAAATTAA
|
Protein sequence | MNLNLSKTIA FVLLTCSLQP GAAMAQANES AKPWVLWHWI KGGVSKPGIT ADLEAMKSAG IGGAYLLSIK DVPNPPLFNP SVRQLTPQWW DMVTFAMQEA RRLNLKLGMH VSDGFALAGG PWITPELSMQ KVVSTQLNIK GETAEKIKLQ QPETLEGYYK DIAVYAYPSA EGAGISTQTV IPEITTSNAA DASGLIRPGN TKNFGSNEAC WIQYAFKQPF TTRTIRISTG SNNYQAQRLE IQVSDDGENF RSVGRLEPPR HGWQDTDADV THSIVPVTAR YYRFVYDKKG SEPGSEDLDA AKWKPSLKLR NLELSAEARI NQFEGKAGLV WRISKASTKE TLEDHLCVPI DKIINLSDKI KADGSLDWKA PKGNWTILRI GHTSTGHKNA TAGAGMGLEC DKFNPAAVKL QFDSWYGAAL KHGGPEIASK VLNELFVDSW ECGSQNWSPL FAAEFKKRRG YDLMRYLPVM VGIPLGSVEL SEAFLHDVRK TIAELVADQF YYTLSSLTKE KGVTFAAENV APTMLSDGLL HYKNVDMPMG EFWLNSPTHD KLNDMLDAVS GAHIYGKNII QAEAFTTIRM DWNEHPGNMK TLQDRNYALG INKLVYHVFA HNPWLDQKPG MTLDGIGLYF QRDQTWWKLG KAWVDYATRC QALLQQGKPV ADVAVFIGEE LPSRSVLPDR LVNLLPGLFG AEVVAAEAKR LANTGEPLRQ KPAGVTHSAN MADPEDWVNP LRGYAYDSFN PDVLLDASVK DGRVVFASGA SYAVLVIPGK MLLNPNYQYM SKEVAAKLNE LAKAGASLVL GERPKFQLGM PKTGSDQDFE TLLTELWDGN FKTAGQGQQQ LSLKTLGKGR IIKGAYQAES FDLIGLERDL LVMEGNGDYA KKVAYTHRIA ADKEFYFVAN QENKKRLLEF SFRTAGKIPE LYDAVTNETI ALKAWKTEAG RTRMQLKLAP NASAFVIFKD RAATTASATG KNWKEQQLVN TINGSWKVRF DPAYGGPEKP VTFAALSDWS KHPDSLIRYY SGSAEYSNNF KVKKQAGQQY LLDLGTVGCI AEVRVNGISC GVVWTAPYQA DITAALKNGD NVLDITVTNT WANRIIGDQR LPEDRRITKT NAPYRLEGKP LNEAGLLGPV RLFKQEQSN
|
| |