Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2717 |
Symbol | |
ID | 8253825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3199322 |
End bp | 3202459 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936365 |
Product | Beta-galactosidase |
Protein accession | YP_003092980 |
Protein GI | 255532608 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.945973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC TCAAAACCTA TGTTGCATTT TTATTCTTGT GGCTATTTGG CTTTGTCCAG CTTGCTGTTG CCCAGAAAGC ACCCTGGCTT GATGAAAAAA ACAGTGAAGA AAACAGGCTG CCTATGCATG CCGCTTACTT TGTTTATGAA AACGAAGCAG TGGCCAAATT GGGTGACTGG AAAAAATCAA AAAATTACAT TAATTTAAAT GGCGCCTGGA AATTTAAGTT TGTTGATTGT CCTGCCGCCT TACCGGAAAA TTATTATGCC CTTAATTTTA AAGACCAGGA TTGGGATGTG TTTAAAATTC CGGCCACCTG GGAAGTAAAT GGCTATGGAT ATCCAATTTT TGTAAACTAT CCGAATGAAT TTCGTGACCG GATGAAACCC AATCCTCCAC TGGTACCAAT GGATTTTAAC CCCACGGGCG TTTACCGCAG GCAAATTGAA ATCGGTAAAG ACTTTGCCGG AAAGCAAGTT ATCTTACACA TTGGCGCAGC AAAATCGAAC ATCCAGATTT GGGTGAACGG AAAATATTCC GGCTATGGCG AAGATGGAAA ACTACCTTCA GAATTTGACA TCACCAAATT GGTAAAACAG GGCCAAAATT TAATTGTGCT TAAAGTAATG CGCTGGAGCG ACGGTACTTA CTTAGAAGAT CAGGACATGT GGCGGGTAAG TGGTATTGTC CGCGATTGTT ACCTGCTGGC CAGGAACACC ATCCATTTAG CAGACATTGA AATTATGCCG GATTTAGATG CAGCCTATCA GAACGGATTG CTGCATGTTA AAGTTTCACT CAGTACACCG GCAAAGGTTA CTGCACTTTT CGAATTGCGT GATGGCGAAA AAATAGTGGC CAAAAAAAAT ATTGCCTTTG ATGGTAAGCG CAACAGGGCA ATTGATGTGA GCGTAAACGA TCCCATATTA TGGAATGCAG AAAATCCATA TCTATACCAG GCAACCTTCA AATTATTGGA CCGGTCAGGC AAAATTACCG AAGTTATTTC ACAGAAGGTA GGCTTTAGAA AGGTAGAAAT GAAGAATGGG CTGTTATTGG TAAATGGAAA GCCAATTCTG ATCAAAGGGG TAAACCGGCA CGAAATAGAC CCGGTTTCCG GGCAAACGAT CTCGAAAGAA ATTATGTTGC AGGACATCAG GCTGATGAAA AAATTTAATA TCAATGCAGT AAGAACAAGC CATTACCCGA ATGACCCGTA TTGGTACGAA CTCTGTGATG AATATGGCAT TTACATGGTG GCAGAAGCCA ATATAGAATC ACATGGCGTG GGCCCGCTGG CATACCATGA ATTCAACCTT ACAAAAGGGC TGGGTAATGT TCCATCCTGG CGCGATGCCC ATATGCTTCG CTTAAAAAGA GCAGTTGAGC GTGATAAGAA CCATCCTTCA ATTGTCATAT GGAGCCTGGG CAATGAAGCA GGGGCAGGCT ATAATTTTTA TGAAACCAGA CAGTGGCTGA AACAACGGGA TACCACCCGG CTGGTGCAAT ACGAAGGCGC CATTATCGAT TATACCAGGT ACATTACCGA TTGGAATACC GACATCATTA ACCCCATGTA CCCAGAGCCA GACAACATGC TGGCCTATGC AAAAAGTACA CCTCATCCTG CAAAACCTTT CATTATGTGC GAATATGCCC ATTCTATGGG AAATTCGTTG GGTAATTTTA AAGATTATTG GGATCTGATC AGAGGCAACC CTCATGCTTT TCAGGGTGGT TTCATTTGGG ATTTTGTAGA CCAGGGCCTG CTTAAAATCA CGGCACAAGG TGATACGATC TATACCTATG GAGGAGATTA CGGGCCGCCA ACAGCGCCAA GCGATAACAA TTCAATGAGC GATGGCGTGT TCCAATCCAA CAGAAAACCC GATCCTGAGG CCTGGGAAAT GAAAAAAGTT TATCAGGACA TTCACAGTAC CTGGATGGGG AACAACAAAG TTGAAATTTA TAACGAACGC TTTTTTACTG ATTTAGCTGA TGTAACCTTA AAATGGGAAT TAATGGCAGA TGGGAAAATC GTCCAAAACG GCGAAGTAGC ATCACTTCAT GTGCTGCCGC AAAAAAAAGA GACCATTGCC CTGCCACTGC AGATGCAAAC AGGGGAAGTT TTTTTAAACC TGACTTATCT AACAAAGCGG GCTAAAAACC TTGTTCCGGC TGCCCATATC CTGGCCTGGG AACAATTGCC GGTTTCAGGC GGCCAGCTGC AGGCAGTGCA GGTACGTGGA ACCGAAAAGC TAAACTACAC AAAAGAAGCA GACGCCCTAT CAGTTTCTTC TGCGAATGCC GCACTCAGGT TTAATAAAAA AACAGGCCTG CTGAGCCAGT ATGCCGTAAA TGGAGTAAAT TACCTGGCAA CAGCAACAAC CCTCGAACCT GATTTCTGGC GTGCACCAAC CGATAACGAC ATGGGCGCCA ATTTGCAGAA AACGCTTAAA GATTGGAAAA TTGCCATGAA AAATATGCAG TTAACGGCTT TTGATGTCAA CCAAAACAAC AACATAGTTA CCGTAAAAGC CAGCTACAAC CTGGCCGAGG TTATGGCTAA ACTGAACATC AGCTATCAAA TTAATGCCAC CGGAGAAATA CTGGTAAAGC AGGACTTAAC AGCCGATACC ACACAAAAAA CAGGCCCGAT GCTGTTTAAA TTTGGTATGA AAATGATTCT TCCCCCTGGT TTTGAAAATT TAGATTATTA TGGAAGAGGA CCGTTTGAAA ACTATCAGGA TCGTTATACG GCTGCATTGA TTGGTATTTA TCACCAATCC GTAAAGGCAC AATTTAATGC CTACACCAGG CCTCAGGAAA CCGGAACCAA AACCGATGTC AGGTGGCTTG AACTTAAAAA TGAACAAGGA AAAGGGATCA GGGTTGAGGC AGCAGTGCCC TTAACTACAA GCGCTTTACA TTTTTATACT GAAGATCTTG ATGATGGTGA GGAAAAGCAC CAGCGCCATT CAGGAGAACT TAAACCAAGA AAAGAGACAC AACTGAATAT CGATTTTAAA CAAATGGGTG TTGGAAGTGT AAACAGCTGG GGCGAATTGC CGCTAAAACA ATATTTGTTG CCTTACCAGA ACTATAGCTA CCAGTATAAG ATAATTCCTT TAAATTAG
|
Protein sequence | MKNLKTYVAF LFLWLFGFVQ LAVAQKAPWL DEKNSEENRL PMHAAYFVYE NEAVAKLGDW KKSKNYINLN GAWKFKFVDC PAALPENYYA LNFKDQDWDV FKIPATWEVN GYGYPIFVNY PNEFRDRMKP NPPLVPMDFN PTGVYRRQIE IGKDFAGKQV ILHIGAAKSN IQIWVNGKYS GYGEDGKLPS EFDITKLVKQ GQNLIVLKVM RWSDGTYLED QDMWRVSGIV RDCYLLARNT IHLADIEIMP DLDAAYQNGL LHVKVSLSTP AKVTALFELR DGEKIVAKKN IAFDGKRNRA IDVSVNDPIL WNAENPYLYQ ATFKLLDRSG KITEVISQKV GFRKVEMKNG LLLVNGKPIL IKGVNRHEID PVSGQTISKE IMLQDIRLMK KFNINAVRTS HYPNDPYWYE LCDEYGIYMV AEANIESHGV GPLAYHEFNL TKGLGNVPSW RDAHMLRLKR AVERDKNHPS IVIWSLGNEA GAGYNFYETR QWLKQRDTTR LVQYEGAIID YTRYITDWNT DIINPMYPEP DNMLAYAKST PHPAKPFIMC EYAHSMGNSL GNFKDYWDLI RGNPHAFQGG FIWDFVDQGL LKITAQGDTI YTYGGDYGPP TAPSDNNSMS DGVFQSNRKP DPEAWEMKKV YQDIHSTWMG NNKVEIYNER FFTDLADVTL KWELMADGKI VQNGEVASLH VLPQKKETIA LPLQMQTGEV FLNLTYLTKR AKNLVPAAHI LAWEQLPVSG GQLQAVQVRG TEKLNYTKEA DALSVSSANA ALRFNKKTGL LSQYAVNGVN YLATATTLEP DFWRAPTDND MGANLQKTLK DWKIAMKNMQ LTAFDVNQNN NIVTVKASYN LAEVMAKLNI SYQINATGEI LVKQDLTADT TQKTGPMLFK FGMKMILPPG FENLDYYGRG PFENYQDRYT AALIGIYHQS VKAQFNAYTR PQETGTKTDV RWLELKNEQG KGIRVEAAVP LTTSALHFYT EDLDDGEEKH QRHSGELKPR KETQLNIDFK QMGVGSVNSW GELPLKQYLL PYQNYSYQYK IIPLN
|
| |