Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0942 |
Symbol | |
ID | 8252036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 1096353 |
End bp | 1099235 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644934597 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003091226 |
Protein GI | 255530854 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000970619 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAATTA TAGTAAAAAC TATTATCCTT TTATTTTTTG CACTTCCGGT TTGCGGGCAG AACCCGAAAA GTGTATTGCC AAAAGAAACG AATGTTATAG TTGATGCAGA ATTTCAGCGA CCCGGTAGTA CGTATGGTAC GCGTTGTTGG TGGTGGTGGT TAAACGGCAA TGTAACGCAG CAGTCTATTA CCCGCGACCT GGAGGAGATG AAGGCAAAGG GCTTTAGCGG TGCCTGTATT TTTGATGCTG GCGGAGCCAA CCAGGTTGGG AACAGACAGG TGCCCGAAGG GCCTATGTTT GGCAGCCCGG ATTGGCGGGC ACTTTATCTG TATGCCATTC GCGAAGCCAA GCGCCTGGGC CTGGTGATGT CTATGAACAT CCAGAGTGGC TGGAATCTTG GCGGCCCTGA TGTTAGCCCG GAAGAGGCAG CTAAACAGGT TACATTTTCG GAACTGGGCA TTAAAGGGGG AACAAAGATT AACCAAAAAT TAGTGCTGCC AGCAATAAGG GACGATTACT ATAAAGAAAT TGCGGTGCTC GCCTTTCCAG ATAAAAACAA AGCACATGCA CCTATACATG ATTTGGAGAA CAAAACTGCT TCCAAAGAAG CCGGAGGCTC TGTTCCCGAT ACCCGGCCAT TTTTAACAGA CATTGCCGGG GTGCCTGGTG AGGAAGATGC CCTGAGCAAA CAAGTACTCA ATATCAGTAA GTATTTTAAG GATGGGGTAT TGAATTGGGA CGCGCCTGCT GGCAACTGGA AGGTGATCCG CATAGGGTAT ACCACTACAG ATGCCCGGAC TTCAACTACC AGCGGGAAAT GGGATGGCAG GGTACTTGAT TTTTTGAGTG AAAAGTCATT TAACCGGTAC TGGGACACGC ATGTGGAGCC GCTGCTGCGC CTGATTGGCC CTATGGCAGG TACAACCTTG CGTTATCTGC AAACGGACAG TTTTGAAGGT GGGGGGATGA ACTGGACGGA TGGCTTTGCC GATGAATTCA AGAAAAGGAG AGGCTATGAC CTGACCCTCT TTTTACCGGT ACTGACTGGT AAAATAATTG AAAGCCGTTC TGTAAGTGTG CGTTTTCTGA ACGATTTCCG TAAAACAATA GGAGACCTGG TTTCAGAAAA ACATTACGGT ACTTTCGCAA AAAGATCCAG AGCCCATGGA ATAGGTATTT TACCAGAGTC TGCAGGACCA CACGCAGGTC CTTTTGACGG GCTTAAGAAC TACAGCCACA GTGAGGTGAT GATGAGTGAA TTCTGGTCGC CCAGCCCTCA CCGTCCGCGG CCGATAGACC GTTTTTTTGT TAAACAAGCT GCCAGTGCAG CTAAGATTTT CAATAAGCAG CTGGTTGGTG CCGAATCCTT TACCACGATC GGGAAGCATT GGAACGATGT GATCTGGGCA GACATGAAGC CGAGTGTAGA CCATGAGTTT TGTGCCGGCC TGAACCTGGT ATATTTTCAT ACCTTCACTT CTTCTCCTAA AGAAATGGGA ATGCCCGGCC AGGAGTATTT TGCCGGTACG CACTTTAACC CTAATGTGAC CTGGTGGAAT TATTCGACAG CATTTCTGAG TTACCTGACC CGCTGCCATT ACCTGCTGCA AAAGGGAACA GCACTTTCGG ATGTACTGTA TTATTATGGA GACCATGTGC CCAATCTGGG CAGGCTGAAA GAAGATGACC CTGCGGGGGC ATTGCCGGGA TATGATTACG ACCTGATCAA TGAAGACCGC CTGCTCGATC TGACGGTTAG GGACGGCAAA ATCGAGTTGC CCCATGGTGT AAATTACCGG GTGCTTGTTT TACCAGACCA TAAGATCTTG TCATTGGCAG TGATCAGAAA AGTAAAAGCA CTGGTAAATG CCGGGGCAAC GGTGATTGGT TTAAAGCCCC AATCTACCAG TAGCCTGGTG GGTTATCCGG CTGCTGAAAC GGAACTCAAT CTGCTGGCAG ATGAACTATG GGGAAAGGGC AATACCTCAG CTGGCGAAAA AGTACTGGGA AAAGGGAAAG TAATCTGGGG TAAAACTGCG GCCAATGTAT TGCTGGAAAG TGGCCTTCCT TATGATGTTA AGATTGTGGC TGAAAACAAA ACAGATAAGT TTGATTATAT CCACCGTTAT TTGCCTGATG GAACAGACAT TTATTTTATT TCCAATCAAA ACAATAAGCA GGTAGCCGTT TCCTGTACTT TCCGGATTGC TGACAGAATG CCCGAATTAT GGGATCCCTT AAAGGGCGAA ATCAGGGATG CAAAGGCTTA TCAACAAAAG GACGGCCTGA TTACGGTACC CTTGGTTTTT GACCCGAATG GTTCAGTGTT TGTGGTTTTC AGAAAGCCTC TTGGCAGTAA AGAAGGAAAG AAAACGGACA ATTATCCCGG ATACAAGGAG CTGGAGCAGC TTTCTGGTGC CTGGGATGTG CGTTTTGATC CCAGATGGGG TGGTCCTGCT TTAGTAAAGT TTGATACTTT GCAAAGCTGG ACAGAACGGC CGGAGGATGG GATCCGTTTT TATTCGGGGA CAGCAGTATA TGCCAAAAAT TTTAAGGCAA CGAAAACAAA ATCAAAAAGG CTTTTTCTGG ACCTTGGAGA AGTTAAGGAT GTAGGTATTG CTAAAGTAAA ATTAAATGGG AAAGATCTGG GTATTTTGTG GAGCCCGCCG TTCCGGGTGG AGATTACAGC TGCGTTGAAA ACCGGTGATA ATAAACTTGA GGTTGAAGTT GTGAATAGCT GGCGGAACCG CCTGATCGGG GATGACAGTT TACCAGCCGA TCAAAGGTTG ACGAAAACGA ATATTAAAGT TACTCCTGCA TGGAAAATTT TACCTTCAGG ATTGTTGGGG CCGGTTGTTT TGATGGAATC TAGTGAGAAA TAG
|
Protein sequence | MRIIVKTIIL LFFALPVCGQ NPKSVLPKET NVIVDAEFQR PGSTYGTRCW WWWLNGNVTQ QSITRDLEEM KAKGFSGACI FDAGGANQVG NRQVPEGPMF GSPDWRALYL YAIREAKRLG LVMSMNIQSG WNLGGPDVSP EEAAKQVTFS ELGIKGGTKI NQKLVLPAIR DDYYKEIAVL AFPDKNKAHA PIHDLENKTA SKEAGGSVPD TRPFLTDIAG VPGEEDALSK QVLNISKYFK DGVLNWDAPA GNWKVIRIGY TTTDARTSTT SGKWDGRVLD FLSEKSFNRY WDTHVEPLLR LIGPMAGTTL RYLQTDSFEG GGMNWTDGFA DEFKKRRGYD LTLFLPVLTG KIIESRSVSV RFLNDFRKTI GDLVSEKHYG TFAKRSRAHG IGILPESAGP HAGPFDGLKN YSHSEVMMSE FWSPSPHRPR PIDRFFVKQA ASAAKIFNKQ LVGAESFTTI GKHWNDVIWA DMKPSVDHEF CAGLNLVYFH TFTSSPKEMG MPGQEYFAGT HFNPNVTWWN YSTAFLSYLT RCHYLLQKGT ALSDVLYYYG DHVPNLGRLK EDDPAGALPG YDYDLINEDR LLDLTVRDGK IELPHGVNYR VLVLPDHKIL SLAVIRKVKA LVNAGATVIG LKPQSTSSLV GYPAAETELN LLADELWGKG NTSAGEKVLG KGKVIWGKTA ANVLLESGLP YDVKIVAENK TDKFDYIHRY LPDGTDIYFI SNQNNKQVAV SCTFRIADRM PELWDPLKGE IRDAKAYQQK DGLITVPLVF DPNGSVFVVF RKPLGSKEGK KTDNYPGYKE LEQLSGAWDV RFDPRWGGPA LVKFDTLQSW TERPEDGIRF YSGTAVYAKN FKATKTKSKR LFLDLGEVKD VGIAKVKLNG KDLGILWSPP FRVEITAALK TGDNKLEVEV VNSWRNRLIG DDSLPADQRL TKTNIKVTPA WKILPSGLLG PVVLMESSEK
|
| |