Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3638 |
Symbol | |
ID | 8254769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4351675 |
End bp | 4353975 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937299 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003093891 |
Protein GI | 255533519 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.456784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG CTAACTTATT TATCCTTACG CTCATTTTTA CTGCCCATAT AACTTTTGCC CAAACCAAAA AAACGGGCAA AGCTACCCAT CAGCAAATGA ATACCTTCAT CAGTAACCTG ATGTCAAAAA TGACATTAGA CGAAAAGATC GGGCAGCTAA ATCTGCTTAC CGGTGGCGAA GCTACTACGG GCTCGGTAGT AAGCACCGAT GTAGAAAGTA AAATCAAGAA AGGACAGGTA GGAGGTATTT TTAGCTTAAC CACCCCTGCA CGCATCCGTA AGGCCCAGGA AATAGCCGTT AATCAGACCC GGCTCAAAAT ACCTATCATA TTTGGTCAGG ATGTGATCCA TGGATACAAA ACTACTTTCC CTATTCCGCT GGCCTTATCC AGTACCTGGA ACATGGAAAT GATCAAAAAA ACAGCCCGGA TTGCCGCCAT AGAAGCTACA GCAGATGGGC TGAACTGGAC CTTCTCGCCC ATGGTCGATA TTTCCAGGGA CCCCCGCTGG GGCCGTATTT CTGAAGGTTC AGGAGAAGAC ACCTATCTGG GCTCAGAAAT TGCCAGGGCC ATGGTAAAAG GCTACCAGGG TGATGACCTT GCTAAATACA ATACCATGAT GGCCTGTGTA AAACATTTTG CTTTATACGG GGCGTCAGAA GCGGGCAGAG ACTACAATAC CACGGATATG AGTCTGGATC GCATGTATAA TGAATACTTG CCGCCCTATA AAGCTGCGCT TGATGCGGGC GCAGGCAGTA TCATGACCTC CTTTAACGAC ATCAATGGAG TGCCTGCCAC TGCAAACAAA TGGCTCATGA CTGATCTTTT GCGTAAAGAA TGGGGGTTTA AAGGTCTTGT GGTTACCGAC TATACTGCGG TTAACGAGCT GATAGACCAT GGCCTGGGCG ATCTGAAAGC CGTATCGGCC CTGTCCATTA ACGCAGGTGT AGACATGGAC ATGGTTGGCG AAGGCTTTTT AACCACCCTG AAAAAATCTG TACAGGAAGG CAAAGTAAAA GCACAACGCA TTGATGAAGC CTGCAGATTG GTTTTAGAGG CCAAATATAA GCTGGGCTTA TTTGACGATC CTTTCCGTTA TTGCAATGAA GAAAGGGCTA AAACGGAAAT TTTAAAACCT GAACACCTTG CTTTTGCCCG AGAGGTAGCT GCCGAATCTT TTGTGCTTCT GAAAAATGAA AACCAGACCT TGCCCCTTAA AAAAACGGGT ACCATAGCTT TAATAGGCCC CCTGGCCAAT ACAGGGGCCA ATATGCCTGG TACCTGGAGT GTAAACAGCG ACCTGGCCAA TACCGCCTCT CTTTTAACAG GCATGAAGGC CGTTTTGGGC AAACAGGTAA AAGTGGTACA CTGCCTGGGT TCAAACCTGG TAACAGATGA GGCCTACCAG CAGCGTGCCA CCATGTTTGG CCGCGACATT CCAAGAGACA ACCGCCCTGA AGCAGAAGTG ATAAAAGAAG CTGTGGAACT GGCCAAAACT GCAGATGTAG TTATTGCTGC ATTAGGTGAA AGTTCTGAAA TGAGCGGCGA AGCCTCCAGC CGTACCAACC TGGAAATCCC GGAAGTTCAG CAGCGTTTGT TACAAGCCCT GTTAAAAACA GGTAAACCTG TTGTACTGGT CTTGTTTACC GGCAGGCCAC TGGTACTGAA CTGGGAGCAG CAAAACGTAC CGGCCATTTT AAATGTCTGG TTTGGTGGTA CAGAAACGGC AAAAGCAATA ACTGATGTAC TGTTTGGCGA TGTGAACCCT TCGGGAAAAT TAACAGCCAC CTTCCCGCAA AATGTGGGGC AGATTCCCTT ATATTATGCG CATAAAAATA CCGGCAGGCC ATTGGCAGAT GGAAAATGGT TCAGTAAGTT CCGCTCCAAC TATCTGGACG TAAGCAACGA ACCCCTATAC CCTTTCGGTT ACGGCCTAAG CTATACCAGT TTTGCCTACA GCAATTTAAG GTTAAGTAAA AACAGCTTTA AACCTGGTGA ATCTATTACC GCCAGCATCG ATATTAAAAA CATCGGTTCA AGGGAGGGCA AGGAAGTGGT ACAATTGTAC ATACGCGACC TGGTGGGCAG TTCTACCCGT CCGGTTAAAG AACTAAAGGC TTTCCAAAAG ATCAGCCTAA AACCCGGAGA AAGCAAAACT GTGAGCTTTA AGCTGACCGA AAATGATTTG AAGTTCTATA ATACGGCTTT AAGGTTTGTT GCCGAACCCG GTGATTTTAA CTTGTTTATT GGCGGAAATT CCAGAGATGT GCTGGAAACG AAGTTTAGTT TAAGAAATTA A
|
Protein sequence | MKIANLFILT LIFTAHITFA QTKKTGKATH QQMNTFISNL MSKMTLDEKI GQLNLLTGGE ATTGSVVSTD VESKIKKGQV GGIFSLTTPA RIRKAQEIAV NQTRLKIPII FGQDVIHGYK TTFPIPLALS STWNMEMIKK TARIAAIEAT ADGLNWTFSP MVDISRDPRW GRISEGSGED TYLGSEIARA MVKGYQGDDL AKYNTMMACV KHFALYGASE AGRDYNTTDM SLDRMYNEYL PPYKAALDAG AGSIMTSFND INGVPATANK WLMTDLLRKE WGFKGLVVTD YTAVNELIDH GLGDLKAVSA LSINAGVDMD MVGEGFLTTL KKSVQEGKVK AQRIDEACRL VLEAKYKLGL FDDPFRYCNE ERAKTEILKP EHLAFAREVA AESFVLLKNE NQTLPLKKTG TIALIGPLAN TGANMPGTWS VNSDLANTAS LLTGMKAVLG KQVKVVHCLG SNLVTDEAYQ QRATMFGRDI PRDNRPEAEV IKEAVELAKT ADVVIAALGE SSEMSGEASS RTNLEIPEVQ QRLLQALLKT GKPVVLVLFT GRPLVLNWEQ QNVPAILNVW FGGTETAKAI TDVLFGDVNP SGKLTATFPQ NVGQIPLYYA HKNTGRPLAD GKWFSKFRSN YLDVSNEPLY PFGYGLSYTS FAYSNLRLSK NSFKPGESIT ASIDIKNIGS REGKEVVQLY IRDLVGSSTR PVKELKAFQK ISLKPGESKT VSFKLTENDL KFYNTALRFV AEPGDFNLFI GGNSRDVLET KFSLRN
|
| |