Gene Information |
Locus tag | Phep_3492 |
Symbol | |
ID | 8254612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4156066 |
End bp | 4159029 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644937142 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003093745 |
Protein GI | 255533373 |
COG category | |
COG ID | |
TIGRFAM ID | |
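The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4156066
end_bp = 4159029
gene_length_bp = 2964
protein_length_aa = 987


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the stop codon is counted in the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.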
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAACC TGAACCGAAG AAAATTTATA CAGATCAGCT CCCTGGTTGC AGCAAATATG GCTGTATTTA GCTTAAACAG TAAAGCCAGC ATTTTTGATG AAAAACTGAC TGAAGAAGAA CTGACTTTAA AGCTCCTGAA AGCTGGGTTT ACGAACCCGC CCGATAGTGC TAAAGCCTCT TGTTACTGGT GGTGGTTTAA CAGCCTGGTA GATGAGGAAG GGATTAGTTT GGACCTTGCT GAATTTAAAG CAAAAGGTAT GGGCGGCGTA TTGCTGATCA ACTCGATCAG CGGGTTTGGC AGTGGCCCGA TCCCTGAAGG CCCGAAATTT TTATCGGATG AGTGGCGTTC ACTTTTTAGG CATGCTTTAA AAGAGGCCGA CAGGCATGGC ATTGAGGTAG GGGTAAACCT GAGCACTGGT TGGGCAATGG GGGGCGCCTG GATTAAACCG GAAGATTCAG GGAGGTGGTT GCTGCAGGCC AAAACAAGGC TGAGCGGTCC GGCCCGTTTT TCCGGAAAAC TCCCTTTGCC CGGTAGCAGG GATGGTTATG ACAATGCCAA ACAGCTTTTT ATAAAAGATT ATATTGACCT GCCGATGGAA CAGCTGGATT ACAGGGATAC TTCGGTAGTG GCTTATAAGG AAATTCCGGG TGCACTGTCG TTAAAAGATG GCGACCGTGC CAGATCATTT GCAGCAAAAA GCAACCGGCT CGATGCAAGC AGCCATGCGG AAGCCGCAGA GGTAATGGAG CCGACTTTGT TGCCATGGAC GGCCTCGGCT GATGATAAGC CTGTTTCTTC TTCGGATGTG ATAGACCTGA CTTCGAAGGT AACGACAGAC GGGCAGCTGG AATGGGATGT GCCAGAAGGG AACTGGATCT TGATCAGGAC GGGGCATCGG ATGACCGGCG CGCGTACTGC CTATGCATTG CCTGAAGCGG CAGGGCTGGA GATCGACTGG CTGGATAAGA AAGGGGTAGA GCGGCAGTTT GAACATCTGG GCAATATCCT GCTGAAAGAA GCAGGGGAGT TTAAAGGCAA ATCGCTGAAG TATTTTCATG ACGATAGTTT TGAAGATGGC TTTCCGAACT GGACATCAGA ATTTTTAAAA CATTTTAAAG GGTACCGTGG CTATGATGCC ACGCCTTACC TGCCTGTTTT TGCAGGTGTG GTAATAGACA GTGCGGAAAT ATCTGAACGC TTTTTATACG ATTACCGGAA AACGGTAGCA GATTGTATGG CCGATGGGCA TTATAAACGT TTTGCAGAAC TATGTCATGA AAATGGCTTG CAGGTACAAA GCGAGGCTGG TGGGCCCAGC TGGTCAGGTA CGATGTGTAT GGATGCGCTT AAAAATCTTG GTCGCATGGA TTTGCCGATG GGCGAATTCT GGCAGGGTAA AACATTTGTA CAGCATGACC AGAACCAGGT AGGAAAGCTG GTGGCTTCGG CAGCTCATAT TTATGGCAAA AAAAAGGTTT CAGCTGAAGC GCTTACTTCA TTTAAGCCGC ATTGGAGTGA TTCGCCCGAA AGCCTGAAGC CGGTGGCCGA CCGTGCTTTC TGCGAAGGTA TAAACCGTTT TGTGATCCAT ACTTCAACCG CTACACGTCC ACGCGACGGC AAACCCGGAT ATGAATATGG TGCAGGCACA CATTTTAACA GGAACATCAC CTGGTGGGAA AAAGCAGGTT CTTTTCTGGA TTATGTGAAC CGCTGTCAGT ATTTGTTACA GTCGGGCCTT TTTGTGGCCG ATGTGCTTTA TTACAATGGC GACTGGGCAC CAAACCTGGT AGCACCCAAA 
CGTACCGACC CGGCTTTAGG GAAAGGCTAC GACTATGATG TATGCAATGA GGAGGTGCTG CTAACCAGAC TGAGTGTAAA AAACAAACGG ATCGTATTGC CCGATGGCAT GAGCTATGCG CTTATGGTAT TACCTGAAGT AAGTTTTATG CCCTTACCTG TAGCCAAAAA AATCAGGGAA CTGGTAAAGG CCGGGGCTAC CATTATTGGC CCTAAACCGG TAAAAGATCC GGGTTTAAAA AACTATCCGC AATGCGACCA GGAACTGGAT CAAATAGCTG AGGAGATTTG GGGTTCTGCT GCGGTGATTA CGGGCCAGAC TGTCCGTGCT GTTTTATTGA ACAAAGGCAT AGTTCCAGAT TTTGAATATA CAGGTGAAGC CAATTACATT GATTTTATTC ACCGTACTAC AAAAGACGCG GAGATATATT TCCTGGCCAA TCGTAAAGAA GCTGCTGCTA AAACTACCTG TACGTTCAGG GTGAGTACAG GTTACCACCC CCAGCTGTGG GATGCTGTAA GTGGGCGCAT CTTGCCCATG CCGGTATATA AAGCAGCTAA GGGCAGGATT GCAATCGAAT TTGATTTTCT ACCACACCAC TCCATATTTG TGATCTTTGT GAAATCGGCC AGGCCTGTTT TAAAGCCAGC TGATGAATGG CTGTTCCAGG GCAGATTGGG TTTAGCGCTC TTACAGGAAA TCAAGGGCAC TTGGGAAGTA AGTTTTGACC CCAAATGGGG CGGGCCTGAA AGGGTAACCT TTAATAGTTT ACAGGACTGG AGCAAAAGTG AGGATGAGCG GATCAGGTAT TATTCGGGAA AGGCCATATA CAGAAAACAG TTTGACCTGG ATATGCCATT GGCGAAAGGA AAACAATTAT TTCTGGATCT GGGGGTAGTT AAAAATATTG CCAGTGTAAA GCTGAACGGT AAAGACCCGG GAACCATATG GACTGCGCCC TGGATGGTGG ACATCAGCGG AGCACTTAAA ACTTCGGGTA ACCAGCTGGA AATAGAAATC ATCAACCTTT GGCCGAACCG TTTGATCGGG GATGCTGCAT TGCCCATTGA AAAACGGTTG ACAAACACCA ATATCATCTT TAAAAAAGAA GATAAACTGT TATCTTCAGG TTTGCTTGGC CCGGTCAGCA TTCAGGTCCG CTAG
|
Protein sequence | MPNLNRRKFI QISSLVAANM AVFSLNSKAS IFDEKLTEEE LTLKLLKAGF TNPPDSAKAS CYWWWFNSLV DEEGISLDLA EFKAKGMGGV LLINSISGFG SGPIPEGPKF LSDEWRSLFR HALKEADRHG IEVGVNLSTG WAMGGAWIKP EDSGRWLLQA KTRLSGPARF SGKLPLPGSR DGYDNAKQLF IKDYIDLPME QLDYRDTSVV AYKEIPGALS LKDGDRARSF AAKSNRLDAS SHAEAAEVME PTLLPWTASA DDKPVSSSDV IDLTSKVTTD GQLEWDVPEG NWILIRTGHR MTGARTAYAL PEAAGLEIDW LDKKGVERQF EHLGNILLKE AGEFKGKSLK YFHDDSFEDG FPNWTSEFLK HFKGYRGYDA TPYLPVFAGV VIDSAEISER FLYDYRKTVA DCMADGHYKR FAELCHENGL QVQSEAGGPS WSGTMCMDAL KNLGRMDLPM GEFWQGKTFV QHDQNQVGKL VASAAHIYGK KKVSAEALTS FKPHWSDSPE SLKPVADRAF CEGINRFVIH TSTATRPRDG KPGYEYGAGT HFNRNITWWE KAGSFLDYVN RCQYLLQSGL FVADVLYYNG DWAPNLVAPK RTDPALGKGY DYDVCNEEVL LTRLSVKNKR IVLPDGMSYA LMVLPEVSFM PLPVAKKIRE LVKAGATIIG PKPVKDPGLK NYPQCDQELD QIAEEIWGSA AVITGQTVRA VLLNKGIVPD FEYTGEANYI DFIHRTTKDA EIYFLANRKE AAAKTTCTFR VSTGYHPQLW DAVSGRILPM PVYKAAKGRI AIEFDFLPHH SIFVIFVKSA RPVLKPADEW LFQGRLGLAL LQEIKGTWEV SFDPKWGGPE RVTFNSLQDW SKSEDERIRY YSGKAIYRKQ FDLDMPLAKG KQLFLDLGVV KNIASVKLNG KDPGTIWTAP WMVDISGALK TSGNQLEIEI INLWPNRLIG DAALPIEKRL TNTNIIFKKE DKLLSSGLLG PVSIQVR
|
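The sequences above are printed in space-separated 10-character blocks, so they need to be cleaned before any computation. The following is a minimal sketch of stripping that grouping and computing a GC percentage; the helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence, used as a small demo.
demo = clean_sequence("ATGCCGAACC TGAACCGAAG AAAATTTATA")
print(len(demo), round(gc_percent(demo), 1))
```

Applying the same two functions to the full Gene sequence field gives a value that can be compared with the 48% GC content listed in the Gene Information table.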