Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0331 |
Symbol | |
ID | 8251416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 384334 |
End bp | 387351 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644933979 |
Product | Heparinase II/III family protein |
Protein accession | YP_003090617 |
Protein GI | 255530245 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAA AATTTAGCCT GATCCATGTA TTCATGGTCC TGATGTTATT GGGACATACT AATGCCCATG CCGGCCGGAT CTATGGTGGC TTCTATACCG CTGAAAAAAT AGCCAATGTA AGAAGCAATT GTAACAGATA TGATTGGGCT GCAAAACAAA GGGACCGGTT TATTGCCCAG GCAAAATACT GGCTGGCCAA AGATGACGAA ACGCTATGGG CCATGGTTCC CGGACAGGAC CTGCCGCGAT GCATAGATGT TACCTTTGAC AGGCTGACCG AGGGCCCTAA ATTTTTAGGT TGTCTGAAAT GCGGACACAA CATTTCAAAA TATGGTAATT ATCCATACAA TCCTGAATTC GAGCAAAAGC CCTGGAAATT GACCTGCCCT TCCTGCGGCA CTGTTTTTCC TACCAACGAT TTTGGTAAAT ATTATAAAAG TGCCATTGAT GAACATGGTC TTTTTAATCC GGCAAGTGGC GACAAGAGCC TGCTTTACAA CCTGGAACAT CCTGATCCAA AAGATCCGCT GCATAAATAT GGTGTAGACG ATGGATTTGG TTATGTAAAT GAAAATGGGC GGTCGTACAG GTTCATCGGA TATTATACCT GGAAGTACTG GGACCATATT AATGCAGGAC TCAACGCCCT GGCAAATGCA TTTTTGTATA CCGGAGATCA GCGCTACGCA CATAAGGCGG CAATTTTGCT GGACCGCATA GCGGATGTTT ATCCCGACAT GGATTGGGCG CCTTATTCAA AAAAAGGCTG GTACCATTCG GATGGAGGGA CAAACAGAGG TAAAATAGAG GGAGCCATCT GGGAAACCGG TACAGTTCAG GGCTTTGCAG ATGCTTATGA TAAAATTTTA AGTGGTACGG TTAATGATGC TTCACTTTAT TCATTTTTAA AAAAGCAATC GTTGAAATAT AAACTGCCCG GGGCCAAAGG CACACGGGAC CTGTTTGTAA AAAATGTTGA TGACGGAATC TTGCGTACAG CTTTTAAAGG CGTGCTTTCA AAACAGATCT GGGGCAACCA GGGTATGCAC CAGTTAACTG TTGCCAGATG TGCAGTAGCT TTAAATACGG CCCCTGAAAC TACGGAATGG CTCGATTGGT TGTTTACCCA TGATGGAGGA AACATTCCGG GCCTGATGAT CCGTAACCTG GACAGAGATG GTACTACCGA TGAAGGTGCT CCGGGGTATA CTTATATGTG GGGAAACCTG ATCGCCAAAC TCGGGGTGCT CCTGGCTGAC TACAAGAGTT ATACCAAGCA CGATATTTTT GCCGAATATC CTCAGTTCAA CGCTACATTT ATGGCGGCCT ACCGCATGTC TGTATTAGGG ATTTCGATAC CTAATATCGG CGACGCAGGC GCTACAGGTA CGGTTACCAA CAGTTATATC GACCCCAATT TTATTGCCCT GGGGTATTTT TATACCAGAG ATCCGCAAAT GGCAATTGCA GCCTACCGGG CAAATGGTAA TTCTGTAAAG GGGCTGGGAA TGGACATCTA TTCCAAAGAT CCTGAATCAC TGAGCCGGGA GATCAAAAGC GTTGCAGAAA AGAACCCCGC AAATGACGGA AGAGGGGGAT TGATGAGCGG ATTTGGCCTG GCCTCACTGG AAATTGGTAA AGGCACATCG GGCATTGCAC TGGCCAGCAA TTTTGGACGG AGCATTAAAC ATGCACATCC GGATATGCTC AATTTTGACC TGCTGGCTTA TGGCAACTGG CTTGCGCCTG ATCATGGATA CCCCGAATAT GCAACAAAAT GGCCAAGTAA CAACGAATGG ACAGGCAGTA CGCTTTCGCA CAACCTGGTA TTTGTAAACG GGCTTCCGCA AAAGGAAGTA TGGGGCGGAC ATACCCGGAT GTTTAAACAG CTGAAAGGAT TTGGGGCTTT TGAACTTGAT GGAAAAAAGG CCTATCCGGA TGTAAAGGAA TACAGTCGGA CAATGTTGTT GATCGAAGGA CCCGATACCG GCAATGCCTA TGCAATAGAT ATATTCCGGG TATTGGGTGG TCATGATCAT CTGTACAGTT TTCATGGCCC GCCAGGAACG ATCAGTACCG AAGGATTAAA GTTAAAGCCC CAGCAGGGAG GAACTTATGC CGGAACTGAA GTAGCCAAAG GTACGCTTGC CAAAGGTTTT CCAATTGGTT ATTCACATTT ATACAATGTT AAAAGAGACA CCATTCCTCC CCCACAATTT ATGCTCGACT GGAAAGTGGA GGATGGCTAC AGAAATGTTA AAGCAACAGA CCATCTGCAT TTGCGTATGT ACGCGTTGAA CCAGGTGGAT GATGTGGCCT TGGCCGATGG TGATCCACCA CAGAATAAAA CGGGTAATCC TAAAACGCTC GGTTATGTTT TAATGCACCG CGCAGCACCA GCGCTAAACA GTAATTTCGT TAACCTGATT GAACCTTATA AGCAAAACCC CTTTATCAAA TCCGTAAAAC GTTTGGATGA GGGGAAAAAC ATGCAGGTAT CGCTAAAGAT AGAACATGTA AACGGAGAGA TAGATTATAT CCTTTATAAT CCGGATTCAA CACAAACTAT GCAGACTGCA GATGGATTGA AAATGGATGG GACCCTTGGC TATGTGAGAC AAAAGGGGGG TAAGCCTGTT GAAGGGATCC TGCTGAATGG CAAACGGCTC AGCTATGCAA ACATGAACCT GATAGCTGCA GGACCGATCA GGGGAAAAGT GGTGAAAATG AACAGGGAAT TAAAGGGGGG AGGATGGCTG CTGGTTGATC AGCAGCTACC TGTTGATGGA AGTTTAAACG GATCTCAGCT CATGGTCAGC ACAGAAGGGA AACGCGATGC CTGCTATTCA ATTGTCGGCA TTGAGCGTCA GGGCAATTTA ACCAGGGTTT ACTGCGGCCC CATTACTTTT GTTAATGATT ATAAAGGTGA AAATTATAAA GAAGGATTGA TGTATGACTT TGAAGAAGGC GCTGCATTTA CCATTACTTC TCATAAAATA TGGAAACAAA AAATTTGA
|
Protein sequence | MKLKFSLIHV FMVLMLLGHT NAHAGRIYGG FYTAEKIANV RSNCNRYDWA AKQRDRFIAQ AKYWLAKDDE TLWAMVPGQD LPRCIDVTFD RLTEGPKFLG CLKCGHNISK YGNYPYNPEF EQKPWKLTCP SCGTVFPTND FGKYYKSAID EHGLFNPASG DKSLLYNLEH PDPKDPLHKY GVDDGFGYVN ENGRSYRFIG YYTWKYWDHI NAGLNALANA FLYTGDQRYA HKAAILLDRI ADVYPDMDWA PYSKKGWYHS DGGTNRGKIE GAIWETGTVQ GFADAYDKIL SGTVNDASLY SFLKKQSLKY KLPGAKGTRD LFVKNVDDGI LRTAFKGVLS KQIWGNQGMH QLTVARCAVA LNTAPETTEW LDWLFTHDGG NIPGLMIRNL DRDGTTDEGA PGYTYMWGNL IAKLGVLLAD YKSYTKHDIF AEYPQFNATF MAAYRMSVLG ISIPNIGDAG ATGTVTNSYI DPNFIALGYF YTRDPQMAIA AYRANGNSVK GLGMDIYSKD PESLSREIKS VAEKNPANDG RGGLMSGFGL ASLEIGKGTS GIALASNFGR SIKHAHPDML NFDLLAYGNW LAPDHGYPEY ATKWPSNNEW TGSTLSHNLV FVNGLPQKEV WGGHTRMFKQ LKGFGAFELD GKKAYPDVKE YSRTMLLIEG PDTGNAYAID IFRVLGGHDH LYSFHGPPGT ISTEGLKLKP QQGGTYAGTE VAKGTLAKGF PIGYSHLYNV KRDTIPPPQF MLDWKVEDGY RNVKATDHLH LRMYALNQVD DVALADGDPP QNKTGNPKTL GYVLMHRAAP ALNSNFVNLI EPYKQNPFIK SVKRLDEGKN MQVSLKIEHV NGEIDYILYN PDSTQTMQTA DGLKMDGTLG YVRQKGGKPV EGILLNGKRL SYANMNLIAA GPIRGKVVKM NRELKGGGWL LVDQQLPVDG SLNGSQLMVS TEGKRDACYS IVGIERQGNL TRVYCGPITF VNDYKGENYK EGLMYDFEEG AAFTITSHKI WKQKI
|
| |