Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3797 |
Symbol | |
ID | 8254931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4555892 |
End bp | 4557904 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937461 |
Product | Heparinase II/III family protein |
Protein accession | YP_003094050 |
Protein GI | 255533678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAACCA TTAATCTAAA TCTTAAAGAT TGCATGACTA CGAAAATTTT TAAAAGGATC ATTGTATTTG CTGTAATTGC CCTATCGTCG GGAAATATAC TTGCACAAAG CTCTTCCATT ACCAGGAAAG ATTTTGACCA CATCAACCTT GAGTATTCCG GACTGGAAAA GGTTAATAAA GCAGTTGCTG CCGGCAACTA TGACGATGCG GCCAAAGCAT TACTGGCATA CTACAGGGAA AAAAGTAAGG CCAGGGAACC TGATTTCAGT AATGCAGAAA AGCCTGCCGA TATACGCCAG CCCATAGATA AGGTTACGCG TGAAATGGCC GACAAGGCTT TGGTCCACCA GTTTCAACCG CACAAAGGCT ACGGCTATTT TGATTATGGT AAAGACATCA ACTGGCAGAT GTGGCCGGTA AAAGACAATG AAGTACGCTG GCAGTTGCAC CGTGTAAAAT GGTGGCAGGC TATGGCCCTG GTTTATCACG CTACGGGCGA TGAAAAATAT GCAAGAGAAT GGGTATATCA GTACAGCGAT TGGGCCAGAA AAAACCCATT GGGCCTGTCG CAGGATAATG ATAAATTTGT GTGGCGGCCC CTTGAAGTGT CGGACAGGGT ACAAAGTCTT CCCCCAACCT TCAGCTTATT TGTAAACTCG CCAGCCTTTA CCCCAGCCTT TTTAATGGAA TTTTTAAACA GTTACCACCA ACAGGCCGAT TATTTATCTA CGCATTATGC CGAACAGGGA AACCACCGTT TATTTGAAGC CCAACGCAAC TTGTTTGCAG GGGTATCTTT CCCTGAATTT AAAGATTCAC CAAGATGGAG GCAAACCGGC ATATCGGTGC TGAACACCGA GATCAAAAAA CAGGTTTATG CCGATGGGAT GCAGTTTGAA CTTTCACCAA TTTACCATGT AGCTGCCATC GATATCTTCT TAAAGGCCTA TGGTTCTGCA AAACGAGTTA ACCTTGAAAA AGAATTTCCG CAATCTTATG TACAAACTGT AGAAAATATG ATTATGGCGC TGATCAGTAT TTCACTGCCA GATTATAACA CCCCTATGTT TGGAGATTCA TGGATTACAG ATAAAAATTT CAGGATGGCA CAGTTTGCCA GCTGGGCCCG GGTTTTCCCG GCAAACCAGG CCATAAAATA TTTTGCTACA GATGGCAAAC AAGGTAAGGC GCCTAACTTT TTATCCAAAG CATTGAGCAA TGCAGGCTTT TATACGTTTA GAAGCGGATG GGATAAAAAT GCAACCGTTA TGGTATTAAA AGCCAGTCCT CCCGGAGAAT TTCATGCCCA GCCGGATAAC GGGACTTTTG AACTTTTTAT AAAGGGCAGA AACTTTACCC CAGACGCCGG GGTATTTGTG TATAGCGGCG ACGAAGCCAT CATGAAACTG CGGAACTGGT ACCGTCAAAC CCGCATACAC AGCACGCTTA CACTCGACAA TCAAAATATG GTCATTACCA AAGCCCGGCA AAACAAATGG GAAACAGGAA ATAACCTTGA TGTGCTTACC TATACCAACC CAAGCTATCC GAATCTGGAC CATCAGCGCA GTGTACTTTT CATCAACAAA AAATACTTTC TGGTCATCGA TAGGGCAATA GGCGAAGCTA CCGGAAACCT GGGCGTACAC TGGCAGCTTA AAGAAGACAG CAACCCTGTT TTCGATAAGA CAAAGAACCG GGTTTACACC ACTTACAGAG ATGGTAACAA CCTGATGATC CAATCGTTGA ATGCGGACAG GACCAGCCTC AATGAAGAAG AAGGAAAGGT ATCTTATGTT TACAATAAGG AGCTGAAAAG ACCTGCTTTC GTATTTGAAA AGCCTAAAAA GAATGCCGGC ACACAAAATT TTGTCAGTAT AGTTTATCCA TACGACGGCC AGAAGGCTCC AGAGATCAGC ATACGGGAAA ACAAGGGCAA TGATTTTGAG AAAGGCAAGC TTAATCTAAC CCTTACCATT AACGGAAAAC AACAGCTTGT GTTGGTTCCT TAG
|
Protein sequence | MLTINLNLKD CMTTKIFKRI IVFAVIALSS GNILAQSSSI TRKDFDHINL EYSGLEKVNK AVAAGNYDDA AKALLAYYRE KSKAREPDFS NAEKPADIRQ PIDKVTREMA DKALVHQFQP HKGYGYFDYG KDINWQMWPV KDNEVRWQLH RVKWWQAMAL VYHATGDEKY AREWVYQYSD WARKNPLGLS QDNDKFVWRP LEVSDRVQSL PPTFSLFVNS PAFTPAFLME FLNSYHQQAD YLSTHYAEQG NHRLFEAQRN LFAGVSFPEF KDSPRWRQTG ISVLNTEIKK QVYADGMQFE LSPIYHVAAI DIFLKAYGSA KRVNLEKEFP QSYVQTVENM IMALISISLP DYNTPMFGDS WITDKNFRMA QFASWARVFP ANQAIKYFAT DGKQGKAPNF LSKALSNAGF YTFRSGWDKN ATVMVLKASP PGEFHAQPDN GTFELFIKGR NFTPDAGVFV YSGDEAIMKL RNWYRQTRIH STLTLDNQNM VITKARQNKW ETGNNLDVLT YTNPSYPNLD HQRSVLFINK KYFLVIDRAI GEATGNLGVH WQLKEDSNPV FDKTKNRVYT TYRDGNNLMI QSLNADRTSL NEEEGKVSYV YNKELKRPAF VFEKPKKNAG TQNFVSIVYP YDGQKAPEIS IRENKGNDFE KGKLNLTLTI NGKQQLVLVP
|
| |