Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0935 |
Symbol | |
ID | 8252029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1088455 |
End bp | 1090125 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644934590 |
Product | endonuclease |
Protein accession | YP_003091219 |
Protein GI | 255530847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.16421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000880846 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATACATT TTAACCAACT GAAAGCAAAT ATAAGAAGTC TGTTTGAGAC GTTTAAAAAA GCAATGAACT ATGATAGTGC TATGTCGCAG GCAATTGATG CGGTATGCTC TCCTTTCTCC TTTGAAGATG TAATTCTTCA GGGAAAGGCA TTTGCGTCAT TAAATGAGTT AAAAGACTAT ATCCGTGAAG AGCTTGAAGA AGAGGTAAAC CTGATAGTAG ATGATGGAAC ACATTTTTCA TTAACAGATA ATGAAGGACA TATAGACTGG TACCGATCCA AAAAGGCTGA TGATGAAATC AGATTTCGTT TCTGGAACAG ATACCGTAAA TATCTCACAC ACATCAAGGG ATGGGCTGAA TCTTCGGTCG ATAAAATCGA TACTATATCT GACGAAATTT TGGAGAACAT TGAGGATCCC ACTATTCCTA ACCGGGCGTT TGACCGCCGG GGGCTTGTTG TCGGATACGT GCAATCCGGT AAAACAGCAA ACTTTATGGG CATTGTCAAT AAAGCGATAG ATTCTGGTTA CAGAATCATT ATAATACTTG CCGGTACGCA GGAAAGTTTA CGACAGCAAA CCCAGGAAAG AATTGACGAA GAAGTGTTAG GTATTGACAC CAATCCTGAA GAGAAACAGA AAAGAATAGG GGTGTCCACG CTTCCGGGAG AAGCTTATAT TCCGATTGAT TATTTCACGG AATCCAACCT GAAACCCAAT AAATCAGGTG ATTTTAATAT CAGGAAATCA AGAGGAACGC CGCCAAGCAG TGAACGGCCG ATTTTGTTTG TAGTGAAAAA GAACAAATCC ATACTTACCA ATCTCAGGAA ATACCTGGAG CACTGGATTA ATATTTTTGA CGATGATCTT ACGTATAAAA ACGATACGGT TAATCAGTTC AATAATCTAC CCTTACTGAT CATTGATGAT GAGTCTGATC AGGCCTCAAT CAATACCAAG AGAACCGTCA GTCCCGATGG GGACGAAGTT GACCCCACTG CCATCAATTA CTGTATAAGA GAAATTCTGA ACCTTTTCCG TCAAAAAGTT TATATAGGTT TTACTGCCAC ACCGTTTGCC AATATATTCA TCAGACATGA TATGGATCAC AGGGTACTCG GAAAAGATCT TTTTCCTTCA GCTTTTATTA AAACCCTGGG TGCCCCATCC AACTATTTTG GTCCAAAGGA AGTTTTTGGC TTAAATAACG ATGCTGATTC CGGATTGCCA ATCTATAGAA GTGTTGTTGA TGCGGGAGGA TTGCATACTG TTTTACCAAT TGGCCATAAA GCAGACTATG TATTGACGGA GCTACCACAA ACATTAAAGC TGGCATTGAA ATCCTTTCTG ATTTCATCTG CTGTACGCTG GTCAAGAGGA CATGATAAAA AACACAATAC AATGCTGGTC CATTGTACCA GGTATAATTT TGTACAATCT GCACTGGCAG AACTGATCAA CGACGAAATG AGTTTGATCC GGACAGCTAT TCTGGCTGAT GACGTTGAGG TTTTATCAGA AATGCAGCAA TTATATATCA CCGACTTTAT CCCCACTTCC GGAGAAATGG ATAAGAACAC GCCTGAATGG ATAGACATAG TCCCGTTCAT TAAAAAGACC GTGAAAAAAC TGGAACGGGG AATGCCGCAT TATAAATGGT ACTGTAGGTG A
|
Protein sequence | MIHFNQLKAN IRSLFETFKK AMNYDSAMSQ AIDAVCSPFS FEDVILQGKA FASLNELKDY IREELEEEVN LIVDDGTHFS LTDNEGHIDW YRSKKADDEI RFRFWNRYRK YLTHIKGWAE SSVDKIDTIS DEILENIEDP TIPNRAFDRR GLVVGYVQSG KTANFMGIVN KAIDSGYRII IILAGTQESL RQQTQERIDE EVLGIDTNPE EKQKRIGVST LPGEAYIPID YFTESNLKPN KSGDFNIRKS RGTPPSSERP ILFVVKKNKS ILTNLRKYLE HWINIFDDDL TYKNDTVNQF NNLPLLIIDD ESDQASINTK RTVSPDGDEV DPTAINYCIR EILNLFRQKV YIGFTATPFA NIFIRHDMDH RVLGKDLFPS AFIKTLGAPS NYFGPKEVFG LNNDADSGLP IYRSVVDAGG LHTVLPIGHK ADYVLTELPQ TLKLALKSFL ISSAVRWSRG HDKKHNTMLV HCTRYNFVQS ALAELINDEM SLIRTAILAD DVEVLSEMQQ LYITDFIPTS GEMDKNTPEW IDIVPFIKKT VKKLERGMPH YKWYCR
|
| |