Gene Information |
Locus tag | Phep_0719 |
Symbol | |
ID | 8251807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 832221 |
End bp | 834584 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644934368 |
Product | hypothetical protein |
Protein accession | YP_003091003 |
Protein GI | 255530631 |
COG category | |
COG ID | |
TIGRFAM ID | |
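The coordinate, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 832221
end_bp = 834584
gene_length_bp = 2364
protein_length_aa = 787

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks hold for the values listed in this record.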
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAG GAAAAAACGC GGTACGATTT TTTAATTTAC TTGTTTTATT TATAATCCTG TTTTCCAATA ACGCTGTTAA GGCACAAAAA CAACCCGTTT ATAGCATCTC CCAGGCCAAA CAGTTCAAGA CGCTGATGGA TACGGTAAGG AAAGATATGC CGGTAGAGAA ACTATACCTG CACCTGGACA AGTTCAATTA CCTTGCGGGC GATACACTTT GGCTTAAGGC CTATTTACTG GAAGGACCTT TTTTAATGCC ATCCAGCAAA AGTGGACTGT TTTATGTTGA ACTGGTAAAT GCTGAAAATA AAGTATTGAA ACGGATGAAA TTTCCGGCTA AATATGGGAT GGGTTGGGGT AACATCAGCC TGGATGAAAG GGACTTGCCA ACAGGCAATT ACCTGCTGCG CGCCTACACC AACTGGATGC TTAATTTTGG TGAGGCCGCT GCTTATACCG CAAACATTTA TATCAGTAAT GCAACGTTAC CGGGCAATGC TGCAGTTTTA AAAAAGGCTG TTAAAAACCT CAATACACCT CTTATTCCAA ACACCGGAAC TACTTTAGGC ATAAATAGTT CTCCGGAAAA AGCAGATATG GAACTGACCA TTACTGCTGA AGGGGAAGCT AAAAATTTAT CAGGTTATTA CCTTGTTGGC CAGTCCAGAG GTGTAGTTTG CTATGCCATT CCGGTTAACT TAAAAGACGG AAAGATTGTA AACGCTGTTC CAAAGAGCCT GTTCCCTACC GGTATTGCAA GGTTTACCCT TATTGATAAA AATCTGAAAG CACTGAATGA AAGGATGATC TATATTGATC AGCACGACCA GCTGAAGGTC AACGTCAATA CGGTGCAGGC CGTTTACAAA AGCAGGGACA GTATTGCGCT GCGGATAAAG GTTACAGATA AGAACAATGA GCCTGTACAG GGAAGTTTCT CCCTGTCGGT AACCGATGAC AGCCAGGTTG CTCAGGGCCC TACTAAAAAC GGGGACCTGA ATACCTATAT GCTGCTTTCC TCAGAACTGC GCAATCAGCC TAAAAGTGCA GATGCATACA GCAACGGTAC TGTTGCTGCC CAACAGGCAT TGGATAGTCT GGCACTTACC GCTGGTTGGT TGGGCTATAA CTGGAACGAG CTGTCAAACT ATAAAATACC TAAATATAGC GCCGAACCGG AATTTATGAT TACAGGAAAA GTAAGCTCAA CTTTTTCAGG ATTGGCGGGG GCAAAAGTTA GTTTATTTGT AAAAAAACCT CTACTGTTTA TGGATACCGT TGCCGGCCCA GATGGAAGGT TCATTTTCAA AAACCTGCCC ATTGCAGATA CGGCAGTTTA TAAAATACAG GCCACTAATA AAAAAGGTAA AAACTTCTTT GTGAACCTGG AAGTAGATGA ATGGAAACCG CCCGTGTTCA GCCCTTTACT TGTAAATTAT TCAACCGAGG CCCAGGTAAA AGACCCCGCT CTTGAGCAGA AGGTAGAAAA AGCCCTTGCG CTAAAACAAC AGCAGGATAA ATCTACCGGA AAGCTATTGA ATGAAGTGAA CATTGAAGCA AAAAAGATAG TCAAGGGTTC ACATAATTTG AATGGATCAG GCGAGGCCGA TCAGGTTTTA ACGGAAAGCG ACCTGTTAAA AGAGGGTAAA AGGCCATTAT ACGAGTTGCT AATGGCACGT ATACCCGGTT TGGTTATGGG CAGCTATTTA TATCCCCCAT CTAAAATAAG AAAGTTTGGC TTGAAGCTTA AAAATCAGCT GGTTAAAATT ATCATAGATG GTATAGACCT GGACCAGAGC 
TACGAGTACT GGCAGCTGAT GGCAGATGCC GATGATTCCG CAGAAGGTTT GCAGGAGCGA TATACGCACA TCAAAACGAA CCTTGACAAT TTTACAGCCG AAGATGTGAA AGGAATAGAG GTAATGTACA ATGCTTCTTA TAATGTAAAA TACAACAATA AGTTTTTAAG TACGGCCGAA ACCTCATTTA GCGGAAAAGG TGGGCTAACC GGAGTTGGTG GTGCTTCGGG CATAGATTAT ACCTATATAG AAATCACCAC CCGAAATGGC AGAGGGCCTT TCATGAAACA AACACCCGGA ACTTACCTGT ATAAACCGCT CGCTTTTTCA CTGCCAAAAA AATTCTATAG CCCAAAATAC CTTAACAAGG ATAGTAGTGT AACCGATATC CGTTCTACTA TTTATTGGGA ACCCAATATC ATTACCGATG AAAAAGGGGA AGCCACTGTA TCATTTTATG CCGCAGGGCA ACCGTCCAGT TATACCATTG TTGTGGAAGG GAGCGATATG AACGGACAGC TGGGCTCGGC ACGCATGCCT GCCCTTATAA AAATAGCGCC TTAA
|
Protein sequence | MKQGKNAVRF FNLLVLFIIL FSNNAVKAQK QPVYSISQAK QFKTLMDTVR KDMPVEKLYL HLDKFNYLAG DTLWLKAYLL EGPFLMPSSK SGLFYVELVN AENKVLKRMK FPAKYGMGWG NISLDERDLP TGNYLLRAYT NWMLNFGEAA AYTANIYISN ATLPGNAAVL KKAVKNLNTP LIPNTGTTLG INSSPEKADM ELTITAEGEA KNLSGYYLVG QSRGVVCYAI PVNLKDGKIV NAVPKSLFPT GIARFTLIDK NLKALNERMI YIDQHDQLKV NVNTVQAVYK SRDSIALRIK VTDKNNEPVQ GSFSLSVTDD SQVAQGPTKN GDLNTYMLLS SELRNQPKSA DAYSNGTVAA QQALDSLALT AGWLGYNWNE LSNYKIPKYS AEPEFMITGK VSSTFSGLAG AKVSLFVKKP LLFMDTVAGP DGRFIFKNLP IADTAVYKIQ ATNKKGKNFF VNLEVDEWKP PVFSPLLVNY STEAQVKDPA LEQKVEKALA LKQQQDKSTG KLLNEVNIEA KKIVKGSHNL NGSGEADQVL TESDLLKEGK RPLYELLMAR IPGLVMGSYL YPPSKIRKFG LKLKNQLVKI IIDGIDLDQS YEYWQLMADA DDSAEGLQER YTHIKTNLDN FTAEDVKGIE VMYNASYNVK YNNKFLSTAE TSFSGKGGLT GVGGASGIDY TYIEITTRNG RGPFMKQTPG TYLYKPLAFS LPKKFYSPKY LNKDSSVTDI RSTIYWEPNI ITDEKGEATV SFYAAGQPSS YTIVVEGSDM NGQLGSARMP ALIKIAP