Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0567 |
Symbol | |
ID | 8251654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 679040 |
End bp | 680401 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644934215 |
Product | NHL repeat containing protein |
Protein accession | YP_003090851 |
Protein GI | 255530479 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00166551 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT ACCGAAAGAG TATGGATAGG TGTTTTGCCG CCTTCCTATG GGTATGCTTC AGTTCCTGTT GTTTTGTTTC CTGTAAAAGC GAAAAGGAAG GCCCTAGTAA AACCTATGAT TCTTCAAAGC CCGTTGTACT CTCCTCTTTT TCTCCGAATG AGGGAGGAGC GCGCGATAAA ATACTTTTAG ACGGGGAAAA CTTTGGAAAT GACCCAAGTA AAATCAAAGT CTATTTTAAC CATGCAAAAG CTTCAGTTAT CTCGTCAAGT GGTAACCGGA TCTATGCCAT TGTACCCCGG CTTCCAGGTG ATAATCCTAA GATTTCTGTG GTTGTGGGTA CTGACTCGGT TGTTTATAAA GACACTTTTA CTTACCATAT TCAGGCTCAG GTAAGCACAG TTACGGGTAA CGGGCAAAAA AACTTTAAAC CCGGAACGCT CTCTGAAGCT GAAGTTTATG GTAAATACCT GGAACTGGAT GCCGAGGGAA ATATATTTAT GTCCTGGCGG GATGGTGGCA ATCCGGCTAC ATTTGGTGTA GCCAGGATAA ATGAAAAGCA AAATATAGTT ACTCCACTGA TCGAGTCGGT CGCTGCCGGT CGGATCTTAT ATGCCAACGG CTTAACTGTG GATCGTGTTA CTGGTATGCT AACAGCCGCT CATGAGTCTA CAAAGGAGGT TATTTTTACA TTCGACCCCA GGGAGGCCTG GTACCCGCGT CAGCGTAACA TCAAATACTC TACGGCAGAT TATAATTCCA TTGTTACTGC GGATCTGTAC AAAAACTTCG TGACTTATTG TCCGTATGAT GGGTATCTTT ACACGAGATA CAGAGATGGA AAGGTTGCCA AAATAAATCC TCAGACATTC GAAGCAAAAA TTGTTCACCA GGGACCTTAT GGATCGCAGT ATGGTCAGGC CATTAATCCA GTAAAACCAT GGCTCCTGTA CATCACGCTC ACTACCAATG CCACACCGAC CAATTTCAGA CAAGGTATTA TGGTACTGGA TCTTCGTGAC CCTAATGGAT CAGGCGGCTT TAAACGCCTC AATGCCCCTG GAGGTAGTGC CTTCCGCGAT GGGCCGTTGG CAGACGCACT GTTCAACGAT CCTAAAGAGA TTAAGTTTGA CAACAGCGGA AACATGTTTG TTGCAGACTA TGGCAACCAT TGTATCCGTA TGGTATCGGC TGATAATATC GTAACAACGG TAGCAGGCCA ACCGGGCAAA TCAGGCTATA AAGATGGCGG ACCGGTAGAA TCTCTATTTA ACCAACCCTG GGGAGTGGCT GTCAATGAGC AAGGTGACAT TTATATTGCA GACTGGAGTA ACGCCAGGAT ACGCAAATTA GTTATTGAAT AA
|
Protein sequence | MKNYRKSMDR CFAAFLWVCF SSCCFVSCKS EKEGPSKTYD SSKPVVLSSF SPNEGGARDK ILLDGENFGN DPSKIKVYFN HAKASVISSS GNRIYAIVPR LPGDNPKISV VVGTDSVVYK DTFTYHIQAQ VSTVTGNGQK NFKPGTLSEA EVYGKYLELD AEGNIFMSWR DGGNPATFGV ARINEKQNIV TPLIESVAAG RILYANGLTV DRVTGMLTAA HESTKEVIFT FDPREAWYPR QRNIKYSTAD YNSIVTADLY KNFVTYCPYD GYLYTRYRDG KVAKINPQTF EAKIVHQGPY GSQYGQAINP VKPWLLYITL TTNATPTNFR QGIMVLDLRD PNGSGGFKRL NAPGGSAFRD GPLADALFND PKEIKFDNSG NMFVADYGNH CIRMVSADNI VTTVAGQPGK SGYKDGGPVE SLFNQPWGVA VNEQGDIYIA DWSNARIRKL VIE
|
| |