Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1697 |
Symbol | |
ID | 8252799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2006807 |
End bp | 2008207 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644935349 |
Product | hypothetical protein |
Protein accession | YP_003091970 |
Protein GI | 255531598 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.472344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0751505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATGT TTTCAGGTAA TCATTTTAAA ATCAAGTATA AACAGGCAGG TATATGTTTG CTTTTAGTAA CGGCTATGAG TTGTCAGCAA AAAACTTCAG GATCTGATGA ACTAAAGATC AGTTTACTTA CACTGGATCC GGGGCATTTT CATGCTGCCC TCATACAAAA ATCGATGTAT AAGGAGATTA GCCCTGTGGT ACATGTGTAT GCACCTGAAG GACCGGAGCT GGAAGCCCAT TTGAAGCTGA TTGAAAAATA CAACAGCAGG GCAGAAGATC CTACTAAATG GCAGGAAGTG GTTTATAAAG GAGAGGATTA CCTTCATAAG ATGCTGGAAG AAAAGAAAGG GAATGTGGTA ATTTTAGCAG GCAATAACCA AAGGAAAACA GAGTACATTC AAAAATCCAT AGATGCGGGT ATAAATGTAC TGGCTGATAA ACCGATGGTT ATTGATGGCA AAGGATTTGA ACAACTGGAA AAGTCGTTTG AATCTGCACA AAAAAATAAG GTAATATTAT ACGACATTAT GACTGAGCGC TATGAAATTA CCAATATGCT GCAAAAGGAA TTTTCATTAC AACAGGATGT ATTTGGAACA CTTGAAAAAG GGACTGCCGA AAAACCAGCT ATTACTAAAG AAAGTGTACA TCATTTCTTT AAAAATGTTT CGGGAGCCCC ATTGATAAGG CCACAATGGT ATTTTGATGT AGACCAGGAA GGAAACGGAT TGGTTGACGT AACTACACAT CTTGTAGATA TGATCCAATG GGAATGCTTT TCCGATCAGC AGATCGACTA TAAAAAGGAT GTGAATATGC TGTCTGCAAA ACGCTGGACC ACTCAGATTA CCCCTTCACA ATTTAAAAAA AGTACGGGTG CAGGCAGTTA TCCTGCTTTT CTAAAAAAAG ATGTAAAAGA TAGCTTGCTG AACGTTTATT CAAATGGAGA AATGAACTAT ACCCTTAAAG GAGTACATGC AAGGGTTTCC GTGATCTGGA ATTTTGAAGC ACCTGAAGGT ACAGGAGATA CACATTATTC GGTAATGCAT GGGACCAGGG CCAGTCTGAT TATAAAACAG GGACCTGAAC AGCAATATAA GCCTACTTTA TATATAGAGT CCCAAAAGGT TGATGACAAA GACTATGCTG CAGCTTTAAA GCAAAGTGTG GAAAAGATAG CCAAAACCTA TCCGGGATTA GAGCTGAAAG CTTATAAAGG AGGCTGGGAG GTAGTAATTC CGGAAAAATA CAAAGTGGGG CATGAGGCGC ATTTTGCTGA AGTAGCTAAA AAATATTTAG GCTTTTTAAA AGAAGGAAGA CTTCCTGAAT GGGAAAAGGC TGCTTTATTG AGTAAATACT ATACCACTAC AAAAGCCCTG GAATTTGCCG TAAAAAAATA A
|
Protein sequence | MSMFSGNHFK IKYKQAGICL LLVTAMSCQQ KTSGSDELKI SLLTLDPGHF HAALIQKSMY KEISPVVHVY APEGPELEAH LKLIEKYNSR AEDPTKWQEV VYKGEDYLHK MLEEKKGNVV ILAGNNQRKT EYIQKSIDAG INVLADKPMV IDGKGFEQLE KSFESAQKNK VILYDIMTER YEITNMLQKE FSLQQDVFGT LEKGTAEKPA ITKESVHHFF KNVSGAPLIR PQWYFDVDQE GNGLVDVTTH LVDMIQWECF SDQQIDYKKD VNMLSAKRWT TQITPSQFKK STGAGSYPAF LKKDVKDSLL NVYSNGEMNY TLKGVHARVS VIWNFEAPEG TGDTHYSVMH GTRASLIIKQ GPEQQYKPTL YIESQKVDDK DYAAALKQSV EKIAKTYPGL ELKAYKGGWE VVIPEKYKVG HEAHFAEVAK KYLGFLKEGR LPEWEKAALL SKYYTTTKAL EFAVKK
|
| |