Gene Information |
Locus tag | Phep_2927 |
Symbol | |
ID | 8254038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3489968 |
End bp | 3492973 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644936575 |
Product | hypothetical protein |
Protein accession | YP_003093187 |
Protein GI | 255532815 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
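The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (the gene sequence in this record ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(3489968, 3492973)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the computed lengths agree with the Gene Length and Protein Length fields.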
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000253968 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCTCCT ACCAACGCCT GAATAACCTT ACCGGATTTA TACTGTTTGG CATAGCCGCC GTAGTTTACT GGCTTACCAT GGAGCCTACA CTGAGTTTCT GGGACTGCGG TGAATTTATT GCCGCTTCCA GCAAATTAGA AGTGGGCCAT CAGCCCGGTG CGCCTTTGTT CCTGATGATT GGCAAACTCT TTTCTTTGCT GGCAATGGGC AATACCACTA AAATTGCTTA TTGGATCAAC TTCAGCTCAG TACTGTTTAG CGCCGCAACC ATTATGTTCC TGTACTGGAC CATTACTGCA CTTGCCACCA AACTTTACCC TGAAAAGAAA AGCAATACCC AGATCCTGAG CATCATAGCG GCGGGTGCCA TTGGTGCGCT GGCCTATACC TTTTCAGATA CCTTTTGGTT CTCGGCCGTA GAGGCCGAAG TGTATGCCCT GTCTACCCTG TTTACGGCCA TAGTATTCTG GGCCATACTA AAATGGGAAA ATGAGCCCGA CAACCGCTGG CTGGTGTTTA TCGCCTTCAT GGTGGGCCTG TCGATAGGTG TGCACCTGTT GAGTCTGCTT GCCATTCCGG CCATAGTACT GGTACACTAT TTTAAAACAA CGGCAAAGCC TGGCCTGAAG GGAACGTTTA AGGCATTGTT ATTTGGCTGC TTGCTGGTAG GCCTGGTACA GTTTGCCATT GTACAATATC TGGTGCTGAG TGCCGCACAG GCCGATCTTT TCTTTGTCAA CACCCTGGGC CTTGGTTTTG GTACCGGCGC CATGAGTTTT ATACTTCTGA TAGTTCTATT GATTGCCTAT GGCATTTACT ATTCTGTTAA ACACAAAAAA TATCAGCTTA ACCTGGCATT GGTATGCCTT GCCTTTGTGC TTTTTGGCTT CAGTTCCTAT TTCATGATCG TGATCAGGGC AAATGCCAAA CCGAACATCA ACCTGTCCAA TCCCGACAAT CCTTTTTCCC TGTATGGCTA CCTTGGGCGC ACCAATTACG GAGATACGCC ATTGCTGTAT GGCCGTACTT TTGACGCCGG GCAAACCGGC ATTAAAGAAA CGGGTACCGA ATATAGAAAG GGAGCTGATA AATATGAAGA ATCGGGTAAA ACATATAAGG CCGAATACGA TAAAAACCTG ATCTTCCCGC GCACCTATAG CCAGAAACCC AATCACATCG CCTTTTACCG GCAATGGCTG GGGCTTGGGG AAAATGAAAG CCCGAACCTG GCGCAAAACC TGAGCTTCTT TAGCACCTAC CAGGTAGGTT TCATGTACAT GCGCTATTTT TTATGGAACT TTGTAGGACG GCAGAACGAT ATCCATAGCC AGGGTAATTT TACCGACGGC AACTGGATCA CCGGGATCAA AAGTATGGAT GCCCTGCGCC TGGGCAATCA GGCCAAACTC CCGCCCTCCA TTACAAAAAA TGAAGGCAAC AATGTATATT ATGGCCTGCC CCTGCTGCTG GGACTGGCAG GAATGATCTA TGTATACCGT AAAAACAAAC AGGCTACACT CATTATAGCC ACCCTGTTCT TTTGCACCGG ACTGGCCATT ATCTGTTACC TGAACCAGGA CCCGATGCAG GTTCGGGAAC GCGATTATGC TTATGTAGGT TCATTTTATG CCTTTGCTAT TTTTATTGGT TTTGGTGTGC TGGCCATTCA GGAACTGTTT CAGCGTTTTG CAGCAGCTAA ACTCAGCCTG GCCATTGCTG TGCTGACGGG CCTGCTTGCC GCACCAGCAA TTATGGGCAT ACAGGGCTGG GACGACCACA ACCGTTCGGG CAAGCAAACG 
GCGCTCGATT TCGCCAGCAA TTACCTGAAC TCCTGCGCCC CCAATGCCAT CCTCTTTACC AATGCGGATA ACGATACCTA TCCCTTATGG TATGCACAGG AGGTTGAAGG CATCAGAACT GACGTAAGGG TGGTGAACCT GCAATTTCTG GCCGACAGCG ATTACATCAA CCAGATGAAA AAACAAGCCT ACCAATCGGC AGCACTGCCC ATTGCCATGC GGCCTGATCA ATACCAGAAA GGGGTACGCG ATTATTTTCC TTATATCGAT TACGGCTTTA AGGATAGTGT AGAGCTAAAA GACCTGCTGG CGGTACTTAC ATCAGACAGC AAGGAAGATA AGGTAGAAAT GCAGGGTGGC TCTTTTGAAA ACTTCCTGCC CACCAAAAGG TTAAAGCTCA GTATAGATGC CGCTCAGCTG GTCAGGACAA ATACCGTAGC TGCCAAAGAC CTGGATAAAG TAGTGTCACA GATGGAATGG GACTTCAAGA AAGATTTTGC TACCAAGGCC GACCTGGCCA TCTTAGATAT CCTGGCGCAT AACAACTGGG AAAGGCCGGT TTACTTTGGT TCTTCGTTGT CTGATGACAC CTATATCGGC CTGGACAAAT ACCTGCACCT GGAAGGTTAT GCCTACCGGC TGCTGCCCTA TAAAAAGGGG GCGGATGATC AACGCGATAA ATCGCAGGTA ACGAATTCTG AGGTCATGTA CCACAATACC ATGCAGAAAA TGAACTTTAA AGGTTTCCAT ACAGCCAGGT ACCTGGATCC TGAAACCCGA AGAGTAGCCA ATGACACCTG GGTTTTTCAG AACGCACTGG CGGGCAACCT CATCAATGAG GGTAAAAAGG CCATGGCACA GCAGGTGATG ACCAAAAGTG TAAGGGAACT GCCTTTAAAA CTTTATTCTA TCCACGACAC GCTAAACAGA CTGGAAACCA TCAGTAACCT GAACCATTTA AACGACCGGA AAACCGCAAG CTTACTGGGC CAACAAACCT TGTCATTTTT AGATGCGGAG CTGGGTTATA TCGCTTCCCT GTCGCCCGCC CTGCAGCGTG CCTCAATTGG CGACATACAG CTGGGTATGT ATGTTTTAAG CGGACTGGAT GAGTTAACCG CGAAAGGAAT TGACCCAAAG CTGAACCAGC AGATCAAAGC GAAATTTAAA GAGCTGGAAG GTATTTTTAG CCGGAACTTA GGCTAA
|
Protein sequence | MISYQRLNNL TGFILFGIAA VVYWLTMEPT LSFWDCGEFI AASSKLEVGH QPGAPLFLMI GKLFSLLAMG NTTKIAYWIN FSSVLFSAAT IMFLYWTITA LATKLYPEKK SNTQILSIIA AGAIGALAYT FSDTFWFSAV EAEVYALSTL FTAIVFWAIL KWENEPDNRW LVFIAFMVGL SIGVHLLSLL AIPAIVLVHY FKTTAKPGLK GTFKALLFGC LLVGLVQFAI VQYLVLSAAQ ADLFFVNTLG LGFGTGAMSF ILLIVLLIAY GIYYSVKHKK YQLNLALVCL AFVLFGFSSY FMIVIRANAK PNINLSNPDN PFSLYGYLGR TNYGDTPLLY GRTFDAGQTG IKETGTEYRK GADKYEESGK TYKAEYDKNL IFPRTYSQKP NHIAFYRQWL GLGENESPNL AQNLSFFSTY QVGFMYMRYF LWNFVGRQND IHSQGNFTDG NWITGIKSMD ALRLGNQAKL PPSITKNEGN NVYYGLPLLL GLAGMIYVYR KNKQATLIIA TLFFCTGLAI ICYLNQDPMQ VRERDYAYVG SFYAFAIFIG FGVLAIQELF QRFAAAKLSL AIAVLTGLLA APAIMGIQGW DDHNRSGKQT ALDFASNYLN SCAPNAILFT NADNDTYPLW YAQEVEGIRT DVRVVNLQFL ADSDYINQMK KQAYQSAALP IAMRPDQYQK GVRDYFPYID YGFKDSVELK DLLAVLTSDS KEDKVEMQGG SFENFLPTKR LKLSIDAAQL VRTNTVAAKD LDKVVSQMEW DFKKDFATKA DLAILDILAH NNWERPVYFG SSLSDDTYIG LDKYLHLEGY AYRLLPYKKG ADDQRDKSQV TNSEVMYHNT MQKMNFKGFH TARYLDPETR RVANDTWVFQ NALAGNLINE GKKAMAQQVM TKSVRELPLK LYSIHDTLNR LETISNLNHL NDRKTASLLG QQTLSFLDAE LGYIASLSPA LQRASIGDIQ LGMYVLSGLD ELTAKGIDPK LNQQIKAKFK ELEGIFSRNL G
|
| |
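The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short, dependency-free sketch. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings), and it strips the spaces that separate the 10-base blocks in this record. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not functions from an existing library.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: these assignments cover translation table 11 for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGATCTCCT ACCAACGCCT GAATAACCTT"
print(translate(prefix))  # MISYQRLNNL, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the Protein sequence and GC content fields to be cross-checked against the nucleotide data.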