Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2787 |
Symbol | |
ID | 8253895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3294884 |
End bp | 3297997 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644936433 |
Product | hypothetical protein |
Protein accession | YP_003093048 |
Protein GI | 255532676 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.595004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCTTGC TCATACTGGG ACAGTTTTTC GTATTGGCCG GGTATGCACA AAAAGTAAAT ATCCCCGAAC CACCAAAACC AATATTTAAA GGTAAAGAAG GTAAACTGAA TTACAGTCCG GACGAAAAAG GCAACCGGAT TCCCGATTTT TCTTATGCAG GTTATAAAGC CGGAGAGCAA CCCATACCAG AGGCTGTAGT AAAGGTTGTT GTGCCGGTTA AATCAGGCGA TGCTACCCTG AGGATCCAAT CGGCCTTAAA CTATGTTGCA GCTTTGCCCT TAGGCAAAGA TGGCTTAAGG GGAGCTGTAT TGCTGGAAAA AGGAAAATAT GAAGTTGGAG GTGCTTTAAA GATCAATGCT TCCGGCGTGG TTTTGCGTGG AAGCGGAATG GGGGAAAACG GAACGGAGAT ATTTGCAACA GGACTGGACA GAATGGGGGT ATTACGCATA GCTGGTAAAC CAGATCGTAT TAAAGAGGCC CCTGTAGCAG TTACAGATCA ATATGTTCCG GTAAATGCAA TGAAGGTTAC CCTTGCAAAT GGAGGATTTA AAAAAGGTGA TCAGGTAATT GTACAACGCA TATCCTCTAA AAACTGGATT GATTTGATTG GAACAGACCA TTTTGGCGGG GGCATTACCT CACTGGGCTG GAAAGCAGGA CAACGTGACA TTTATTGGGA CAGAAAAGTG ATTGGGGTTG AAGGGAATAC TTTATTATTG GATGCACCCT TAACTACAGC GCTGGATGCT GTTTATGGTG GGGCTACTGT ATCAAAATAT AGCTGGAATG GCAGAATTTT CAATTCCGGT GCAGAAAATA TAAGATTTAC ATCGGGCTTT GATGATAAAA ACCCTAAAGA TGAATACCAC CGCTGGACGG CCATTTCTAT AGAAAATGCC ACAGATGCAT GGGTACGCCA GGTTGTTTTT GAACATTTTG CAGGTTCAGC AGTAACTGTT CAGGAAACTG CAAACAGGAT AACTGTGGAA GATTGTAAAT CGCTGGCGCC GGTTTCGGAG ATTGGTGGCG AACGCAGATA TACTTTTTTA ACTACAGGAG GGCAAACACT GTTTCAAAGA TTGTATTCTG AATATGCTTA TCATGATTTT GCAGTTGGCT TTTGTGCTCC CGGTCCAAAT GCTTTTGTCC AGTGCCAGGC TTATCTGCCA TTTAGCTTTA GCGGAACAAT TGACAGTTGG GCATCAGGTG TTTTATTTGA TATTGTTAAT GTGGACGGAC AGGCCCTGAG TTTTATGAAC AGAGGGCAGG ACGGACAAGG TGCAGGCTGG TCGGCCGCCA ACAGCGTATT CTGGCAGTGT ACAGCGGCCC GGGTAGACTG TTATGCTCCG CCAACTGCGC AGAACTGGGC ATTTGGTACC TGGGCACAAT TCTCGGGCGA CGGTTATTGG GATATGTCTA ACGAGCAGAT CCAGCCGCGT AGTTTGTATT ATGCCCAATT GAAGGACAGG CTGGGAAAAC AGGCCGATGA ACGGACTTTT GTAATGCCTG TAGAAACGGA AGCTTCAAGT AGCCCGCCAG TGGATGTAGC TCAGAAGCTA ACCAGACTGG CTGATAAACC GGCCATGCTG TTAACGGAAT ATATAGATCA GGCAACCGAA AGACAAAAGA TTTCAACTGA TACGCGCAAT GCAAAAAATA TTGACAAAAT AGGTGTTGAG AGAATTAATA CTCCGGCTAA GGCCAGTGCG ATGCAGATAA GCAATGGTTG GTTGCGTCGG GGCAATGCCC TGGTAACCGG AAACCGTGCA GACGTCCAAT GGTGGAATGG CAGCGCAAGG CCATATGCGC TTAAAGGCAT GAAAATGCAC ATCACCCGTT TTGTACCTGG CCGTACAGGT AAGGGGCTGA CAGACGATCT GGAAGAAATA ACTGACTCTA TGCAAAAAGG ATCAGTAAAG ATCTTAGACC ATAATTATGG TTTGTGGTAC GACAGGAGAC GTGACGACCA TGAGCGGATC AGAAGAATGG ACGGAGAAGT ATGGACGCCA TTTTATGAAT TGCCTTTTGC ACGCAGCGGA CAGGATAAAG CATGGGATGG ACTGAGCAAA TACGACATTA GCAAATACAA CCCATGGTAT TGGGGCAGGT TAAAACAATT TGCAGACCTT GCTGATCAGA AAGGCCTGGT ACTGATCCAC GAAAACTATT TCCAGCATAA CATTATAGAA GCTGGTGCGC ATTATGCCGA TTTTCCATGG CGTACGGCAA ATAACATCAA TAATACCGGT TTTCCGGAGC CAGTACCTTA CGCGGGCGAC AAGAGGATAT TTATGGCAGG GCAATTTTAT GACATCAGCA ATGCTGAGCG TAAGGCACTG CACCGTGCCT ATATCCGTAA ATGCCTTGAT AATTTTAAAG ACAATACCGG TGTAATCCAG TTGATCGGTG CAGAGTTTAC CGGGCCATTA CATTTTGTAG AGTTCTGGAT AGATACCATT AAAGAATGGG AAAAAGAGAC AGGTAAACAC CCGATTATTG GTTTAAGTAC CACTAAAGAT GTGCAGGATG CCATATTGGC CGATAAAAAC AGGGCAGGAG TAGTCGATCT GGTCGACATC CGTTATTGGC ATTACCAGGC TGATGGTTCT GCTTATGCAC CACAAGGTGG ACAAAACCTG GCTCCTCGCC AGCATGCACG TTTGCTGAAA CCTAAAAAAA CATCTTTTGA TCAGGTATAC CGTGCTGTAG CAGAATATCG TACCAGATAT CCCGAAAAGG CAGTGATCTA TTCAGGTGAT GGTTTTGATG CTTTTGGCTG GGCCGTTTTT ATGGCTGGCG GATCTTTGTC AAATGTTCCG GCTGCCAATA ATGCTTCGCT TTCGGGTGTG GCCACAATGA AACCATTTAA TTTGGCCGGC CGGTCTTCAG GTCAGTATGC TTTGGCTAAT CCAGATGGCG CATATCTGTT GTACAACAGC TCTTCCGTTC CTGTAAAACT TGACCTAAGT AAAGCTAGAG GAAACTATGT GGTAAAATAC ATCAACCCGC GCAGCGGCCT GGTAGTTAAG GAAGAAAAGA TAAAGGGGGG AGCCGCTAAA GAATTTAATA AGCTTTCATC GGGAGACGAA GTCGTTTTTA TCAATAAAAT TTAA
|
Protein sequence | MGLLILGQFF VLAGYAQKVN IPEPPKPIFK GKEGKLNYSP DEKGNRIPDF SYAGYKAGEQ PIPEAVVKVV VPVKSGDATL RIQSALNYVA ALPLGKDGLR GAVLLEKGKY EVGGALKINA SGVVLRGSGM GENGTEIFAT GLDRMGVLRI AGKPDRIKEA PVAVTDQYVP VNAMKVTLAN GGFKKGDQVI VQRISSKNWI DLIGTDHFGG GITSLGWKAG QRDIYWDRKV IGVEGNTLLL DAPLTTALDA VYGGATVSKY SWNGRIFNSG AENIRFTSGF DDKNPKDEYH RWTAISIENA TDAWVRQVVF EHFAGSAVTV QETANRITVE DCKSLAPVSE IGGERRYTFL TTGGQTLFQR LYSEYAYHDF AVGFCAPGPN AFVQCQAYLP FSFSGTIDSW ASGVLFDIVN VDGQALSFMN RGQDGQGAGW SAANSVFWQC TAARVDCYAP PTAQNWAFGT WAQFSGDGYW DMSNEQIQPR SLYYAQLKDR LGKQADERTF VMPVETEASS SPPVDVAQKL TRLADKPAML LTEYIDQATE RQKISTDTRN AKNIDKIGVE RINTPAKASA MQISNGWLRR GNALVTGNRA DVQWWNGSAR PYALKGMKMH ITRFVPGRTG KGLTDDLEEI TDSMQKGSVK ILDHNYGLWY DRRRDDHERI RRMDGEVWTP FYELPFARSG QDKAWDGLSK YDISKYNPWY WGRLKQFADL ADQKGLVLIH ENYFQHNIIE AGAHYADFPW RTANNINNTG FPEPVPYAGD KRIFMAGQFY DISNAERKAL HRAYIRKCLD NFKDNTGVIQ LIGAEFTGPL HFVEFWIDTI KEWEKETGKH PIIGLSTTKD VQDAILADKN RAGVVDLVDI RYWHYQADGS AYAPQGGQNL APRQHARLLK PKKTSFDQVY RAVAEYRTRY PEKAVIYSGD GFDAFGWAVF MAGGSLSNVP AANNASLSGV ATMKPFNLAG RSSGQYALAN PDGAYLLYNS SSVPVKLDLS KARGNYVVKY INPRSGLVVK EEKIKGGAAK EFNKLSSGDE VVFINKI
|
| |