Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0581 |
Symbol | |
ID | 8251668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 699005 |
End bp | 702034 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644934229 |
Product | hypothetical protein |
Protein accession | YP_003090865 |
Protein GI | 255530493 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.272409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATT CAAAAATCAA TAATCTTACC GGCTGGTTTT GCTTCCTTGT AGCCGCTGTA ACTTATATTT TAACATTAGA GCCTTCCGTT AGTTTTTGGG ATTGTGGCGA ATTCATTGCT TCGGCCTTTA AAATGCAGGT TGTTCATCAG CCCGGTGCCC CATTGTTTTT AATGATCCAA AGGTTCTTTT CTTTGTTTGC ATTGGGCGAT GTTCAGAAAG TAGCTTATTT TATGAACGTA GGTTCTGCCA TAGCCAGTGC TGCCACCATC CTGTTTTTAT GCTGGACCAT CACTGCCCTG GCCAAAAAAC TGCTGGTTAA AGAAAATGAA GAGATCAGCA GATCTACAAT GATCTCTATC ATGGGTGCAG GAATCGTAGG TGCATTGGCT TATACTTTTT CTGACAGTTT CTGGTTTTCT GCTGTTGAAT CTGAAGTGTA TGCCTTGTCA TCCTTATTTA CAGCTATAGT TTTCTGGGCC ATTTTAAAAT GGGAAGCCCA TGCCAATGAA AAAGGTGCCG ACAAATGGTT GTTGTTTATT GCCTATATCA TGGGGCTTTC CATTGGTATT CACTTGTTAA ACTTACTAAC CATCCCTGCC ATTGCTTTTG TCTATTATTT CAAAAAGACA TCAAAAACAA CTACCAGCGG CATCATTAAA ACCGGAATTA TAGGAATACT TATACTTGCT GTTATCCAAT ACGGAATTAT ACAATACCTG GTTTCATTTG GCGCCTATTT CGATCTTTTC TTTGTAAACA CCTTGGGAAT GGGCTTTGGT ACCGGGGTAA TGTGCTTTGC CATCCTGCTA ATCGGCGGCC TGGTATGGGG CATCCGTTAT TCTATCAAAC ACCAGAAAAG GGTGCTGAAT ATCGCCTTAC TTTCTACAGT TCTGATCATA TTTGGGTACT GCTCTTTTGC CATGATCATC ATTCGGGCCA AAGCTGATCC AAACCTGAAC AACAGTGATC CTGATAATGC TTTTTCATTC CTGAGCTATT TAAACCGGGA GCAATATGGC GACAGGCCCC TGTTGTTTGG CCCTAACTAT AATTCTCAGA AGGTTAACCT TACACAAGGC AAAACTTTAT ATAGAAAAGG TGCCGAGAAG TATGAAGCAG CGGGCAGAAA AACAGATTAT GAGTACGATA GAACTACGCC TTTTCCGAGG ATGTATAGTG ATGACCAGCG GCATATAGGT TATTATAAAG ACATGATGGG CTTTAGTGAT GATCATTTCC CTAATCTGTT TGACAATATT GGATTCCTGT TCAAGTACCA GATAGGTCAG ATGTACATGC GGTATTTTAT GTGGAATTTT GTAGGCAGAC AAAATGACGA TCAGGGACAG GGCAGCCTGT ATGAAGGACA ATGGTTAAGC GGCATAAAAC CTATTGATGC TTTAATGCTG GGCAATCAGA AAAACCTGCC CCCTTCTATA ACTGACAGTA ATGCTTATAA CCGCTTCTTC TTTTTACCAT TAATACTGGG GTTATTGGGT GCCATATGGC ATTTTAAACG CAACCAGAAA GATGCCGGAA TAGTAGCCCT CCTGTTCTTC TTTACAGGAC TGGCCATTGT ACTTTACCTT AACCAAAAAC CAATGGAACC GAGAGAAAGG GACTATGCTT ATGCAGGCTC TTTTTACGCC TTTGCCATTT GGGTAGGTTT GGGTGTACTG GCCATAAGGG AATGGTTGTT CAAAAAATTA AGCCCTGCAA CCGGGGCTGT TCTGGCTACC GTAGCGGGTT TATTTGCAGC ACCCGTGATC ATGGCGGCCC AGGGATGGGA TGATCATGAC CGTTCAACCA AAATGGTAGC CCATGACATT GCAGTCGATT ATCTGCAGTC GTGTGCCCCA AATGCAATTC TTTTCACTTA TGGCGACAAT GATACCTATC CTTTATGGTA TGCCCAGGAG GTAGAAAATA TACGCCCGGA TATCCGCCTG GTAAACTTAA GTTTATTTGA TACCGACTGG TACATCAATG GTATGCGGCA TAAACAAAAT GAGTCGGCTC CTTTACCTAT ATCCATGAAG CCTGAGCAAT ATGTGCAGGG CGTAAGAGAT GTGATGTATT ATCAGGATTA CAAAGTTGCC GGCCCTGTAG AATTAAGTAA TATCCTTGCC ATCCTTTTAT CAGATGATCC GGAAGATAAA TTACCACTGC AGGATGGCAG CAAGGAAAAC TTCATCCCTA CCAAAAACTT CAAACTTACT GTAAACCGTG CTGATGTACT TAAAAATGGC GTGGTAAGCG CTACCGACTC CAGTAAAATC GCACCGGCAC TGGAATGGAC CTTTAACAAA AACTATGTAA CCAAAGGTAC ATTGGCCATG ATTGATATTT TGGTGCATAA TGACTGGAAA AGGCCTGTTT ACTTTGCCAG CACGGTGCCT TCAGACCAAT ACAATGGCCT GGACCAGTAT TTATATAATG AAGGTCTTGC CCTGCGCCTG CTTCCTTTAA AACCTGATAC AGCGGCAAAC AGATCAGAAC TGATCAATAC CCCTGTACTT TATAAAAATG TAATGGACAA ATTTGTTTGG GGCAATGTAA AAAATGCAAA ATATCTTGAT CCACAATCTT CTGATGACAT CTCGATCTTT ACCAATGTAT TCAACAATAC CATTACCGGT TTAATTAAAG AAGGTAAAAC AGCAGATGCA AAGAAAGTTG TAAACCGTTA TTTTGAAGTA ATGCCGGAAA GATTTTATGG CATGCGTTCT ATGATGGGCA CTTATTTTAT GGCTGAAAAC CTTTACCTGC TTAACGAAGC ACCAAGGGCA AATGCATTAA TTGAAAAATC GGCCGCTTAT ATCCAGAAAG AACTGACCTA TCTGGCCGAT GTTTCTGAAA GCAAGCGGAG GTTTATAGGC AATCAGAATG TTCAGCTCGG ATTATCCTTC CTGAACCAGA TGGCCAGAAC TACTGCCCAG TATAAGCAGA CAAAACTGAG TGAAAGCTTA ACCCGTCAGT TTGAAGGAAT GGAAGCCAGG TTCTCAATGT ATTTCTCACA GGGGCAATAG
|
Protein sequence | MNYSKINNLT GWFCFLVAAV TYILTLEPSV SFWDCGEFIA SAFKMQVVHQ PGAPLFLMIQ RFFSLFALGD VQKVAYFMNV GSAIASAATI LFLCWTITAL AKKLLVKENE EISRSTMISI MGAGIVGALA YTFSDSFWFS AVESEVYALS SLFTAIVFWA ILKWEAHANE KGADKWLLFI AYIMGLSIGI HLLNLLTIPA IAFVYYFKKT SKTTTSGIIK TGIIGILILA VIQYGIIQYL VSFGAYFDLF FVNTLGMGFG TGVMCFAILL IGGLVWGIRY SIKHQKRVLN IALLSTVLII FGYCSFAMII IRAKADPNLN NSDPDNAFSF LSYLNREQYG DRPLLFGPNY NSQKVNLTQG KTLYRKGAEK YEAAGRKTDY EYDRTTPFPR MYSDDQRHIG YYKDMMGFSD DHFPNLFDNI GFLFKYQIGQ MYMRYFMWNF VGRQNDDQGQ GSLYEGQWLS GIKPIDALML GNQKNLPPSI TDSNAYNRFF FLPLILGLLG AIWHFKRNQK DAGIVALLFF FTGLAIVLYL NQKPMEPRER DYAYAGSFYA FAIWVGLGVL AIREWLFKKL SPATGAVLAT VAGLFAAPVI MAAQGWDDHD RSTKMVAHDI AVDYLQSCAP NAILFTYGDN DTYPLWYAQE VENIRPDIRL VNLSLFDTDW YINGMRHKQN ESAPLPISMK PEQYVQGVRD VMYYQDYKVA GPVELSNILA ILLSDDPEDK LPLQDGSKEN FIPTKNFKLT VNRADVLKNG VVSATDSSKI APALEWTFNK NYVTKGTLAM IDILVHNDWK RPVYFASTVP SDQYNGLDQY LYNEGLALRL LPLKPDTAAN RSELINTPVL YKNVMDKFVW GNVKNAKYLD PQSSDDISIF TNVFNNTITG LIKEGKTADA KKVVNRYFEV MPERFYGMRS MMGTYFMAEN LYLLNEAPRA NALIEKSAAY IQKELTYLAD VSESKRRFIG NQNVQLGLSF LNQMARTTAQ YKQTKLSESL TRQFEGMEAR FSMYFSQGQ
|
| |