Gene Phep_1050 details


Gene Information

Locus tag: Phep_1050
Symbol:
ID: 8252144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1236235
End bp: 1239708
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 48%
IMG OID: 644934703
Product: YD repeat-containing protein
Protein accession: YP_003091332
Protein GI: 255530960
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
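
The start and end coordinates, gene length, and protein length in this record are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions agree with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the record.
START_BP = 1236235
END_BP = 1239708

def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3474 1157
```

Under these assumptions, 3474 bp corresponds to 1158 codons, that is, 1157 residues plus the stop codon.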


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGTA TCATGAAATT ATATCAGAAT TTAACGCCCC AAAGGGCATG GAAATATATC 
GCTGTTTGCT TAGTGGCAGG TTTTACCAGT TCAAATGCCC AGCACCTTAC GCTGAACACT
TACAGCAACC AGACTGAAAT TAAAGCTACC GGGAGTATAA CGCTGACGGA TGGATTTTAC
ATCCCTGCGG GCAAGAATGT AAGGATCTTT ACCGGGGCGA GTTTCCAGCA ATGTGTGGAC
CTGGTAAGTA CCCCAAGTGC AGATCAGAAT TACATCAGTA CCAAGGTATT TAAAAAAGAA
GGGGTAAATG AAGGGAACAT CAATGCCACA CTGAGTACCT GCGAGGTAAA CCAGACCGTA
CAGTATTTTG ATGGATTGGG CAGGCCTTTG CAAACGGTGA CGGTACAGGG CAGTCCCGGC
TTTAAGGATG TGGTGCAGCC TGTTGACTAT GATGCCTTTG GCCGGGAGCA GTTCAAATAC
CTGCCTTACT CTACTGTTAC AGGTGCCAAT GGCAGCTTTA AGCCTTCTGC AATAACCAGT
CAGGCAGGTT TTTACAACAG CCCCCCGGCA GGTGTGGCCA CGATTACAAA CACGGCCTTT
TCAGAGAGCA GGTTTGAACC TTCACCTTTG AACCGGGTAC TGGAACAGGG ATCCCCCGGT
GCCAGCTGGC AGCTTAGTGC CGGTCATACC CAGAAAATGG AGTACGGCTC CAACAACAGT
ACAGACTATG CAGTACGGTT GTATCAGGCT GTACCCTCTG CAACCCCCGG AGAAGAACAC
AAACGGATAC TCAGCGGAAC AGGTTATTAC ACTGCCAATG AACTGTACCT GAGCATCAGC
AAGGACGAGA ACTGGGCAAG TACCGATGGC AAAGCCGGTA CCACAGAAGA ATATAAGGAC
AAAGAGGACA GGGTAGTGCT GAAGCGTGTA TTTAATTATA AAAATGATGT GACTGAAACC
CTAAGCACCT ATTATGTGTA TGATGACCTG GGGAACTTAA GTTTTGTGCT GCCACCAGGT
GCAAACCCGG ATGCCCTGGC CCTGCCTTCC CAGACACTGC AGGACCAGTT CTGTTACCAG
TACCGTTACG ATGGGCGTAA AAGGCTGATT GAAAAGAAGC TTCCGGGCAA GGACTGGGAA
TACATGGTGT ACAACAAACT GGACCAGCTG GTGCTGAGCC AGGACTCCTT GCAAAGGGTA
GCTAACCAGT GGCTGTTTAC CAAATACGAT GCTTTGGGCA GGGTAGCAAT TACCGGTGTA
TACGGAGATG GGGCTTCCAG GAGCAGCCTG GCCGGTACAT TGAACAGCCA GAGCGTACTT
TGGGAAAACA GGCTGGGTAG CGGGACCGAT TATGACAACG GTTCATTCCC TCAGAACAAC
ATTGCCTGGT ACCATACGAT CAATTACTAC GACGATTACA ATTTTCCGGG CAATAGCTTT
CCCCAGCCTG ACGGGGTTAC CCAGATGTCG GCCTCCAGGG TAAAAGGACT GCAGACGGGC
AGTTTTGTTT ACCAGGTGAA CAGCAGTACC CGTTACCTGA GCGTGAACTA TTATGACAAA
GATGGCCGGG TGCTCAAAAC AGCAGCAGAG AACCATTTGG GCGGTACAGA CCTTACGGAG
AATACCTGGA ACTTTGCCGG TGAGCTGACG GGCAGTACCC GTACCCATAG TAGTGGCTCG
GGCCCAGCCA CTACTATTGC CACCCGGCAT GAATACGACC ATATGGGAAG GAAGAAGGCG
ACCATGGAAT CCATTAACGG TCAGCCTGAA GTGGTGCTGA GCAAGCTGGA CTACAATGAA
CTGGGGCAGC TGAGCAAAAA ATGGCTGCAC AGTACGGATA ATGGAACAAG CTTTTTGCAA
CATACGGATT ATGCCTATAA CGAACGGGGC TGGCTGAGCA ACAGTAGATC AGACCAGTTC
AGTATAAGGT TGAAGTACGA CGATGGCACC GTACCGCAGT ACAATGGCAA TATCGCTAAC
CAGAACTGGG GAGCAGCGAC GACCTACCCC AATACTTTTA CCTATGGATA CGATAAGCTG
AACCGCTTAC TGAGCGGGGT AAGTACCGGG GCGATATCGA TGAGTGAAGT GCTGAGTTAT
GATGTGATGG GCAACATCAG TACCCTGAAC AGGGATGGAG CAGGAGCGGG CAGCTATATT
TACGAAGGCA ACAGGCTGAA GAGCATCTCA GGTGGCGGGC TGGCTGCCGG GAGTTATGCC
TATGACGGGA ATGGGAATGC GGTTACCGAT GGGCGTACCG GTGTAAACTT AACCTATAAC
CACCTGAACT TACCCATCAC TGTTAATGGA TCGGGACTGA ACATTGCCTA TACCTATGAT
GCAATGGGCA GGAAACTGAA AAAGGTGAGT AACATGGAGG CCCCTTCAGA TTATGTGGAT
GGGGTACAAT ACACCGGAGG GGCAATAGAG TTCATCATGA CCGAGGAGGG TAAAGCGAGG
AGCAATGGTG GTACCTACAG CTATGAGTAT AACCTGACAG ACCATTTGGG TAATGTGCGG
TATACGTTCT ATCAACACCC ATCCAGTGGT TTGCTGGAAC GTTTACAAAG TGACGATTAT
TATGCTTTTG GTTTAAGAAA ATCAGGAGTT CCAATTTCTG GAAATAATAA ATATCTTTAC
AATGGCAAGG AGTTGCAAAG TGAGCTGGGA CAGCTCGATT ATGGTGCAAG GTTCTATGAC
CCGGAGATTG GAAGGTGGAA CGTGATTGAT CCGTTAGCTG AGAAGGGAAG AAGATGGTCA
CCATATACCT ATGCATTTAA TAACCCAATG AGATTTACGG ATCCCGATGG TATGTGGCCA
GACGATGGCT ATGGCCCGGG AGATGACGAA TTGATTGGTG TTCAAACCGG AATGGCCATA
GGGGGTGCGA TTAGAGATGG TATACATGGA CTCAGAACAC TTGTAGCAGC TGCTGGTGAC
GCTTTAGGTA TAAACAAGGC TGCCCCCGGA ATGAAATGGC AATCTGTAGA CAGTGAAGGT
GGCACATTAG GGTACAGTAT GGCTCAGGTT CCTAGTGAAG GTGGTTTAAA AGATGCCCTA
GGTCATCTGG GTGATGGTGC AAATGCTTTA GCTTTCAATG GTTCTCTGGC TAAAGGTACA
ACGGGTACAT TATTGGCTAA AACAGGTCAG GAAGGTCGAG CGGCAAGTGA AGGTGTGAAA
ATTATTAAGG ATGGAGATGC TGCAAGTAAT ATGCTTAATC CTTTTGATTT AACGCCTACA
CATGGTACAA CAGGATCAAA CTTAAAAAAA GTGGAAGCGT TAATTAAAAA AGATGGCGGG
ATAACTGACC CAGTAAAATA CATAGAACAT AAAGGCAACA AATATATAGT TGATGGTCAC
CATCGTGTGC AGGCAGCTAA AAAGTTAGGG TTTTCACAAG TGCCGGTGCA AGCAGAGCAA
TTGCCTTACG GAAGCTATAG GACAACGGCA GATTTTCAAT TTACAAAACA CTAA
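
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence laid out in this format. The sketch below is a minimal illustration, not the database's own method: it assumes GC content means the percentage of G and C bases over all bases, and it strips the spaces and line breaks used in the 10-base blocks above. The function name `gc_content` is hypothetical.

```python
def gc_content(dna):
    """Percentage of G/C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# Usage on the first row of the gene sequence; pass the full sequence
# to compare against the rounded percentage reported in the record.
first_row = "ATGCTCCGTA TCATGAAATT ATATCAGAAT TTAACGCCCC AAAGGGCATG GAAATATATC"
print(round(gc_content(first_row), 1))
```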
 
Protein sequence
MLRIMKLYQN LTPQRAWKYI AVCLVAGFTS SNAQHLTLNT YSNQTEIKAT GSITLTDGFY 
IPAGKNVRIF TGASFQQCVD LVSTPSADQN YISTKVFKKE GVNEGNINAT LSTCEVNQTV
QYFDGLGRPL QTVTVQGSPG FKDVVQPVDY DAFGREQFKY LPYSTVTGAN GSFKPSAITS
QAGFYNSPPA GVATITNTAF SESRFEPSPL NRVLEQGSPG ASWQLSAGHT QKMEYGSNNS
TDYAVRLYQA VPSATPGEEH KRILSGTGYY TANELYLSIS KDENWASTDG KAGTTEEYKD
KEDRVVLKRV FNYKNDVTET LSTYYVYDDL GNLSFVLPPG ANPDALALPS QTLQDQFCYQ
YRYDGRKRLI EKKLPGKDWE YMVYNKLDQL VLSQDSLQRV ANQWLFTKYD ALGRVAITGV
YGDGASRSSL AGTLNSQSVL WENRLGSGTD YDNGSFPQNN IAWYHTINYY DDYNFPGNSF
PQPDGVTQMS ASRVKGLQTG SFVYQVNSST RYLSVNYYDK DGRVLKTAAE NHLGGTDLTE
NTWNFAGELT GSTRTHSSGS GPATTIATRH EYDHMGRKKA TMESINGQPE VVLSKLDYNE
LGQLSKKWLH STDNGTSFLQ HTDYAYNERG WLSNSRSDQF SIRLKYDDGT VPQYNGNIAN
QNWGAATTYP NTFTYGYDKL NRLLSGVSTG AISMSEVLSY DVMGNISTLN RDGAGAGSYI
YEGNRLKSIS GGGLAAGSYA YDGNGNAVTD GRTGVNLTYN HLNLPITVNG SGLNIAYTYD
AMGRKLKKVS NMEAPSDYVD GVQYTGGAIE FIMTEEGKAR SNGGTYSYEY NLTDHLGNVR
YTFYQHPSSG LLERLQSDDY YAFGLRKSGV PISGNNKYLY NGKELQSELG QLDYGARFYD
PEIGRWNVID PLAEKGRRWS PYTYAFNNPM RFTDPDGMWP DDGYGPGDDE LIGVQTGMAI
GGAIRDGIHG LRTLVAAAGD ALGINKAAPG MKWQSVDSEG GTLGYSMAQV PSEGGLKDAL
GHLGDGANAL AFNGSLAKGT TGTLLAKTGQ EGRAASEGVK IIKDGDAASN MLNPFDLTPT
HGTTGSNLKK VEALIKKDGG ITDPVKYIEH KGNKYIVDGH HRVQAAKKLG FSQVPVQAEQ
LPYGSYRTTA DFQFTKH
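
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is hedged: it builds the codon map from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard table in the start codons it permits, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence above.
first_row = "ATGCTCCGTA TCATGAAATT ATATCAGAAT TTAACGCCCC AAAGGGCATG GAAATATATC"
print(translate(first_row))  # MLRIMKLYQNLTPQRAWKYI
```

The output matches the first 20 residues of the protein sequence. Translating the full gene sequence the same way stops at the terminal TAA codon.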