Gene Phep_2838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2838 
Symbol 
ID8253946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3378132 
End bp3381365 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content42% 
IMG OID644936484 
ProductLyase catalytic 
Protein accessionYP_003093099 
Protein GI255532727 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00102032 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACTACC GGATAGCCAC ATTGGCACTT ACACTTGGAT TAAGTACGAT TGCACCAGGC 
CTAAAAGCAC AAAAGGAATC AGTGCCAACT TTTGTAAAGG ATATTGTGAC CAACTATCTT
ACTTTTGAAG AAGCGCAGGA TATTGCGAAC TGGAAATCAG AAAAAGGTAA TCTCTCTTTT
TCATCTGCTC ATTTTAAAGA TGGCACGCAA TCGCTGCTGT GGACGTATCA AAATGGTGCC
ACTATAGAAA TTGCCAATTT AAAGGGCTTA AAAGAAGCCG GAGATTTTTA TCCTGGCGGT
CAGCCTGAAG TTTACGAACC GTCGTTCTAC AAAAAATCGC ATTATGGAGG TGTAAAAATG
TGGTTATACC AGGAAAAACC TTCCATCGGC AAAATTACTT TTCAGGTAGG CAGCAATTTG
CAAATGGCAA ATACGGCTCC AAAATATCGT TTTTCGGTAA ACCTTAACTT TACTGGCTGG
CGGGCCGTTT GGGTTAATTT TAATGAAGAT GCGCTGGTAA AAAACTACAA GGGCAGTGAT
GAAATGACCA GTTTGGTGGC ACTTACTCCA TCGGGGCAAA GTGGTAAAAT ATTCATCGAC
CATTTCATGC TGCTGAGTTT TGTTTCTAAT AAACGTCATT CCGATCTGCA GTTTGAAAAC
CATAAACTTA ACCTCCGCTC TGGTGATGGC TATGAAATAC TGGCACCTTA CCAAATATTT
TTAGCCAAAC AATTTAACCA GCAGGTAGAT GTTAAAAAAC TAACCGAAGA AAGTAAGACC
ATAGCAGACC GATTGGAATT TTTAATACTT GGCGATAAAA CCAACGATTG GAAAAAACGC
AATACGGGCA TCGAAAAAAC AATAGATGGT AAAATTAAAG GTGCTTCAGC TATTTTCGAT
AAGCTTGGAA TACATAAAGC AAACGATTTT GTAAATGGAA GACCCTTATT TGGCATCCGT
GATGAACATA TCCCTAAAGA AGGTTTAAAT TACGATGAGG CTATCCTGCC AACTGCCTTT
CCCTTAGCAA TGGATTACAG GCTTAATGGC AATAAAACTG CCAGAGAAAA ACTGATGCTG
ACCTTTGATT ATTTACAGGA CCAGGGTTGG GCAGCGGGCA GTGCTTTTGG TACGGTAGAC
CATGTGATAA AGCTTAACCC GATTGCAACC GCTATTTTTT TGGTAAGAGA TGATCTGAAA
GCACAAAATA AGCTAAAAGC AGAAACTGAT ATGCTGATCT GGCATACCCG TTTGGGCAGT
ATGTTAACCA TTGATTACAC ACGGGGAGAA AATTCTGATA AAATCCGCGG AGGTGCCCTG
GCCAAACTCA TTACCATTTT ATTAATGGAA AATGATAGCC GTAAGCAAGA GTTGTTGCAG
GATTTTAAAA GCTACATGGA TTATGTAGCC AGTATTTCGC CCGGCTATAG TGACACGTTT
AAACCCGATT TTTCTATTTA TCACCACCGG GGAACTTATT TAAATACTTA TGGTACCAAT
GCGCTAAACA CCATGGCACT CATCCATTGG CTGCTTAGTG GCACACCTTA TCAGCTTTCG
CCACAAACTA CCGGCAACCT TAAACAGGCA TTAAAGCGAC AGGCGGAAAT TGCCTATGGA
GTAGATATTC ATTATGGTGC CGGTGGGCGT TTTCCGCTGG GCAACAGCTC ACTTGATGGG
TTTACCTTTC CGGCACTGGC CTATATGAGT ATGAATGGCA ATACCATTGC AGACAAAGAA
ATGGCTGAAC TGTTTAACTA TATATACGAT ATTGCCGAAC CAGATGTGGT AGCCAGAATG
CTTAGCCCCG TGCTTACCTA TTCGGGTACT TTTGGCACCT TAAATTTAAT AGAACGACTG
CACAAAATGG TAGGTGCACA AAAACACAGA CCTGCCGATG GTGCCGTTTC AATGCCTTAT
TCGGGCTTAA TGGCTTACCG CCAGGGCAAT GCCTTTGCAA CCGTAAAAGG ATACAACAAA
TACGTTTGGG ATTTTGAAAG TGGCAGAGGA GAAAATAATT TGGGCCGTTA CCTTAGCCAC
GGTATGCTGG TAACGGCACA GGGCGATGAA AAAATGGGTT TCAAAAGTTT GGGTCTGGGT
TTGAATGAAG GTTTCGATTG GTCTATGTTG CCCGGAGCTA CAACTAAAAT GTTACCGGCA
GACAAAGTAT TGTATTATAC AAAAGGTGAC CAAAAATATA TAGAAGGTAA ACACCGCAAT
TTCTCTGAAA GCGTAGTGGC CAGTGGACTG CAACAGGGTA CCAATGGATT GTTTGCCTTA
GACCTGCGTG ATGATGTTTT TCCGGATGAA GATAAATCTT TGTTCGACAA CAGTTTCAGG
GCCCGAAAAA CCTATTTCTT TATCGGTAAT GAAATTATTT GCCTGGGTTC TTCCATTCAA
AATAATGATA CCAGGTACCC AACTGTAACC ACGCTGTTTC AGTACCGAAC CGATGAAAAA
CGGAAAAACA GTTTTAATGG AAAGGAAATC AGTGCTTCTT CATCCCTTAA CCAAAAAGCC
GATGGGGGTT ATTTTACCGA TCAGAACGGT TTGCATTACA TCATTCCCAA AGGACAACCT
ATTGTGCTGG GCCAGGATAA GCAAATTTCT TACAGGGGCG CTAATGTAGG CGACCAAAAG
TCGATCGTTT TAAACACATC CGGAGCTTAT GAGAAAACAC AGGAGGTATA TACAAAAGCC
TGGTTTGATC ATGGTACAAA CCCTAAAGAT AAAGGGTACG AATATGAGAT TGTACTGAAT
ACGCCAACAG CTGACCTGAA ACCGTACCTG CAAAATAAAA CCTATGAGGT GATGCAAAAA
AATGCATCAG CCCATATTAT CAGGCATCCA TTATCGGGTA TTACCGCATA CGCTGTTTAT
AGTGCTGCGC AACCCTTAAA AGGAACGCTT ATTCATGTTG ACAGACCTAT GCTGGCAATG
GTAAAAGAAC AGGCTGGGCA TTTATGGCTT AGCCTTGCCG ATCCTGATAT CAGGCAACCA
AAATGGAATC ACAATATGAG CCATATGCCC GATAGCATTA CCAATGGCTG GGGCACAGGG
AGCATAGCTA CGCTTACCCT AAAAGGGGCA TGGTATATTG CCAAACAGCT GCAGGAGGTG
GAAACCGTAA GTTACATGAA TGGTAACACT ATCTTAAAAG TTTTTTGTAA AGAAGGCAAA
AGCATAGATA TCCCTTTACA GCACCGTACT ATGGATAACG TGGATACGGA ATAA
 
Protein sequence
MNYRIATLAL TLGLSTIAPG LKAQKESVPT FVKDIVTNYL TFEEAQDIAN WKSEKGNLSF 
SSAHFKDGTQ SLLWTYQNGA TIEIANLKGL KEAGDFYPGG QPEVYEPSFY KKSHYGGVKM
WLYQEKPSIG KITFQVGSNL QMANTAPKYR FSVNLNFTGW RAVWVNFNED ALVKNYKGSD
EMTSLVALTP SGQSGKIFID HFMLLSFVSN KRHSDLQFEN HKLNLRSGDG YEILAPYQIF
LAKQFNQQVD VKKLTEESKT IADRLEFLIL GDKTNDWKKR NTGIEKTIDG KIKGASAIFD
KLGIHKANDF VNGRPLFGIR DEHIPKEGLN YDEAILPTAF PLAMDYRLNG NKTAREKLML
TFDYLQDQGW AAGSAFGTVD HVIKLNPIAT AIFLVRDDLK AQNKLKAETD MLIWHTRLGS
MLTIDYTRGE NSDKIRGGAL AKLITILLME NDSRKQELLQ DFKSYMDYVA SISPGYSDTF
KPDFSIYHHR GTYLNTYGTN ALNTMALIHW LLSGTPYQLS PQTTGNLKQA LKRQAEIAYG
VDIHYGAGGR FPLGNSSLDG FTFPALAYMS MNGNTIADKE MAELFNYIYD IAEPDVVARM
LSPVLTYSGT FGTLNLIERL HKMVGAQKHR PADGAVSMPY SGLMAYRQGN AFATVKGYNK
YVWDFESGRG ENNLGRYLSH GMLVTAQGDE KMGFKSLGLG LNEGFDWSML PGATTKMLPA
DKVLYYTKGD QKYIEGKHRN FSESVVASGL QQGTNGLFAL DLRDDVFPDE DKSLFDNSFR
ARKTYFFIGN EIICLGSSIQ NNDTRYPTVT TLFQYRTDEK RKNSFNGKEI SASSSLNQKA
DGGYFTDQNG LHYIIPKGQP IVLGQDKQIS YRGANVGDQK SIVLNTSGAY EKTQEVYTKA
WFDHGTNPKD KGYEYEIVLN TPTADLKPYL QNKTYEVMQK NASAHIIRHP LSGITAYAVY
SAAQPLKGTL IHVDRPMLAM VKEQAGHLWL SLADPDIRQP KWNHNMSHMP DSITNGWGTG
SIATLTLKGA WYIAKQLQEV ETVSYMNGNT ILKVFCKEGK SIDIPLQHRT MDNVDTE