Gene Phep_2018 details

Gene Information

Locus tag: Phep_2018
Symbol:
ID: 8253122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2324417
End bp: 2326633
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 37%
IMG OID: 644935666
Product: Acyl transferase
Protein accession: YP_003092285
Protein GI: 255531913
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
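
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2324417
END_BP = 2326633
GENE_LENGTH_BP = 2217
PROTEIN_LENGTH_AA = 738


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: 2326633 - 2324417 + 1 = 2217 bp, and 2217 / 3 - 1 = 738 aa.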


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00134115
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGAATA AACATACAAT GATAAATTTT GCAAAACGCG TTTATTTTGC TTTCTGGTGG 
AGGGCACATA CAGTTAAAAG CTTTCTGATA AAAAAAAAAG AAATAGATCG TTTGGTGTTT
ACTTTTGGGG GGCGTAATGA CAATTGGCCT ACAGTTGGCC GTAGTCTTTA TAAAGAGCAT
AAGATCTTTA GAAGAGTAAT CCAGAATTGC GATGATATTT TAACTGAACA TGGGGGAACT
GAGATACGAT CCTATTTTGA AGGTGAGGTA AATCCACATA TTTTTGAAGA CGAATCTAAA
TTTACTTGTA TCACAGCCAT ACAATTGGCT ACTGTGGATT TATACAACAG CAATGGTATT
TATCCAAATG CAGTTATGGG GGTTAGCATG GGAGAACCTG CAGCCGCTTA TGCAGCAGGG
GCGCTTACAC TTAAAGAAGC TTTAAAAATC TCCTTAAGCT ACATGACACT TTATAAAGTT
GAGCGGATGC AATACGCCTT TGTATTTCTG AACCTCAATT TTAATGAAGC TTTAGATTTT
TGCAAAAGAT CACCAGTATG GGCAGAAATA ATTTATGAAG ATAGCCCACA AGCTCAGCTG
ATTGCTTGTG ATAAAGATGA CATTGACCAA TTAAAACAAT TTTTTTTGGT TGAGGGAATG
GCCTTTAATT TTGCTGCTGA AAAAACATTT ATTCCTTATC ATACTTCCCG AATCAGGTTG
CAGCATAAAT TATTGTATAA GTACTATGAA GATATAGAGC CAAAACCTTT AAAATGCGAC
TATTATTCAC CAACCCTGGG CAAAATGATT CCAGGAAATA TTGTTCTTGA TCGCGAGTAC
TGGTATAACC TTGTCTGTTT GCCTGTACTA TTTGCGAAAA CATTACAAGC AGTTTTAAAT
GATGGTTATA AAACTTTCTT ACAAATTGGT CCACCAGCGA TCTCAGAAAG ACAGTTTAAT
GCAGTATGCA AACCTGTTGA AATTAAAGTA TTCAACTCCA TGCAATCCGA ATCAGTTGAA
ATAGCAAACC ACAATCTTTT ACAAAAACAA CTGACCAAGA TAAAATTTGA AAGATCTTAT
TTAATTGACG ATGAAATAAT TACACTTACA AACTTTAAAC AAACCTTTAA TATATATAAT
GATCCATCAC CACTTTCATT TGAATATTTA AGAAAGTATG GTGCTGTCCA CTTCCTACCA
GCCCACACTG CCTGGCTGGT TCTGGGTTAC AATGAGGTTG AGTATATTTT AAAACAGCCA
CAAATATTTT CCAGTTCAAT ATTAAAAGAT TACGATCCTA TTCTTCTGGG TGCAGATCCT
GAAACTCATA AAATCATCCG AAATTTGCTG CAACCATTAT TTTCGCCGGA AGTAATCTCA
GAACTGGCTC AATTTACAGC CGTTACCGCC CAGCAATTAC TGGATTTGAT CTTTCAGCAG
GACAATTTTG ATATGGTTAA AGATTATTCA GACCCTTTGT CACTTTTGGT GTTGTGCAAC
TTTTTTGGAC TGTCCTCTGA AAATGCTGAG AGAATGCTAA ATTTTACCGG AAAAGATTAC
CAAAACATGC TTTACTGGCA GCGTTTGCAG GAATTTTTCG AAACTGAATT TTTGATCTGT
GAACTCCATA AACAAGACTG TTTATGGGGA AAATTACGTC AATTGGTTAT CAATAATGAA
TTTATCCTGT CAGATGCAAC AAGTCTGTTG AGAATCATTT GGACAGCCGG AATGGCAACA
ACCAGCGCTC TAATCAGTAG CGCTATATAT ATAACCTTAA CCGAGCCAGC TCTAGCTGAT
CAGTTAATGG ATGATGAAAA GCAGATCGGC AGATTTATTG AAGAATGTCT CAGGCTGCAA
ACACCAATAA CAGCTGTTCA CAGAATCGCT ACTCAAGAAG TGATTTTACA AGAACAAACA
ATTCCTGCAG GAAGTACGTT GATGCTTAAT TTAAGATCTG CTATGACAGA TCCTGATCAC
TACACTGAAC CGGAAAGATT TTCCCTAACT CGACCAGCAA AAAGACACCT TGCCTTTGGT
GCAGGTATCC ATCAATGTAT AGGAATGGGT ATGGCCCGTG CAGAAGCCCG GGGTGCAATA
CAAACCTTGC TTACAAGGCT ACCGGACCTT AAAAAATACA TTTATACAAA ACCAGAATAC
AATAAAGGCT CTGATCTGGT GATCATGTCT TCGTTAAAAC TATCTAAAAA ATCTTAG
 
Protein sequence
MMNKHTMINF AKRVYFAFWW RAHTVKSFLI KKKEIDRLVF TFGGRNDNWP TVGRSLYKEH 
KIFRRVIQNC DDILTEHGGT EIRSYFEGEV NPHIFEDESK FTCITAIQLA TVDLYNSNGI
YPNAVMGVSM GEPAAAYAAG ALTLKEALKI SLSYMTLYKV ERMQYAFVFL NLNFNEALDF
CKRSPVWAEI IYEDSPQAQL IACDKDDIDQ LKQFFLVEGM AFNFAAEKTF IPYHTSRIRL
QHKLLYKYYE DIEPKPLKCD YYSPTLGKMI PGNIVLDREY WYNLVCLPVL FAKTLQAVLN
DGYKTFLQIG PPAISERQFN AVCKPVEIKV FNSMQSESVE IANHNLLQKQ LTKIKFERSY
LIDDEIITLT NFKQTFNIYN DPSPLSFEYL RKYGAVHFLP AHTAWLVLGY NEVEYILKQP
QIFSSSILKD YDPILLGADP ETHKIIRNLL QPLFSPEVIS ELAQFTAVTA QQLLDLIFQQ
DNFDMVKDYS DPLSLLVLCN FFGLSSENAE RMLNFTGKDY QNMLYWQRLQ EFFETEFLIC
ELHKQDCLWG KLRQLVINNE FILSDATSLL RIIWTAGMAT TSALISSAIY ITLTEPALAD
QLMDDEKQIG RFIEECLRLQ TPITAVHRIA TQEVILQEQT IPAGSTLMLN LRSAMTDPDH
YTEPERFSLT RPAKRHLAFG AGIHQCIGMG MARAEARGAI QTLLTRLPDL KKYIYTKPEY
NKGSDLVIMS SLKLSKKS
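
The relationship between the gene sequence, the GC content field, and the protein sequence can be explored with a short script. This is a minimal sketch, not the method used to generate this record. It assumes the sequences are pasted in as plain strings in the 10-character blocked layout shown above. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may initiate translation; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence listed in this record.
    first_line = (
        "ATGATGAATA AACATACAAT GATAAATTTT "
        "GCAAAACGCG TTTATTTTGC TTTCTGGTGG"
    )
    print(translate(first_line))  # MMNKHTMINFAKRVYFAFWW
    print(round(gc_percent(first_line), 1))
```

The translated first line matches the first 20 residues of the protein sequence above. Running `gc_percent` and `translate` on the full gene sequence would allow the GC content and Protein Length fields to be checked in the same way.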