Gene Phep_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1017 
Symbol 
ID8252111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1191070 
End bp1193430 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content45% 
IMG OID644934671 
Productprotein of unknown function DUF1080 
Protein accessionYP_003091300 
Protein GI255530928 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0838366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA CCACAAGTAT TTTAAAAATG ACAATGCTGT TGCTGCTGAC TGCATTTACC 
ACTTATGCTC AGCAGATTGG TAAGGACAAT CCTTTACTGG GCGACTGGAA GACCAGCGAT
AATAAATATT ATCTGCAGGT ATTTCTGACC AAGGAAGGTG CCTGTAAGGC CAATCTGATC
AATAAGCTAA ATTCGCCTGA AGCGCCGCTT GCAGTGCTTT CTGCTGAACT TGCAGGAACT
GCCCTTAACT TTAACGGGGA TGGCTGGAGC GGAAAAGCAG AAAAGGGTAA GTTAAACCTG
GTTAAAGGAA CAGAGAAACT TGTGTTCAAT CATTATGTCC GTACTTCCCC AACCATGGGA
GCAGCGGCAC CTAAGGATGC AATTGTATTG TTTAATGGTA AAGGCCTGGA TGAATGGGGC
AAATTGGCCC CTAAAGAATG GTTAAAACCA GATGGCCCTG TTGATGGCTG GAAAATTTTA
CCCGGTGGGA TTCTTGAAGT TACCCCAGGT ACAGGTTCCA TCGTTACCAA AAAAGTGTTT
GGAGATTTAA AACTTCATTT TGAATGTCGT TTATTGGGCG AAGTGACCAA TGGTGGTGTA
TATTTCATGT CCAGATATGA AATAAATATC AAAGATTCTT ATGGCCAGAT CGGAGGTTCT
CCATCCGGCG CTTTTGGAAA CATTTCCAAA CCGGCTGATT TTTATCCCAG TGTAAATATG
GCGTATCCGC CTATGGTATG GCAAACATTT GATATCGAGT TCAGGGCACC CCGTTTCGAT
GCTTCGGGTA CTAAAAAGAT AGAAAGTGCC CGCATTTCAA TTGTGCACAA TGGGGTGCAA
ACTTATAAAG ATGCCCCTAT GGAAGAAGTT AAAGGCGCAA CAGGAATTTT GGGCGAAGCA
GGTGTAGGTC CGATTTATTT ACAGGAGCAC GGTACAGCTT ATCAGTTCAG AAATATCTGG
GTGGTAGATA AAACAGTTAA GGGTACAGAA AATGCGCATA CGCCCGTAGT GGCATCTGCT
GATGGCGCTG TGAAGAAAGG CGGTGGCGGT AAAAAAGGCG GTGGTGGCAA AAAGAATGGC
ACAGGTAAAA AGCCTGGTGC AGAAGGCGAT ACGGCTGCTG CCGGTGAGCA ACCTGCCAAA
AAGAAAGGCG GAAGTAGTAA AAAGAATGCC ACTGCAGATT ATGAAGGTGA AAAGAACCCG
GCTTATGCCG CGGTAACCGT AATGATCACA CCAGATTTAA GTGGAAAGCC GGCAAAACCT
GTTGGTTTTA ACCATCCCGG CGTATTGGTA AACCGTGCAC AGCTGGACGA AATTAAGAAA
AGGGTAGCTG CGGGAACCGA GCCACAGAAG TCGGCTTTTG AAGCCTTAAA AGAGAGTCCC
CTGGGTGCTT TGGGTTATAC CCCTCAGCCA AGGGATACCG TTTCCTGTGG TCCTTATTCT
AAACCCAACC TGGGCTGTAA AGATGAACAG AATGATTGTG CAGCAGCCTA TACACAGGCT
TTGTTATGGT TCATTACCGG CAATAAAACC TATGCTGAAA GTGCCATCAA AATTATGAAT
GCCTGGTCAA CAAATCTCGT AGGTGGGCAT AATTATGCCA ATGGCCCTGT ACAGGCAGCC
TGGTGCGGTT CGGTATGGCC AAGAGCTGCT GAAATTATCA GGTACACTTA TAAAGGTTGG
TCTGATGCGG ATGTGCTAAA ATTTCAAAAC ATGCTCAGGG TACAATATCT GCCATCTATT
ATTCATGGCG ATTGTGAGAA CGGCAATAAG GAACTGGCCA TGGCCGAAGC AATTATCAAT
ATTGGCGTGT TTAATGATGA CCGTACAGTG TTTGACCTGG GTTTAAAAAT GTGGAGGGGG
CGAACTCCTG CTTATATCTA TTTGAAAACG GATGGGCCAA AACCAATAGA CCCTCCGGGC
TGCGGACCGG CAATATGGAG CAATAAAGGC TATATGCCAG AACTTGTTGA TGGTTTGATG
CAGGAAACGG CAAGAGACTC CCATCACCCA TGGATGGCTT TTGCTTCGAT GGCAAATGCT
GCCGAAACGG CCAGGCAACA AGGGGTCGAC CTATACGCCG AAGAAGGTAA AAGGATGGTG
GCTGCTTTGG AATTCCAGGC ACAATACTTA AAACCGAATA AAGCTACACC ACCTGCAGAA
TTACAATTTG CATTGCAGCC AACCTGGGAG ATCGCCTACA ATCATTTTCA CAACAGAATG
GGTATGAATT TGCCAAATAT GAAACTGGTT CTACCTACCA ACAGGCCAAC CAAAGGCGAT
CACCATATGC TGTGGGAAAC CCTTACGCAT GCTGAAATGG GGAAAATAGG TGTTCCTGAA
CAGGTTAAAA AAGCAAAATA A
 
Protein sequence
MKRTTSILKM TMLLLLTAFT TYAQQIGKDN PLLGDWKTSD NKYYLQVFLT KEGACKANLI 
NKLNSPEAPL AVLSAELAGT ALNFNGDGWS GKAEKGKLNL VKGTEKLVFN HYVRTSPTMG
AAAPKDAIVL FNGKGLDEWG KLAPKEWLKP DGPVDGWKIL PGGILEVTPG TGSIVTKKVF
GDLKLHFECR LLGEVTNGGV YFMSRYEINI KDSYGQIGGS PSGAFGNISK PADFYPSVNM
AYPPMVWQTF DIEFRAPRFD ASGTKKIESA RISIVHNGVQ TYKDAPMEEV KGATGILGEA
GVGPIYLQEH GTAYQFRNIW VVDKTVKGTE NAHTPVVASA DGAVKKGGGG KKGGGGKKNG
TGKKPGAEGD TAAAGEQPAK KKGGSSKKNA TADYEGEKNP AYAAVTVMIT PDLSGKPAKP
VGFNHPGVLV NRAQLDEIKK RVAAGTEPQK SAFEALKESP LGALGYTPQP RDTVSCGPYS
KPNLGCKDEQ NDCAAAYTQA LLWFITGNKT YAESAIKIMN AWSTNLVGGH NYANGPVQAA
WCGSVWPRAA EIIRYTYKGW SDADVLKFQN MLRVQYLPSI IHGDCENGNK ELAMAEAIIN
IGVFNDDRTV FDLGLKMWRG RTPAYIYLKT DGPKPIDPPG CGPAIWSNKG YMPELVDGLM
QETARDSHHP WMAFASMANA AETARQQGVD LYAEEGKRMV AALEFQAQYL KPNKATPPAE
LQFALQPTWE IAYNHFHNRM GMNLPNMKLV LPTNRPTKGD HHMLWETLTH AEMGKIGVPE
QVKKAK