Gene Phep_0566 details


Gene Information

Locus tag: Phep_0566
Symbol:
ID: 8251653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 675804
End bp: 678935
Gene Length: 3132 bp
Protein Length: 1043 aa
Translation table: 11
GC content: 45%
IMG OID: 644934214
Product: TonB-dependent receptor plug
Protein accession: YP_003090850
Protein GI: 255530478
COG category:
COG ID:
TIGRFAM ID:
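
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(675804, 678935)
print(length)                     # 3132
print(protein_length_aa(length))  # 1043
```

Both results match the Gene Length and Protein Length values listed in the record.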


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00250412
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAGA CTTTTAGAAT TTTGTGCTTG CTGTCGCTGT GTCTGTCCCT GACGACATTA 
CATGCACAGG AAACCAAGCA GTTACTCACC ATTGCCGGTA TTGTTACCGA TGAAAAGGGA
GCAGCTATCC CAGCGGTATC TGTTTACATT AAAGATCGGC CCTCAGCCGG TACCGCTACC
AATAATGAAG GAAAATTCAG CATTCAGGTC TCGTATGGCG ATAAAGTCGC ATTTAGCTAT
ATCGGCTTTA AAAATGCAGA ACATGTTGCT GTCGAAACAA AAAAGAATTT AACCATAGTA
CTTAAAGACA ATAACGAGGC CCTGGAGGAA GTAATGGTAG TTGGACTGGG AAATGTACAA
CGGAAGATAA GTTCGGTTGG AGCCATCACA ACGGTAGATG TTAAGGACCT TCAATCCCCT
GCACCATCGA TTGCAAACCT TCTTGGCGGA AGGGCAGCCG GTGTAATCTC CAGATTAGGC
AGTGGTGAAC CAGGAAAGAA CATTTCTGAG TTTTGGGTAC GTGGTATCGG TACATTTGGA
GCCAACAGTA GTGCATTGGT GCTCATCGAT GGTTTGGAGG GCGACCTGAA TACCATTGAT
CCGGCGGATG TGGAAAGTTT TTCGATCCTT AAGGATGCAT CAGCTACCGC AGTGTATGGC
GTTAGGGGAG CCAATGGGGT AGTATTGGTT ACCACTAAGC GCGGTCAGGT AGACCGCATG
CAGCTCACCG CACGCGCCAA TACCACTTTA TCCAGTCTCA ACCGCTTACC AGAATATCTG
CGTGCGTATG ATTATGCACA GCTGGCGAAT GAAGCAAGTT TGGTCCGTGG GCAGAGTCCT
TTGTACAATC AAACGGAATT AGGTATTATC CGCGATGGAC TGGATCCGGA TATGTATCCA
GATGTAGATT GGCAGGATGA AATTCTCAAT AAGACTTCCT GGCGCCAGAG CTATTACATG
AGTGGCCGTG GTGGGTCTGA GGTTGCACGC TATTTCCTTA GCCTTGGCGG CAAAAGTGAA
AGTGCAGCTT ACAAAGTAGA CAAAAATAGC CAATACAGTT CTAATGTGGG TTTTAATACC
TATAATTACC GGATCAATCT GGATGTTAAC CTGACCAAGA CCACTAAAAT TTTCCTGGGT
TCTGATGGGT TTCTATCTAA ACTGGCTCAG CCTGGGTTAG CAAATACAGA TTACATATGG
GGAGCACAAT CCGTACTTAC CCCGGTGACC ATCCCTACAC GATATTCCAA TGGCCTGCTT
CCAGGCCTGG GAGGTGGAGA GCAGTCATCG CCTTATGTCA TGATCAACCG CACAGGTAAG
GCTTCAGATG AGGTGTATAA AGGCAAAACC ACATTAGCCC TAAACCAAGA CCTTTCCAGT
GTATTGGAGG GACTTAAATT CCGGGTTCAG GGAGCTTACG ATCTATACAG TTACTTTAGC
GAGCGTAGGC GTGTACAGCC AGCGCTTTAT AATGCCTTAG GACGGGCCTA CGATGGCTCG
CTCATTACTT TGCAAACCGT ACAAGAACAA AAAGCGAGTT ACACCAGATC TACCCGTCAA
TACCGTAAAT ACCACTTTGA ATCTGTTATC AATTATGATA AAGTATTCAA CAGTGACCAC
AGGACCTCCG CACTGGTCTA TTACTACATT AGCGATGCCA AAGATACCGA AGACGCGACC
AGTAACCTGG ACGCCATCCC CTTACGTTAT CAGGGTGTAT CGAGCAGGTT GACCTATGGT
TTCCGGGACA CTTACCTTTT AGATGTCAAT TTTGGCTATA CCGGTTCTGA GAACTTCCAG
CCCGGCAGGC AATATGGTTT TTTCCCGTCA ATAGCCCTGG GATGGGTACC TACTGGCTAC
AAATTTGTAA AAGAAGCCGC ACCATGGCTT AATTATTTTA AGATCAGGGC TTCCTACGGT
ACGGTGGGTA ACGACCGGAT TTCAACTATC CGTTTCCCTT ACCTGACCAA GGCTAACCAG
GGTAATGGCA CGGTATGGGG CGTGCCCGAT ATCGAGACAA TTAATGAGAC AAGGATTGGT
GCAGATAACC TGGCCTGGGA AAAAGCGATT AAATCCAATT TAGGTATTGA AGGTAAGCTC
TTTGATAGTA AAGTAGATTT TGTGGTAGAC TTTTTCAAAG ATCAGCGTAA CGGTATTTTT
CAACAAAGGG TACAGGTTCC TGATTACGTG GGCGTCATTT CCAATCCTTT CGCCAATGTG
GGCCGCATGA AAAGTTCCGG AATAGATGGA AACATCAGCT ATACTGGCAA TCTTTCCAAA
GACATAGGTT TTACCTTAAG AGGTAACTTC ACCTATTCCA AAAACCTAGT ACAGAACTGG
GAACAAGCTT ACCTGGAATA TCCTTACCTG GAGTACAATG GTTTCCCTTA TAATTCTATA
AGAGGCTACC AGTCGCTGGG ACTTTTTAAA GACGAAGACG ACATCAAATA TAGCCCTAAA
CAAACGTTTG GAGATGTACT GCCTGGCGAC ATCAAATACA AGGATGTAAA CGGTGATGGC
ATCATCGACA AACTGGACAT GGTTCCTTTA ACCCACAGCA ACTATCCTTT GATGATGTTT
GGCATGGGCG GCGAGTTCCG CTATAAAAAG CTGACATTAG GTGTATTGTT TAAAGGAACA
GGAAAAACAT CCTTTTTCTA TGTAGGACAG CCAACCACAA TAAACAATGT AACGGTAACT
AACGGAATGG GTTATATGCC ATTCTTTAAT GGTAACCTGG GCAACGTACT CAGCCTGGCT
GCCGATCCTA AAAACCGCTG GATCCCGAGG GATTATGCCC TGGCAAACGG AATAGACCCT
GCATTGGCCG AAAACCCAAA TGCCCGCTAC CCACGCCTGC AATACGGGAA TAACACCAAC
AACAGCCAGT TGTCGTCCTT CTGGCAGGGA GATGCGCGCT ATATACGCCT GGAGGAAATT
ACGTTGAATT ATAACATTAA TCCATCGATC CTGAAACGCC TTGGAATTAA GTCTATGGAC
CTGCAATTTG TGGGCAATAA CTTATACATA TGGGATAATG TTAAGCTTTA TGACCCGGAA
CAAGCCGCCT GGAACGGGCG TAAATACCCG ATACCAACAA CCTATTCCTT TCAAGTGTAC
GTCAATTTTT AA
 
Protein sequence
MTKTFRILCL LSLCLSLTTL HAQETKQLLT IAGIVTDEKG AAIPAVSVYI KDRPSAGTAT 
NNEGKFSIQV SYGDKVAFSY IGFKNAEHVA VETKKNLTIV LKDNNEALEE VMVVGLGNVQ
RKISSVGAIT TVDVKDLQSP APSIANLLGG RAAGVISRLG SGEPGKNISE FWVRGIGTFG
ANSSALVLID GLEGDLNTID PADVESFSIL KDASATAVYG VRGANGVVLV TTKRGQVDRM
QLTARANTTL SSLNRLPEYL RAYDYAQLAN EASLVRGQSP LYNQTELGII RDGLDPDMYP
DVDWQDEILN KTSWRQSYYM SGRGGSEVAR YFLSLGGKSE SAAYKVDKNS QYSSNVGFNT
YNYRINLDVN LTKTTKIFLG SDGFLSKLAQ PGLANTDYIW GAQSVLTPVT IPTRYSNGLL
PGLGGGEQSS PYVMINRTGK ASDEVYKGKT TLALNQDLSS VLEGLKFRVQ GAYDLYSYFS
ERRRVQPALY NALGRAYDGS LITLQTVQEQ KASYTRSTRQ YRKYHFESVI NYDKVFNSDH
RTSALVYYYI SDAKDTEDAT SNLDAIPLRY QGVSSRLTYG FRDTYLLDVN FGYTGSENFQ
PGRQYGFFPS IALGWVPTGY KFVKEAAPWL NYFKIRASYG TVGNDRISTI RFPYLTKANQ
GNGTVWGVPD IETINETRIG ADNLAWEKAI KSNLGIEGKL FDSKVDFVVD FFKDQRNGIF
QQRVQVPDYV GVISNPFANV GRMKSSGIDG NISYTGNLSK DIGFTLRGNF TYSKNLVQNW
EQAYLEYPYL EYNGFPYNSI RGYQSLGLFK DEDDIKYSPK QTFGDVLPGD IKYKDVNGDG
IIDKLDMVPL THSNYPLMMF GMGGEFRYKK LTLGVLFKGT GKTSFFYVGQ PTTINNVTVT
NGMGYMPFFN GNLGNVLSLA ADPKNRWIPR DYALANGIDP ALAENPNARY PRLQYGNNTN
NSQLSSFWQG DARYIRLEEI TLNYNINPSI LKRLGIKSMD LQFVGNNLYI WDNVKLYDPE
QAAWNGRKYP IPTTYSFQVY VNF
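
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons (this gene starts with ATG). All function names are hypothetical.

```python
# Codon table built in the conventional TCAG order; amino-acid assignments
# for translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGACAAAGA CTTTTAGAAT TTTGTGCTTG CTGTCGCTGT GTCTGTCCCT GACGACATTA"
dna = clean_sequence(first_line)
print(translate(dna))    # MTKTFRILCLLSLCLSLTTL
print(gc_fraction(dna))  # GC fraction of these 60 bases only
```

The translated fragment matches the first 20 residues of the protein sequence. Applying the same functions to the full gene sequence would allow the listed protein and GC content to be checked in the same way.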