Gene Phep_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0343 
Symbol 
ID8251428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp401264 
End bp404554 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content41% 
IMG OID644933991 
ProductTonB-dependent receptor plug 
Protein accessionYP_003090629 
Protein GI255530257 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.603691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGGG GTCAGCAGTA TAAATGGCTG GCCGGCCAGA GCTATTTAAC TATCCGGATT 
TTTATACTAT ATTTCGGGTA CGTACACTTA TTAATAAAAC TTAAAATTAA ACCATATAAC
ATGCTGAAAA TTTACAACAA AACCAAACGA CCATGGCGAA TTCCCTGCTG GGGAATAGCC
ATGCTGATTT TCCTGACCTC CTTTATGCCG GGGCGGGTTT TATCACAAGA TTTAAAATTA
ATTACCGGTA CAGTTACTGA AGCTAACGGA GCACCACTTC CGGGTGTAAG TGTTAAAGTA
AAGGGTACAC AAAAGGCGAC TTCAACAAGT AATGCCGGAA AATACAACAT CCAGGCTAAA
AGTACTGATG TTTTAGTTTT TTCTTTTGTT GGCTCAGAAA CACAGGAAGT CACTGTTGGT
GCCAGAACAA CAATCAATGT TAAACTGGCT GATGATGCGA AATCATTAGA CGAGACCGTG
ATCATTGGTT ATCAGACGGT AAAGAAAACT GATTTAACCG GTGCAATCGG CCAGGTTAAA
ATGTCTGACC TTAATAAAGC TCCTGTGTCC TCATTTGCTG AGGCGCTTGC CGGACGTGTA
TCTGGCGTAC AAGTGTCATC AGGAGATGGA CAACCGGGTA GCAACATGAG CATTATGATC
AGGGGGGCGG CATCAATCAA TAACAGTACG CAACCTTTAT ACGTTATCGA TGGTTTCGCT
ACGGAGGCGG TTGAAAGTAC CAGCTTGAAC CCTGATGATA TTGAATCCAT AACCATATTG
AAAGATGCAG CTGCTACTTC TGTTTATGGA TCAAGAGCAA CAAATGGTGT TGTAGTGATT
CAAACTAAAA GTGGTAAAGT TGGGAAACCA GTGATTTCCT ATAACAGTTC ATATGGTGCC
CAGATCGTAG CTAAAGAAAT TGATCTGATG AGCCCTTATG AATTTGTAAA ATACCAGTTT
GAACGTAACC CAACAGATGC AGCGGCAAAC TATCTTACCA ATGGGAAAAC ATTGGAGTCC
TATAAAAACG ACAAGGGTAT TTACTGGCCC GGACAAATTC TTCGCACCGG TTCTGTGTTT
ATAAATAATA TTGCGGTAAG AGGCGGTAGC GAAAATACCA AGTATGCCAT ATCGGGTGGG
GGCTTTGATC AAAAGGGAGT GCTGATTAAC ACCGGTTACA AAAGATATCA AGGAAGGGCA
AAACTTGATC AGACCATAAG TAATAAGCTT AAAGTCGGTA TCTCTGCAGA TTATATGGAT
GTTTCTGCTT TTGGGGTGCA GGCTTCCAGT ATTCCTACCG GCGGTGGTGC TTCCAGCGCA
ATCTGGTTTA GGACCTGGGT TTACAGACCC ACCGGCGGAA GTGCAACCAC CTACGACATT
TTGAATGAAG ACGGAGATCC CGAGAGCATT AATAGCAGTG ACATCAGGTT AAACCCAAGG
GTAACCGCAG AAAATGAATA CTCGTATAAT ACGTATGTGA ACTTTAACGC GAACGCCTAT
CTTACCTATA ATGTAACAAA AGATCTGATT TTAAAAGTTA CAGGGGTAAA AAATGCACTT
AGAAGAGGTC AGGATCGCTT TTATAATTCA AAAACGCCAC AGGCAAGCCC ATTAAATCCA
CTGGCTACTC AGGGTATATT TGGATCCGTG CTTCAATCTT TTGCCGACAC CTGGTCGAAC
GAAAACACCC TTACTTATAA CAAACTTTTT AATGATACCC ATTCTTTAAC GGTAATAGGA
GGTAATAGTC AAACATCTTA TAAAAGTAAG TCTAATGGAT TTACCTCTAC CTTTTTACCA
GAAGAAAGTC AGGGTATGGC CGGCCTGAAT GAAGGAACAG TTACCGCTCC TGTGGCAACA
GCCAGCAGCA ATACACTTTC TTCTTTTTTC GGGATCGTAG ATTATAATTA TAAATCTAAA
TATTACCTCA AAGCAGGATT AAGGGCCGAT GGATCTTCAA AGTTTTCACA ACGCTGGGGA
TATTTTCCAT CGGGTGCAAT TGCCTGGAAT ATGCATAAAG AAGACTTTAT GAAAGACCTG
ACTTTTATTT CTAATTCTAA ATTGAGGTTA AGTTATGGCG TAACAGGAAA CAACAGAATT
GGTGATTTTG ACTGGTATGC CAAGCTGGAA CTGGATGCCG GAGCTGGTTA TTCCTACAAT
AACTCACCCA ATATAGGGGC TTACATATCA GGCGTTGAAA ACAGGAATTT GAAATGGGAG
AAAACTGAAA GCTCTGATAT TGGCTATGAA CTGGGTTTAT TTGACAATAA GATTGAACTT
ACGGTAGATG CCTATCGGAG AACTACAAAA GATTTGCTGA TCACTAATGC CCCTATTGCG
GCACATACAG GATTTGCAAC GGCCACTAAA AACATAGGTT CATTGCGTAA CCAGGGACTG
GAATTTGGTT TAAATACGGT TAATATAAAA ACCCGGTCAT TCACATGGGA AAGTAGTTTT
AACATCACTT TTAATAAGAA TAAAATTATT GCATTAAACA GGGATCAGGG ATTTATACAA
ACCACTCCTT CATTTGAAAC TGCTTTTACC GATATGTATC TTTCAGAAGT TGGGCAGCCC
TTGGGCATGA TGTATGGTTA TGTATGGGAC GGTAATTACC AATATGCCGA TTTTGACAAT
CCTTCACCGG GTACCTATAT CCTTAAGCCA TCAGTACCTA CAAATGGAGC TTCAGTGATC
CAACCTGGAG ACATTAAATA TAAAGATTTA AATGGGGATG GAACAGTAAA TTCATTGGAC
CGGACAATTA TTGGCCGTGG CCAACCCATT CATTTTGGCG GGTTCTCTAA TAACTTCGGT
TATAAGAATT TTAGCCTGAA TGTTTTTCTA CAATGGTCTT ATGGAAATGA CATTTACAAT
GCCAACCGGC TTATATTGGA AGGTAATGCA AACGGAAGAA CGGATTTGAA CCAGTTTGCA
AGTTATATTG ACCGTTGGTC CCCAACAAAC CAAAACAGTA AAAACTATCG TGCCGGAGGC
CATGGGCAGG TGGGTTACCA TTCTTCAAGA GTTGTGGAAG ATGGCTCATA CCTGCGGTTA
AAAACGGTTT CACTGGCTTA TGCCTTACCA GCCAAATACA TCAAAAGATT ATTTCTGAGT
TCTTTAAGTC TGAATGTTTC AGCACAGAAT TTATTTGTTT TGACCAAATA TACCGGTATA
GATCCTGAAG TTTCAACACG CGGGCCATTC TCCGTATTGT CGCCAGGATT TGATTATTCT
CCATATCCGC AGGCCAGAAC AATTGTGGTA GGACTAAACG CAGCATTTTA A
 
Protein sequence
MDRGQQYKWL AGQSYLTIRI FILYFGYVHL LIKLKIKPYN MLKIYNKTKR PWRIPCWGIA 
MLIFLTSFMP GRVLSQDLKL ITGTVTEANG APLPGVSVKV KGTQKATSTS NAGKYNIQAK
STDVLVFSFV GSETQEVTVG ARTTINVKLA DDAKSLDETV IIGYQTVKKT DLTGAIGQVK
MSDLNKAPVS SFAEALAGRV SGVQVSSGDG QPGSNMSIMI RGAASINNST QPLYVIDGFA
TEAVESTSLN PDDIESITIL KDAAATSVYG SRATNGVVVI QTKSGKVGKP VISYNSSYGA
QIVAKEIDLM SPYEFVKYQF ERNPTDAAAN YLTNGKTLES YKNDKGIYWP GQILRTGSVF
INNIAVRGGS ENTKYAISGG GFDQKGVLIN TGYKRYQGRA KLDQTISNKL KVGISADYMD
VSAFGVQASS IPTGGGASSA IWFRTWVYRP TGGSATTYDI LNEDGDPESI NSSDIRLNPR
VTAENEYSYN TYVNFNANAY LTYNVTKDLI LKVTGVKNAL RRGQDRFYNS KTPQASPLNP
LATQGIFGSV LQSFADTWSN ENTLTYNKLF NDTHSLTVIG GNSQTSYKSK SNGFTSTFLP
EESQGMAGLN EGTVTAPVAT ASSNTLSSFF GIVDYNYKSK YYLKAGLRAD GSSKFSQRWG
YFPSGAIAWN MHKEDFMKDL TFISNSKLRL SYGVTGNNRI GDFDWYAKLE LDAGAGYSYN
NSPNIGAYIS GVENRNLKWE KTESSDIGYE LGLFDNKIEL TVDAYRRTTK DLLITNAPIA
AHTGFATATK NIGSLRNQGL EFGLNTVNIK TRSFTWESSF NITFNKNKII ALNRDQGFIQ
TTPSFETAFT DMYLSEVGQP LGMMYGYVWD GNYQYADFDN PSPGTYILKP SVPTNGASVI
QPGDIKYKDL NGDGTVNSLD RTIIGRGQPI HFGGFSNNFG YKNFSLNVFL QWSYGNDIYN
ANRLILEGNA NGRTDLNQFA SYIDRWSPTN QNSKNYRAGG HGQVGYHSSR VVEDGSYLRL
KTVSLAYALP AKYIKRLFLS SLSLNVSAQN LFVLTKYTGI DPEVSTRGPF SVLSPGFDYS
PYPQARTIVV GLNAAF