Gene Phep_2652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2652 
Symbol 
ID8253759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3102884 
End bp3106006 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content41% 
IMG OID644936299 
ProductTonB-dependent receptor plug 
Protein accessionYP_003092915 
Protein GI255532543 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0446774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA CCAAATTATT ATTGTTCTTG ATTTTCACTG CAGCATTGCC TGCCCTTGCT 
CAAAAAACTA TCAAGGGTAA TGTTAAAAAT TCCAGCGGAA TGCCAATGGC CGGAGTTAAT
GTTGCGGTGA AGGGCAGCAC AAAAGGAACA GCAACAAATG CGAACGGAAA TTTTACCATC
GAACAAGTGA AACCATCTGA CGTACTTGTT TTCTCCTACA TGGGATTTCA AACTATTGAA
AAAATGGTAC TGGAGCAGCA AACCATTAAT GTTATCCTTA AAGCGGACGA CCTCAGACTA
TCTGAGGTAG TAGTAATTGG CTATGATGTG GTCAGAAAAA GAGACCTAAC CGGTTCAGTT
GCCGTCATAG ATCCGAAAGA GCTTGCAGCA ACTGCCACAG CAAACTTTGA CCAGGCCCTG
GCCGGAAGAG TTGCTGGCGT ACAGGTTACT TCTAAAGACG GAACACCTGG CAGTCCTCTT
AACATCATCA TCAGAGGAGG GAACTCCATA ACAGGCGACA ACTCTCCGCT GTATGTGGTA
GATGGCGTTC CGCTTGAAGA TTTTGATCCA TCCAGTATCA ATACCCGTGA CATTAAGAGT
TTTGACATTT TGAAAGACGC CTCTGCAACA GCTATTTATG GGTCCAGAGG AGCCAATGGA
GTGGTGGTTA TCACTACCAT AGGTGGACGT AATGACGGAA AAACAGACAT CAATGTTACC
TCTTCAGCCT GGGTTCAGTT CATTCCCAAC CGATTGGAGG TATTAAATCC TTATGAATAT
GTAAAATATC AGCAAAAAAT AGCCTACGCA AATGACAGTT ATGCCCCCGG ACAAAATGTA
GCGATGTTTA CCGCCAACTG GATAGACCCT GAACTGTATA GAAATGAGAA AGGAACGAAC
TGGCAGGACG AAATTTTTCA GACCGCACAG ACCAACAACC ATACAATTTC GTTGAGAGCA
GGCAATAAAA ATACAACTTT GTTGTATAGC GGAAATTATT TAAATCAGGA AGGAACTTTG
ATTACTACCG ATTTTAAAAA GATCAACAAC AGATTGAAAT TTACCCATAA AGTAATTAAT
AATTTTGAAG TTAACGGACA GGTTGAGTAT AGTTACATCA ACTACAATGG AATGGAAGTT
GCCGGCAATA CCAGGAACAG TGTGATCAGA GATGCCATTT CTTTCAGACC TGTAAGTCCT
GTAAATTGGA ATGCAAATGA AGAAAGTGCC ATCGCAGATC AGGACCCATA CTTGTATGAT
CCCGTAAAAA CATTAAAAAA TACAGAACGC AAGCGTGTGG ATGATGTCCT TTCCGGAACT
TTAGGATTTA ACTATAACTT CTTAAAGAAA TTTGACCTGA GTGTTAGTGG AAACTATAGA
ACATCGATTA CAGAAAATGA CATCTTCTAT AAAAAAGACA CACAGGAAGC CACCAGGACA
AACAGGGGAA TAAATGGCAC CATTACAGAC AAAAGATTCA ATACCCTGTC TACTTCCAAC
ACTTTAAGGT TTAAAGATCA AAAGGATAAA CATGCTTACG GTGCACTTCT AGGGTTTGAA
GCGCAGTACA GAGCTTATGA ATTTTCGCAA CTGAGCAATA CAAACTTGCC AACAGACCAA
TTTGGCATAC ACAACCTGGG CATTGCAACC ACTGCTACCA TCGCACAGAC CTTATATTCT
AAAAATGCCT TGTTGTCATT CTTTGGAAGG ATAAATTATA CGTACAACGA TCGTTACTTG
GCAACAGTTA ATTTTAGAAC AGATGGATCA TCAAAGTTCA GAAAAGAAAA CAGGTGGGGA
TACTTCCCCT CCTTCTCCCT GGCCTGGAAA CTGTCGGAAG AGGATTTCCT TAAATCCAGC
GAGCTGATTA CGGATCTGAA GCTGAGGGGC GGCTGGGGCG TAACCGGAAA TAACCGTATC
GGCGACTTTG ATGCTTATAA CTTGTTTTCT GTAAACTCAT CAAGCGGATA TATCTTGGGT
GTAGACCAGA ACTTTTCACC AGGTGCTTAC CAAAGTAATA TGGCCGTACC GGATCTGCGC
TGGGAAACCA CTGCCCAAAC CAATATTGGG CTGGATTTAG AACTGTCTAA AAGATTCAGC
ATCGCTGCTG ATTATTACAA TAAAAACACC CGGGATTTAT TATTAAATGC TGACATGGCC
CTAAGTACCG GTTTCGATAA GGTGCAACAA AACGTTGGCG CAGTTTCTAA CAGAGGATTC
GAGTTTACTT TTAACTCGCA AAATTTCAGG AACAAGAACT TCAGCTGGAC TACCAATTTC
AACATCGCGT TTAACAAAAC CAAAACCCTC CGGTTAAATA GTGGTCAGAA TGAAATCCTG
ACCGATCCAC AATGGGATCT GCAGTTTATG CAATCCGAAT ATCAATACGT AACAAGAGTG
GGGCAGCCTG TAGGTATGAT GTATGGTCTT GAATTTGACG GCATTTACCA GGTGGATGAT
TTTGTACTGA CAAATGGTTC TTATCAGCTG AAAGACGGCC AACCTACTTA CAGAACGGTG
ATGAGACCCG GCATGGTAAA ATTTAAGGAC TTGAATAACG ATGGTGTAAT CAATCAGAGC
GACAGAAAAA TTATTGGAAA CCCTTACCCA AAACATACAG GCGGCTTATT CAATAATTTC
AGGTACAAAT CTTTTGATTT TCAGTTCCTA CTTCAATGGT CGTACGATTT CGACATCCTG
AACGGCAATG CATCAGAATT CGGCAGCATT TATCAGACCA ACAGAAATGG ACTAAAGTCA
CTTAACAAAA TCTGGACGCC CACTAACCCG GAAACAAACA TCGGTGGCAT GAGATACGAC
GGTGTAAATT TGCTGACTCC TTTTGGTTAC AAATTAGATT CACGTCATAT AGAAGATGGT
TCTTATTTAA AACTTAAAAC GGCAGCACTG GGCTATAACT TCTCCAGTCA GTTGTTAAAG
AAATTTAGTA TTAAGAAGTG CAGGCTCTCG CTTTCTGCTC AGAATCTTTA TACCTGGACC
AAGTATACCG GTTATGATCC GGATGTATCT GTAGGAAGAT ACGGGGCCCT AACCCCCGGA
TTGGATTATT CCGCATACCC GCAAAGTGTG ACCATATCAG GCGGAATAGA CTTTACATTT
TAG
 
Protein sequence
MKLTKLLLFL IFTAALPALA QKTIKGNVKN SSGMPMAGVN VAVKGSTKGT ATNANGNFTI 
EQVKPSDVLV FSYMGFQTIE KMVLEQQTIN VILKADDLRL SEVVVIGYDV VRKRDLTGSV
AVIDPKELAA TATANFDQAL AGRVAGVQVT SKDGTPGSPL NIIIRGGNSI TGDNSPLYVV
DGVPLEDFDP SSINTRDIKS FDILKDASAT AIYGSRGANG VVVITTIGGR NDGKTDINVT
SSAWVQFIPN RLEVLNPYEY VKYQQKIAYA NDSYAPGQNV AMFTANWIDP ELYRNEKGTN
WQDEIFQTAQ TNNHTISLRA GNKNTTLLYS GNYLNQEGTL ITTDFKKINN RLKFTHKVIN
NFEVNGQVEY SYINYNGMEV AGNTRNSVIR DAISFRPVSP VNWNANEESA IADQDPYLYD
PVKTLKNTER KRVDDVLSGT LGFNYNFLKK FDLSVSGNYR TSITENDIFY KKDTQEATRT
NRGINGTITD KRFNTLSTSN TLRFKDQKDK HAYGALLGFE AQYRAYEFSQ LSNTNLPTDQ
FGIHNLGIAT TATIAQTLYS KNALLSFFGR INYTYNDRYL ATVNFRTDGS SKFRKENRWG
YFPSFSLAWK LSEEDFLKSS ELITDLKLRG GWGVTGNNRI GDFDAYNLFS VNSSSGYILG
VDQNFSPGAY QSNMAVPDLR WETTAQTNIG LDLELSKRFS IAADYYNKNT RDLLLNADMA
LSTGFDKVQQ NVGAVSNRGF EFTFNSQNFR NKNFSWTTNF NIAFNKTKTL RLNSGQNEIL
TDPQWDLQFM QSEYQYVTRV GQPVGMMYGL EFDGIYQVDD FVLTNGSYQL KDGQPTYRTV
MRPGMVKFKD LNNDGVINQS DRKIIGNPYP KHTGGLFNNF RYKSFDFQFL LQWSYDFDIL
NGNASEFGSI YQTNRNGLKS LNKIWTPTNP ETNIGGMRYD GVNLLTPFGY KLDSRHIEDG
SYLKLKTAAL GYNFSSQLLK KFSIKKCRLS LSAQNLYTWT KYTGYDPDVS VGRYGALTPG
LDYSAYPQSV TISGGIDFTF