Gene Phep_3110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3110 
Symbol 
ID8254228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3714637 
End bp3717468 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content42% 
IMG OID644936764 
ProductTonB-dependent receptor 
Protein accessionYP_003093369 
Protein GI255532997 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT CTATTCTTAA ATCATATCTC ATGTGGATAT TAGCGGTTAT CATTTCCGTC 
GCAGGGATAA GTGTTCCTGC ACATGCCCAA ACCGTGGGCA ACATCAGGGG TAAAGTGATC
GATGAGCAAG GAGGCGCATT GCCCGGAGCA TCTGTAAAGA TCAAGGGTAC AAATAAAGGT
GTTTCCAGTA GTACCACTGG TGATTTTCAA CTGAACGATC TGCAGCCAGG CAATTATGTA
GTTACCGTAT TTTATATGGG ATATAGCCCT GTTGAAACCA ATGTTGAACT GAAAGCCGGA
CAAACACTGG TACAAAACAT CCGGCTGGTA GCCAGCAGTG TTGCTTTAAA GGGTGTTACA
GTATCCGGGG TAATAGAAGG ACAGCAGAAA GCCCTGAACC AGCAAAGAAA TGCCGACAAC
ATCAAGCAGG TGATCTCGGC CGATCTGATG GGACGTTTCC CTGATCTCAA CGTAGCTGAA
TCGCTGCAAA GGCTTCCGGG TGTAACAATC GGCCGTGAGC AGGGTGAAGG CTCAACGGTA
CAGCTGCGTG GTACACCCGG TGGCTATACC AATATCAATA TCAATGGCGA ACAGATTATG
GGTACGCAGG AGCTTGGCCA GCGTAATGCA CAGCTGGACC TGATCCCGGC AAATGTGCTG
GCCTCTATGG AGGTTATCAA AACCCTGACC CCTGACCTGG ATGGGGATGC CATTGCGGGT
GCAATTAACT TAAAAACACC TACTGCCATC AGTTTAAAAC CACAGTTGTC GGTCGACTTA
GGTGGTGGTT ATAACAAGCT GCGCAACAAT TCAAACGGCA TTGGCAACAT CAGCTTTGGC
CGTCGTTATG GTGCAACGGA TGATATGCCC AACGGTAAAC TGGGGGTTAC GGTATCTGGT
AGTTATTACA AAACAAACAA TGGCTATGAT GAGATCAATG CACAGGCCTG GCAGAAAAAA
GATTTTATTG GTAACAAGGA CTCCATCTAT TTCCCTACAG ATATCCGTCT CGTTTATCTG
GAAAATGAAC GTACCCGTAT GGGGGCAACC ACCACCATCG ATTATAGCTT TAGCCCAACT
ACTTCCATTG TGGCGAATTT AACCTTTAAC AGTCTGGACA ACGATGCTAC CCGCTACCGC
AAACGTACAC GTATGCAAAC CGCCAATACC ACTTTTGCCA ATGGTGTTTA TACCACCACA
AGAGGCAGGG GCTACAATGA GGTGCTGGAC AGACAAATGC GAAACAGCAA TATCAATTTC
AGTTTGGAAG GTGAAACCAT GTTAAGTAAG GTTAAACTAG ATGGAGGAAT ATTTTTAACG
GCTTCTAATT TTGATCAGCG TGCTGCGGCT TTTAATTATA TTACCGGTAA TGTTCCTTTA
ACGATTACCG ATATATCTGG TGATTATATC CAGGCAACAG GTTCCACTGA TGCCAAAAAT
AACGCAGCTT TATATAATTA TAATACCATT GAAGCGAACG ATTTTAAAAC ACAAGGCCGC
AATGTTGTCG CACGCTTAAA CCTTACTTAT CCCTACAAAA TAGGAGATAA TGATGCTTAT
TTTAAAATGG GTGCAAAAGT AAAAAGGATG AGCAACAAGC GTTTCAGACC CTCAAGTACA
TTTGTTGCCA ACTATAGCGG GCCTGCTGGC GTAGGTAGCC TCAATAATTT TAAAGGAAAT
TCAGAACTCG ATGCAGATTT TCTGGATAAC AACATCAATT TCGGTTTGAA CGTGGATAAG
GATGCTACTA TAGACTTTTT TTACAACAAT CCTTCTTATT TTACACAGAA TGCAGACCAG
AAGAAAATAT CAATTGATGC TTACTTCTAT GATGCACAGG AGAATGTAGT AGCCGGATAC
CTGATGAATC GTATTCAATT TAAGCGGTTA ATGCTGTTAG GTGGTTTGAG GGTAGAAAGA
ACAGATGTAG ATTATGACGC CAAACTGGTT AATCAGGATA AAGATGGGTT TCTCACTTCT
TCTGTGCCTG TTAATTCTAA ATACAACTAT ACCAAATATT TACCAAACCT GCAGGGTAAA
TATGACCTGG CTAAAAATAC AGTAGCACGT GGCGCTGTAT CTTTTGGTTA TTCCAGGCCT
AATTTTAATG ATCTGGTACC CAGTCGTGTA GTAAGTATCC TGGCCCAAAC ATTAACTGAT
GGTAACCCGG ATCTGAAACC TGCATTTGCT ACCAATTACG ATCTGTCTAT TGAGCAATAT
TTAAGTAACC TGGGCATCCT TTCTGTTGGT GCATTTTATA AACACATTGA TAAGTTTCAG
TACAATAGCG TGATTAATTT AACCGGTACT GAGTTTCCTG ATGCAGCTGC ATATAAAGGT
TACCAATATT TCAAAGCTTA CAATGGAGAT CTGGCTAAAG TATTTGGTGT AGAAGTTAAC
GCACAAACCA ATCTTACCTT TCTTCCTGGC ATTCTGAAAG GTATTTCCCT GTATGCAAAC
TATACCTATA CCCATTCCAG AGCTGATGCT TTGGGCAGAA CTAAGCTTCG CTTACCAGGA
CAAGCAGATC ATACCGCCAA TGGTTCGGTT TCTTACACTT TAAAAGGTTT TACACTGCAG
GGAAATCTGA ACTATAATGG TGCTTATACT TCAACATTAG GTACAGATGA TGCTACTGAT
GTAATCAGGG CTGCGCGTTA CCAGTTAGAT GTGAATGCTT CTTACCGCAT CACCAAAAAG
CTGACCATTT ATGCTGAGGG GGTGAACTTA ACCAACCAGC CTCAGGTAGA ATACTTTGGC
GAGCGTTCAA GAATTTTTAC CAATACATTT TATGATTTTT CGGCCCGTGC CGGATTAAAA
TACCGTTATT AA
 
Protein sequence
MKKSILKSYL MWILAVIISV AGISVPAHAQ TVGNIRGKVI DEQGGALPGA SVKIKGTNKG 
VSSSTTGDFQ LNDLQPGNYV VTVFYMGYSP VETNVELKAG QTLVQNIRLV ASSVALKGVT
VSGVIEGQQK ALNQQRNADN IKQVISADLM GRFPDLNVAE SLQRLPGVTI GREQGEGSTV
QLRGTPGGYT NININGEQIM GTQELGQRNA QLDLIPANVL ASMEVIKTLT PDLDGDAIAG
AINLKTPTAI SLKPQLSVDL GGGYNKLRNN SNGIGNISFG RRYGATDDMP NGKLGVTVSG
SYYKTNNGYD EINAQAWQKK DFIGNKDSIY FPTDIRLVYL ENERTRMGAT TTIDYSFSPT
TSIVANLTFN SLDNDATRYR KRTRMQTANT TFANGVYTTT RGRGYNEVLD RQMRNSNINF
SLEGETMLSK VKLDGGIFLT ASNFDQRAAA FNYITGNVPL TITDISGDYI QATGSTDAKN
NAALYNYNTI EANDFKTQGR NVVARLNLTY PYKIGDNDAY FKMGAKVKRM SNKRFRPSST
FVANYSGPAG VGSLNNFKGN SELDADFLDN NINFGLNVDK DATIDFFYNN PSYFTQNADQ
KKISIDAYFY DAQENVVAGY LMNRIQFKRL MLLGGLRVER TDVDYDAKLV NQDKDGFLTS
SVPVNSKYNY TKYLPNLQGK YDLAKNTVAR GAVSFGYSRP NFNDLVPSRV VSILAQTLTD
GNPDLKPAFA TNYDLSIEQY LSNLGILSVG AFYKHIDKFQ YNSVINLTGT EFPDAAAYKG
YQYFKAYNGD LAKVFGVEVN AQTNLTFLPG ILKGISLYAN YTYTHSRADA LGRTKLRLPG
QADHTANGSV SYTLKGFTLQ GNLNYNGAYT STLGTDDATD VIRAARYQLD VNASYRITKK
LTIYAEGVNL TNQPQVEYFG ERSRIFTNTF YDFSARAGLK YRY