Gene Phep_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3966 
Symbol 
ID8255100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4773470 
End bp4776922 
Gene Length3453 bp 
Protein Length1150 aa 
Translation table11 
GC content42% 
IMG OID644937630 
ProductTonB-dependent receptor plug 
Protein accessionYP_003094219 
Protein GI255533847 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0876858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAGAA AAATAACCAC AAACCTGGGT ATAATAAATT CTTATATCCG TAAAATTTGG 
CTAATTATGC GATTAACCAC CGTAATACTA ATAGCTGCCC TGATGCAGGT AAGTGCTGCC
GGCCTTGCAC AGAAAATTAC TATTTCAAAA AAACAAGCAG ACTTACGTGC TGTACTAAAA
GTGCTAAGGA CTCAGAGTGG CTATAATTTT GTGTATGCAG ACAATGCGCT CTCTCAGGCT
AAGCCTGTTG ACATTTCTGT AAAAGCTGCC GACTTTAAAG ATGTTTTAGC GCAAATATTT
GCCAATCAGC CTTTAACTTA TAAGATTGAC AATAACGTTA TCATTGTTCA AATAAAGCCA
AGTGTGATGA TGCAACCTCT TAGAAAAAAT ATCAGCGTTT CAGGTGTGAT CAATGATGAA
GAGGGGAATC CATTGCAGGG CGCAGGCGTA AGAATGAAAG GTGGGGATAA AAAAGCAGTT
GCTGATAAGA ACGGACGCTA TAGTATAGAG GTTCCGGATA ATGCAATACT GGTTTTTACC
TACCTCGGTT TTGATGACCG GGAAATGCGG GTTGAAGGAA GACAAACCAT TAACGTTACA
CTAAAACCGC AGATCAGCAA ACTGGATGAG GTTTTAGTGG TAGGATATGG ATCCGTACGA
AAAGTTGATC TTACAGGTTC TGTTGCGCAG GTAAATATCC GTGATATGAC TAAAGCTCCG
GTGTCTTCTA TAGAACAGGC ATTGGCGGGC CGTGTGGCTG GTTTAAATGT TTCTGCCAGT
CAGGGACAGC CCGGAGAAGA AGGGATCAAT ATACGGATCA GAGGAGTGGG CTCCATCACA
CAGGATGCGT CCCCTTTGTA CGTCATTGAT GGCTTCGCAA CAGAGAACTT TGATCTTTCT
ATGTTGAACC CTGACGATGT GGAATCCATA AATGTGCTCA AAGATGCTTC CGGAACTGCG
ATCTACGGCT CCAGAGGTGC AAATGGTGTC ATTGTTGTAG AAACAAAAAA GGGTAAAGTG
GGTAAACCCA ATTTGAATTA TTCAGGATCC TACGGTTTTC AGGATGTGAC TAAAAGGGTT
GAGGTATTAA GTCCTTATGA GTTTGTTAAA TTGGAATTTG AGCGGGATTC GGTAACTGCC
AGGACGCTGT ATCTCCCGGA CGGCGTAACC TATGACTCCT ATAAAAATCT GGAAGGGATC
AACTGGCAGG ATCAGTTCTT CACCAAAGGT GTAACCAATA TTCACAATTT ATCACTGCGG
GGAGGGAATA AAGATACCAG GTATGCCATT TCAGGATCCT TGTTTGAGAC CGGCTCTGTG
GTAGTAAATA CGGGTTTGAA GAAACAACAG GGACGCATTA CTACAGATCA GAACGTCAGC
AAAAAATTGA AAGCTGGCCT GAGCACAAAT TATACACACA CGCAGACCTA TGGCCAGATC
GCTTCTTCTA TAAAAGATGG CTATGCATCG AGTGCGCTTC TTTATTCAGT ATGGGGATAC
AGGCCAGTTA ACGCCAGATA TTTGACAGGC GACATAGATC TGGAAGAAGA ACTTTTTGAT
GAAGGTGTAA CCAGCACTAC AGATTACCGC TTCAACCCGG TGCAGTCTGC AAAGAATGTT
TACAGAAATA AGGACAACAA TAACCTGGTT GCCAATGGGT ATTTTAATTA TGACATCACT
AATAATCTGG TCTTCAAATC TACCGGTGCC ATTAACCTGA ATACCATTCA ATTGGGTACA
TTCTATAACT CCAATACGAG CGAAGGTAGC CCCATAAGCC CTAGAAATAA CTATGGACAG
TGGGGGTCAA TGTCCTATAG CAGCCGCACA ACCTGGTCTA ACGAAAATAC CCTTACCTAT
AGCAAGGTAA ACAAGGTTCA TTCCTTAAAC GTATTGGGTG GTTTTAGTTT CCAAAAGGAA
AGCAGCAAAG GAAATAACTA TACAGCAATT AAAGTTCCTA ATGAGACTTT GGGCATAGAC
GGATTGGGGC AAGGTACGCC GCTGGCTGTA GGCTCTTTCG GCACTTATAA TACCCTGCAA
TCTTATTTCA GCAGAGTAAA TTATGGGTAT CAATCCAGAT ACCTGTTCAC AGCAACATTC
AGGGCCGACG GGTCGTCTAA ATTTCCGAAT AACAAATGGG GTTACTTTCC ATCAGGAGCT
TTTGCCTGGA AAATGAAGGA AGAGTCTTTT CTTAAAGGAA TTGAAACCAT TACAGAAGCC
AAATTACGTT TGAGTTACGG CCTTACCGGG AACAACAGAG TAGGGGATTT TTCTGCGCTT
TCACCAATCA ATGTAGACAA CTCTACTGGT TATTCTTTTG GGAATGAGGT GCCAACACCA
GCAGCCATCC CTACGATTGG CAATCCCAAA TTAAGATGGG AAACTACTGC GCAGCTCAAT
TTAGGATACG ATTTAGCTTT ATTGAAGAAC AGGATTGAGC TGGTGGTAGA TCTATATCGA
AAGAAAACAG ACAATTTGTT GCTGCTGGCC AATATGTCGC CAAGCACAGG GTATGCCCGT
GCCTACAAAA ACGTAGGTAA ATTAAAGAAT GAGGGTTTGG AATTTACCTT GAATACGGTG
AATATCAATA AGCGGGATTT TACCTGGCGG AGTAATTTTA ACATCAGCTT CAACCGGAAT
ACCATTTTAC AACTAGCTGA TGAAGAGGAA CGCTTGTTGA ATAGTATCAC ACCCAGATGG
CAATCCGGAT ATGCTGACCC GGTATTGTAT AGTGCAGCAA TAGGTCATTC AGTAGGTAAT
TTCGTTGGGT ACATCTTTGA TGGGATTTAT CAGTACGATG ATTTTTATCA ATTGCCCAAT
GGCGGTTACC GCCTTAAAAA TGCTGTGCCG ACAAATGGGA TGGCCAGAGA GGTCATCAAA
CCAGGATATA TCCGCTATAA AGACCTGAAC AATGACGGAG TCATCACAGC GGCCGATCAA
ACCATTATCG GCAGGGGTTT ACCTATTCAT TCTGGTGGTT TTTCCAACAA TTTTACTTAT
AAAAACTTAA GTCTGAACGT CTTCTTGCAA TGGTCTTATG GCAATGATGT ATACAATGCA
AACAAACTGA TTTTTGAGGC AAAAGCATAC AGCATGCTGA ATCAATATGC GGGTTATGCG
AATCGGTGGT CGCCTGCAAA TCCGAGTACG ACCATACCGG TTGTGGGTGG GGTTCCTGAG
GGCTATTATT CGACCAGGGA ATTGGAAGAT GCTTCCTATT TAAGGTTGAA AACTGTGGAA
TTGGCTTACA GCGTACCAGT GAATTACCTG AAAAAATTAG GACTTAAAGA GGTCATCCTG
ACAGCTTCTG CACAGAATCT GCTTACATGG ACCAACTATT CAGGAATGGA TCCGGAAGTG
TCCACGCGCA ATTCCATCTT GACTCCAGGG TTTGACTATT CTGCATTCCC TATTGCGAAA
ACATTAGTTT TTGGACTAAA GGCTTCATTA TAA
 
Protein sequence
MYRKITTNLG IINSYIRKIW LIMRLTTVIL IAALMQVSAA GLAQKITISK KQADLRAVLK 
VLRTQSGYNF VYADNALSQA KPVDISVKAA DFKDVLAQIF ANQPLTYKID NNVIIVQIKP
SVMMQPLRKN ISVSGVINDE EGNPLQGAGV RMKGGDKKAV ADKNGRYSIE VPDNAILVFT
YLGFDDREMR VEGRQTINVT LKPQISKLDE VLVVGYGSVR KVDLTGSVAQ VNIRDMTKAP
VSSIEQALAG RVAGLNVSAS QGQPGEEGIN IRIRGVGSIT QDASPLYVID GFATENFDLS
MLNPDDVESI NVLKDASGTA IYGSRGANGV IVVETKKGKV GKPNLNYSGS YGFQDVTKRV
EVLSPYEFVK LEFERDSVTA RTLYLPDGVT YDSYKNLEGI NWQDQFFTKG VTNIHNLSLR
GGNKDTRYAI SGSLFETGSV VVNTGLKKQQ GRITTDQNVS KKLKAGLSTN YTHTQTYGQI
ASSIKDGYAS SALLYSVWGY RPVNARYLTG DIDLEEELFD EGVTSTTDYR FNPVQSAKNV
YRNKDNNNLV ANGYFNYDIT NNLVFKSTGA INLNTIQLGT FYNSNTSEGS PISPRNNYGQ
WGSMSYSSRT TWSNENTLTY SKVNKVHSLN VLGGFSFQKE SSKGNNYTAI KVPNETLGID
GLGQGTPLAV GSFGTYNTLQ SYFSRVNYGY QSRYLFTATF RADGSSKFPN NKWGYFPSGA
FAWKMKEESF LKGIETITEA KLRLSYGLTG NNRVGDFSAL SPINVDNSTG YSFGNEVPTP
AAIPTIGNPK LRWETTAQLN LGYDLALLKN RIELVVDLYR KKTDNLLLLA NMSPSTGYAR
AYKNVGKLKN EGLEFTLNTV NINKRDFTWR SNFNISFNRN TILQLADEEE RLLNSITPRW
QSGYADPVLY SAAIGHSVGN FVGYIFDGIY QYDDFYQLPN GGYRLKNAVP TNGMAREVIK
PGYIRYKDLN NDGVITAADQ TIIGRGLPIH SGGFSNNFTY KNLSLNVFLQ WSYGNDVYNA
NKLIFEAKAY SMLNQYAGYA NRWSPANPST TIPVVGGVPE GYYSTRELED ASYLRLKTVE
LAYSVPVNYL KKLGLKEVIL TASAQNLLTW TNYSGMDPEV STRNSILTPG FDYSAFPIAK
TLVFGLKASL