Gene Phep_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1126 
Symbol 
ID8252220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1316160 
End bp1319330 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content44% 
IMG OID644934777 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091406 
Protein GI255531034 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00950151 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGAGAT TTTTAATACT ATTAGTTTTT AGCTTGGCTT GTATGAGCAC ATATGCTCAA 
AATGTACAAT TAAGCGGTAC TGTAACCGAT AAAGATGGAG TTTCACTTCC TGGTGTCAGT
ATTCTTGTTG AAGGCACCAA AATTGGCGCA GTTTCAAATG CAAAAGGCCA ATATTCCATT
AGCCTGCCTA ATCCTAACTC AGTTATTGTT TTTAGTTTTT TGGGCTATAT TTCTCAGAAC
GTAAACGCGA AAGGCCGCTC AAAAATAGAT GTTAGCTTAT CAGAAGATTC TAAATCCTTA
GATGAAGTCA TAATGGTAGG TTACGGCGCC CAGAAAAAGG CAAGTGTAAT TGGAGCCATT
TCCAATATTA ACATGAAGGA ACTTAGAAAA GCAGCACCTT CAAATTTGAG CAATGCGCTC
GGGGGCCGTG TTCCTGGTAT AATTAGCAGA ATGGGAGATG GAACACCGGG GGGTGTGCAG
AATAGATTCT CGAATGGAAA CGCTGATGAT GCCCAAATTT ATTTACGCGG AAGAGCCAGT
ATGAACAATA CAAGTGCCCT GGTCCTGATC GATGGTGTGG AAGGTTCCTT GTCCAGAATA
AATCCGGAAG ATATTGAGCA GTTCAGTGTG CTGAAAGATG CATCTGCAAC AGCAGTTTAC
GGTGTACGAG GGGCTAATGG CGTGATCCTG ATCACGACCA GAAAAGGTAG CATAGGGGCA
CCAAAGATCG GTATAACCAG CCAGATCAGG ATGCAAAAGG TATTGGATTT TCCGAATTTT
CTAAGATCCT ATGATTTCGC AATGTTAAAT AACGAGGCTC GTAAAAATCA GGGTCTTCCT
GAAATTTATT CAGCGGAAGA TCTGGAGCAT TACCGTACCG GTGATGATCC TTATGGCTGG
CCGGATGTTG ACTGGAAAGA AGTATTGCTG AAAGACCAGT TTTATGAACA ACAGTACGTT
GGAAATGTTT ACGGTGGGAC AGAACGGGTA TCCTATTATT TATCCGGTGA GTATAACCAG
TCGGGTGGCG CGTTCATTGA AAATAAAGAG AAGAATACAC AGCATAGGTA CAGACGTTAT
AATCTTAGGA CCAACCTTGA TTTTAAGATC ACTAAAACTA CTGATCTGGG TGTGAAATTG
AATGGCAGGT TGAATGACCT TCATTACCCA CTAAAAGGTG AAAGTAGCGG ACAGCGGGTA
ACTGGTCCCG GATGGAGCGA TATTACGGCA AGAGCCCCGC TCACTGCCCC GGTATACAAC
CCTAACGGTA CTTATGCAAA TGGTGGTTTG AACCTGCCGG GTAACCCTGT GGCTGAGTAT
ATGGAGGGCG GTTTTGCCCA GCGGCTGCAA AGCGGTCTGG AATCCAACTT TACGCTGAAC
CAGAAGTTGG ATTTTGTAAC ACCCGGGCTT TCATTCAGGG GCTTATTTGC TGCGAACTTT
GGATCGGGCA GTGCAAAAGC ATTAAATTCA AGGAGTGCTG AGATCTGGAC GTATGATAAA
ACTACCAAGA CCTATACATT AACGGCCGGA GCCGGAATTC CAACCTATAC ACTGGGTAGT
AATTTCAGTG ATTTCAACCG TATACAACAG GTTGAAGCTG CCTTGAACTA TGATAAGGCG
ATTGGCATGA ACCACAAAAT AACCGCCATG GCCATTGCTA CCCAGACCAC CAAAGAGGCT
TCGTTCATTG TTCCTACTAT TTTCAAAGGA ATGGCAGGCA GGCTTACCTA TGCTTATAAG
GATAAATACC TTGCTGAAGG AAATGTAGGC TATAATGGTT CTGATGCATT TAGTAAATCG
AAACGTTATG CATTTTTTCC TTCCGGTGCC CTTGGATGGG TAGCTTCAGA AGAAAGTTTT
ATAAAGGATA ATGTTAAGTT TCTGGATTTT CTGAAGTTCA GGGGCTCCTA TGGTGAAGTA
GGGAATGACA GGCTCGGTTT CGGGTACAGC AACTTGTATA TCTATTCATT TAGAAACCCG
CTCGCGGCTG AGACTCCCGG TACCTCTACT ACAGTTAACG GCTATTATAG CCTGGGAACT
ACGCCTACTC AAATCCTTCC GATCTTAGAA GGAACGTTGG GAAATCCAAA CGTGACCTGG
GAGGTTGCCC GCAAGGCAGA TATCGGGGTG GAGGCAAAAT TGTTTAAAAG CCGTCTCAGC
TTTGAAGCAG ATGTATTTCT GGAAAAGCGT GATGATATTC TGATCAACAG ATTCGATATA
CCGTTGATTT CTGGTTTAGT ACCAGCAAAG CTTCCTGCAT TGAATGCCGG AAAGGCAACA
AACAAAGGAT ATGAATTATC GCTTAGTTAT TCTGACAATA TTGGTGGCTT TGGTTTTACA
GTAGGGGGTA ATTATACTTT TGTCCGCAAT ACCATCGATT ACATGGCTGA AACGCCGAAA
AAATACCCAT GGCAGGAACA GACGGGCAAA CAAATTGGCA TGCTTGCACC TCAATTTATC
TGGACGGGTA AATTTTACAG TGAAGAGGAT TTGACCAATA ATGCTGTTCC CAAACCGGTT
GCAAAAGTTT GGGCCGGCGA ACTGATGTTT AAAGATCTGA ATGGCGATGG TAAAATTGAC
TCAGATGATA AGGCATATAC TGGTTATGGT CAGATTCCGG AGAAGATATT TGGCATTAAC
CTGAATATGG ATTATAAGAA TTTTTATTTG AATACGTTCT GGCAAGGTGC ATCCAACGTG
GTCATTAACC CTACTGCCGG GATGCGGCTT GAATATGCTG GTTATGGATA CAATGTTCAG
GAGTTCCATA AGGAAGATCG TTGGGTATAT GATCCTTCCC GCGGATTGGA TACGCGGGCA
ACGGCAAAAT ATCCCCTATT GATGCTCGGA GGCGCACCGC AAACCAGGGA GCTTTCTACC
TTTCATGTGC TGAATGGCGA GTATTTACGT TTGAAAGCAG CTGAATTTGG GTATACTTTT
CCTAAAACCC TAATCACAAA GCTTCATATA GCAGACCTGA GAGTGTTTGT AAGTGGTTCA
AATCTGCTGA CCTTCTCTCA TTTAAAGAGA TATCACATCG ATCCTGAATA TCTTGGAAAC
AATATACCGG GTCAGATGGT TGCTGGTCAG GGTGAGGCCA ATGGACTTGG ATCCGGAGCA
TGGTCGCCCC AAAATAAATT TTATGCCTTT GGGCTTAACG TTACTTTTTA G
 
Protein sequence
MRRFLILLVF SLACMSTYAQ NVQLSGTVTD KDGVSLPGVS ILVEGTKIGA VSNAKGQYSI 
SLPNPNSVIV FSFLGYISQN VNAKGRSKID VSLSEDSKSL DEVIMVGYGA QKKASVIGAI
SNINMKELRK AAPSNLSNAL GGRVPGIISR MGDGTPGGVQ NRFSNGNADD AQIYLRGRAS
MNNTSALVLI DGVEGSLSRI NPEDIEQFSV LKDASATAVY GVRGANGVIL ITTRKGSIGA
PKIGITSQIR MQKVLDFPNF LRSYDFAMLN NEARKNQGLP EIYSAEDLEH YRTGDDPYGW
PDVDWKEVLL KDQFYEQQYV GNVYGGTERV SYYLSGEYNQ SGGAFIENKE KNTQHRYRRY
NLRTNLDFKI TKTTDLGVKL NGRLNDLHYP LKGESSGQRV TGPGWSDITA RAPLTAPVYN
PNGTYANGGL NLPGNPVAEY MEGGFAQRLQ SGLESNFTLN QKLDFVTPGL SFRGLFAANF
GSGSAKALNS RSAEIWTYDK TTKTYTLTAG AGIPTYTLGS NFSDFNRIQQ VEAALNYDKA
IGMNHKITAM AIATQTTKEA SFIVPTIFKG MAGRLTYAYK DKYLAEGNVG YNGSDAFSKS
KRYAFFPSGA LGWVASEESF IKDNVKFLDF LKFRGSYGEV GNDRLGFGYS NLYIYSFRNP
LAAETPGTST TVNGYYSLGT TPTQILPILE GTLGNPNVTW EVARKADIGV EAKLFKSRLS
FEADVFLEKR DDILINRFDI PLISGLVPAK LPALNAGKAT NKGYELSLSY SDNIGGFGFT
VGGNYTFVRN TIDYMAETPK KYPWQEQTGK QIGMLAPQFI WTGKFYSEED LTNNAVPKPV
AKVWAGELMF KDLNGDGKID SDDKAYTGYG QIPEKIFGIN LNMDYKNFYL NTFWQGASNV
VINPTAGMRL EYAGYGYNVQ EFHKEDRWVY DPSRGLDTRA TAKYPLLMLG GAPQTRELST
FHVLNGEYLR LKAAEFGYTF PKTLITKLHI ADLRVFVSGS NLLTFSHLKR YHIDPEYLGN
NIPGQMVAGQ GEANGLGSGA WSPQNKFYAF GLNVTF