Gene Phep_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2226 
Symbol 
ID8253332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2572144 
End bp2575497 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content44% 
IMG OID644935875 
ProductTonB-dependent receptor plug 
Protein accessionYP_003092492 
Protein GI255532120 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.377553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AGACCTTTAT TTTTCTGTCC ATGCTGTGCT GTCTTTTTGT TGCGACAAAG 
GTTACAGGCC AAAACGCTGG CCTGCTGATC TCGGGAAAGG TAAGCTCCGG AACGGACAAA
GAAACACTAC CGGGTGTGAG TATAAAGCTG AAGGGAACTA CCATGGGTAC TTCTACTGAT
GTAAACGGCA AATATTTCAT CAAACTGCCA TCCGGTGATG GCGTACTGAT ATTTTCTTAT
ATCGGCCACA GAACAAAAGA AATAGCTGTA AATGGCCAGG CCAGGCTTGA CGTGGTACTG
GAACCGGATG TGAGTACAAT GGATGAGGTT TTAATTGTGG GTTATACACA ACAGTCGAGG
AGCAAGACCA CTGCAGCCAT TTCCAAACTC TCGCCGGATG AACTTAAAAA TACGGCAAAC
CCTAATCCTG TCCAGGCTTT ACAGGGCAAA ATTGCAGGTG TTTCAGTACC GGTTACCACC
GGGCAGCCGG GTGTTGGGGC TACGAACATC ATCATCAGGG GCGGTACCAA GCCCAATGCT
TACGGATCCG GACTGGGCAA CAATAACGGC AGCTCTACAG GTGCTTCTGA CGCCAATAGT
CCGTTAGTAG TAGTAGACGG GGTATTCAGA TCAATGAATG ATGTGAACCC CGATAATATA
GAATCCATGC AGGTAATGAA AGATGCCGCA TCTACGGCTA TTTATGGGGC CAGGGGTGCA
AACGGTGTCA TCGTTATTAA AACAAAAAGT GGAAATGCCA ATGGTAAGAT GAGCATTTCG
CTTAACCACC GTACCACCTG GGAAACACAG GCCCGTGATT ATGACTACCT GAACGCAGAA
GAATACCTTC GGCTGGCACG TACCACGGTC AAAAATACAG CCGATGCACT GGACAAAAAT
AATCTGCTCA ACAATGGTGG TTTTTCGGCC GGAACAAAGG TTTACACCGC CAAAGGACAG
TATGGAAAGA ACATCAACCT GACTGCCTTA TATGACAACA TTGTCTCGGT TGAAGGACAG
GCGTATGTAG ACAACCTATT GTCTAAAGGC TGGAAAGTAA TGGACGATCC CATCAATCCG
GGTACAAGGT TATTGTATGC CGATAACAAC TACCAGGACA TGCTATGGAA TACGGGTTTG
AGCAACAATG AAAATCTGGG TATTAGTGGG GGGAGTGAAA AATCTGACTA TAATTTATCG
ATGAGCTATA CCAATCAGGC AGGGGTATTT GTAGGCACCA AATATAAACG CTATGATGCC
CTTGGAAATT TCGGTTTTAA AGCAGCAGAT AATTTCAGGC TGGATGTGAT GCTCAATTAT
CAGAACGTGA TGCCCAATTA TGTTGATGCC TATCAGAATG ATCTGGTAAG GGCAGTAAGG
ATAACTCCAT TGATCCGTAC TTTTAAAGAT GACGGCAACC CTATGCCCGG TGAATTGTAT
ACCGTTCGTA ACCGCTTTCA CACTTTAAAA TATGACGATA CGCGAACCTC AACCGAACGG
CTGGTATCGA GGGTTGCGGG CGACCTGACC ATTATAAAAG GGTTGCATTT TAAACCGTCA
TTTTCTTATC TGATAGATGA TTACAAGGAA CTGTTTATGA GAAAAGGTAC TCCTGCCGAT
GAGATACAGC CTGCTACCCA ACGTCAGAAA ACGGATTATA CAAGAAACTC CAGACAACTG
ATGATCGATC AGGTATTACA GTATGATTTC AGTTTGCAAA ATGCGCATAA TTTTATGGTG
CTGGCAGGTG TTAACTATAC CAGAAATACC AATCATATTG CCAGTCTTGG TTCTCAAAGA
GGTTCCAATG ATTATATCTA TACCATAGAC GAACCCTCTA CAACCATTGT TAATGGCGTA
GTAGTTACCA ATGTGACCAA TTTTGGTACT TCGATAGGCG AAACAAGGTC TGCCAGTGCA
TTCGGACAAT TCAATTACGA TTATAAAGCT AAATATTTAC TGACCGGCTC TTTAAGGTAT
GATGGTTTTT CTAACTTCAC CCCGGAAAAT AAATATGCCT TTTTTCCTTC TGTTTCTGTA
GGATGGAACA TTCATAAAGA AGAGTTCTGG AATTTAAAGG CAGTAAATGC CTTAAAACTA
AGGGCCAGCT GGGGTACTGC AGGACTTAGC GACCTGAGTA TCACGGATAC TTATGGCGGA
TATACTACAT CAAGCTATGC CTTAGGTTCT GGTATCTTAA GAGCCAATCT GGCCAATCCC
AACCTGAAAT GGGAATCCAC TGCAACAACA GACCTGGCTT TTGACGCTTC TTTCTTTAAC
AGCCGCATTA ACCTGACAGT AGATTATTAT AATAAACTGA CCAAAGACAG GCTAGACTCT
AAACCATTAC CTTCTGAATC TCCTTTTGCC TCTATTACAT TTAACAATGG TGAACTACGG
AACCGTGGTG TTGAGATAGA ACTGGGCGCT GCAATTGTTA AAACCAGGGA TTTTAACTGG
AATACCAATT TATCCTTTTC TTACAACCAG CAGTTGATCA CCAAACTACC CGCAAACGGC
CGTTTGAAGA ACAGACAGGG AGGAGACCTG GTGTTCGATC CGGCATCAAA TGCATTGGTA
GAGAGGGGTG GTTTTGCAGA GGGGGAAAGA CCCTTTCCGA TTTATGCTTA CCGGGTAACA
GGGGTTTTTG CCACTGATGC TGAGGCTGCA GCCTGGAATG CCAAAGTGAA AGACAACCTG
GCTTCACCGC AAGGTATAGC TGTAGGAAAA CGTGGTGGAG ATTTCATTTT TGATGATGTA
AACGGCGATG GTGTAATCGA CACCAAAGAC CAGGTATTTA TGGGCTACAG AAACCCCAAT
AAAATGGGTG GGATGCAAAA TACATTTAAG TACAAAAACC TGAGCCTCAG GTTTACCATG
GATTATGCAT TGGGACATCT GATCAGTAAT GGTGCCCTGG CACGCTCACT GGGACAGGGA
AGGGCTTTTA ATGAAGGAGC TCCTGCTATT GCAATCGGCC CCGATATCTG GCAGAACCAG
GGCGACACAG ATAAAAAATA CGCACGTTTT TCTTTTGCAG ACTTCGATTT TGGACAACGC
AATTATCTGC GCAACGGAAC ACTTGGAAAT AACAACGGTT ACAATTCCGA TGTATCGGCC
ATGTTTTCAA AGGGCGACTT TCTCGCTTTC AGGGAAATTT CAATTGCTTA TGATATCCCT
AAAAAAATAC TCAATAAAAT ACGCGCATCC GGCTTAAATA TTTTTGCAAC GATATATAAT
TTGGGTTATT TAACCGCTTA TGAAGGCTTA AACCCTGAAG TATATACAGG TTTTGATCCG
GGTGGGTATC CAAGACCAAG ACAATTTTTA TTGGGTGCCA CATTAAAGTT CTGA
 
Protein sequence
MKKKTFIFLS MLCCLFVATK VTGQNAGLLI SGKVSSGTDK ETLPGVSIKL KGTTMGTSTD 
VNGKYFIKLP SGDGVLIFSY IGHRTKEIAV NGQARLDVVL EPDVSTMDEV LIVGYTQQSR
SKTTAAISKL SPDELKNTAN PNPVQALQGK IAGVSVPVTT GQPGVGATNI IIRGGTKPNA
YGSGLGNNNG SSTGASDANS PLVVVDGVFR SMNDVNPDNI ESMQVMKDAA STAIYGARGA
NGVIVIKTKS GNANGKMSIS LNHRTTWETQ ARDYDYLNAE EYLRLARTTV KNTADALDKN
NLLNNGGFSA GTKVYTAKGQ YGKNINLTAL YDNIVSVEGQ AYVDNLLSKG WKVMDDPINP
GTRLLYADNN YQDMLWNTGL SNNENLGISG GSEKSDYNLS MSYTNQAGVF VGTKYKRYDA
LGNFGFKAAD NFRLDVMLNY QNVMPNYVDA YQNDLVRAVR ITPLIRTFKD DGNPMPGELY
TVRNRFHTLK YDDTRTSTER LVSRVAGDLT IIKGLHFKPS FSYLIDDYKE LFMRKGTPAD
EIQPATQRQK TDYTRNSRQL MIDQVLQYDF SLQNAHNFMV LAGVNYTRNT NHIASLGSQR
GSNDYIYTID EPSTTIVNGV VVTNVTNFGT SIGETRSASA FGQFNYDYKA KYLLTGSLRY
DGFSNFTPEN KYAFFPSVSV GWNIHKEEFW NLKAVNALKL RASWGTAGLS DLSITDTYGG
YTTSSYALGS GILRANLANP NLKWESTATT DLAFDASFFN SRINLTVDYY NKLTKDRLDS
KPLPSESPFA SITFNNGELR NRGVEIELGA AIVKTRDFNW NTNLSFSYNQ QLITKLPANG
RLKNRQGGDL VFDPASNALV ERGGFAEGER PFPIYAYRVT GVFATDAEAA AWNAKVKDNL
ASPQGIAVGK RGGDFIFDDV NGDGVIDTKD QVFMGYRNPN KMGGMQNTFK YKNLSLRFTM
DYALGHLISN GALARSLGQG RAFNEGAPAI AIGPDIWQNQ GDTDKKYARF SFADFDFGQR
NYLRNGTLGN NNGYNSDVSA MFSKGDFLAF REISIAYDIP KKILNKIRAS GLNIFATIYN
LGYLTAYEGL NPEVYTGFDP GGYPRPRQFL LGATLKF