Gene Phep_3579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3579 
Symbol 
ID8254701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4257656 
End bp4260835 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content42% 
IMG OID644937231 
ProductTonB-dependent receptor 
Protein accessionYP_003093832 
Protein GI255533460 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.780186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TTTTACTCTG TTGGTTGCCA TTAATTCTCT TTCCTTTCTT TAAAGGATAC 
GCTCAGGTTC CGGTTTCCGG AACTGTTAAA GATACACAGG GAGGTGTTCT GCCTGGTGTA
AGTATCAAGC TAAAGGGTAC TACCACTGGA GTAACTACAA CTTCGTCTGG TACTTATAGT
ATTAGTGTAC CTGATGCCAG TGCAGTGCTG GTATTTTCAT TTATCGGCAT GGAAAGCCAG
GAAATCCAGG TGTCTGGAAA AAGGACAATT GACGTTGTAC TTACTGAGCA GATGGCGGCA
CTAAAGGAAG TATTGGTAAT TGGTTACGGT TCTCAATCTC GAGAGACAGT TACCACGTCG
GTGACAAAGC TGGATAATAA GGTGCTTGAA AACGTGCCTT ATGCCAATTT AACATCTGCC
ATGCAGGGAA CCCTGTCAGG TGTTCGGGTG CAAAGTACCT CGGGGCAACC CGGAGATGCT
TCCCGTGTGG TAATCAGAGG CGGAACCTCT ATTAATAATC CGAATGGTGC CGCACCACTT
TATATTGTAG ATGGGGTAAT CAAATCAAAT ATAAATGATA TCAATTCACA GGATATAGAA
TCTATGCAGG TGCTGAAAGA TGCGGCGGCT ACTGCTATTT ATGGGGCCAG AGGTTCAAAT
GGAGTAGTGA TCCTGGTAAC TAAATCAGGT AAATCCGGTA TAGCCAGGAT TAATTATAAT
TATGATCTTA CCATTTCTGA TCTTGGGAAA GGCTACGATA TGGTATCTGC CAGAGATTAT
ATTTATTTTC AAAGGTTAGG AATTGGGGCA AGAGGTACTG CCGATCCCAG CCAGTTGACA
AAACTTGGTC TTGCCAGTAG TGCGGGTACC GGTAATGACC TTACCAATAA TACGGCATTT
ACACCACAGT ATTTATCTGA TGCGAACCGA TATAAACTCA ATGAGGGTTG GGAAAGTATG
CCTGATCCGA TAGACCCTAC TAAAACTATT ATTTTTAAAA ATACAGATTT TCAGGATGTT
GTTTATCGGA CGGCGCTTTC TAACAACCAT ACCTTGTCGG GCTCGGGAGG GACTGATAAA
GCTACTTTCA GTGCGAGCCT TGGATACCAG TCTAATGAGG GGATTGCCAT TTTTACGGAT
TATAAACGAC TTTCATTTAA TTTAAATGGT GATTTTAAGG TAAACGAAAA GCTTAAGATC
TTTGGCAGGG TGATGTACTC CAATTCCTCC GGAAGGACAG TTACGGATGC CGGAAGTAAT
GTGAGTAATG TATTTGCCAG ATCTGCGACT ATACCGGCAA CCACTAAATA TAAATTTGAG
GATGGAACAC TAGCCCCTGG TTTAAATTCA AGTTTAGGTA ACCTGGAGTA TTTTTTTAAT
ACACAGGACT TGAAAAACAG CCTGGAAAAT CTGACGATGG TTACAGGTGC ACATTTTGAT
ATCTTGCCGG GGCTAAGTTT TGACCCGCAA ATCTCTTTAT ATAAAATTAC CTCGGACGGA
CGTTTTTTTC AAAAGGCTTA CCTTAACGGA CCAGGGCAAC TGGTGAATTC AAGAAACGCC
ACCGGTAGTT ATGCCAAACA GATTCAGGAA CAGGCGGATG CGGTTTTTAC TTATAAAAAA
AACCTTAAAG ATGCACACCA TTTAGAAGCG AAAATTGGTT TTTCTGCTTT CTGGAGAACG
ACTGCGGGAT TAAATGCAAG CGGAAGGGGG GCATCTACGG ATCTGATCCC AACTTTAAAT
GCTTCTGCCG TACCCGTATC GGTTGGTGGT GATGAAACTA ACCAATTGAT TTTGGGCTAC
TTTTCTAGAA TTAATTATGA TTATAAAGAG AAATACCTGC TTTCACTAAA CGCCAGGTAC
GACGGGGCCT CTAACCTGGG GACTACGCAC AAATGGGGTC TTTTTCCGGG GGTATCTGTA
GGTTGGAATA TTCATAAAGA AGATTTTTGG ACTGCATTGC CTGAACGTTT ATTCACGCTG
AAACTACGTG GAAGTTATGG CGTAAATGGA AACATCAGTG GTTTGGGCCC TTACCAGTCG
CAAGGGCAGT ATAGTGTAGG TGCTCAATAC AATGGAATAG CCGCAGTTCA AAATACAACG
CTGGCTAATG CAGATTTGCG TTGGGAGCAA TCAAAGACCT TCGATGTGGG ATTTGATCTT
GGGGTATTAA ATAACAGAAT TAATATATTG TTTGATTATT ACAGACGCAG GACGGATAAC
CTGCTGACCA ATTTCAGTTT GCCACAATCT ACGGGCTTTG CCAGTGTGTT AACCAATTTA
GGTAGCCTCC AGAACAAAGG TATAGAACTA GAGCTGAGTG CGAAGATTCT TCCTGAAAAG
TCTGATTTTC AATGGATTTT GTCTTTAAAT GCGTCCAGGG TTAAGAATAA AATACTTAAA
CTTCCAAACA ATGGTATAGA GAACAATCGC ATTGGGGGTG TATATGTTTG GGATTCATCC
AGAAACGATT ACGCTTGGTT AGGAGGGTTG CAGGAGGGCG GAGAGATTGG TGATTTATAT
GCTTATAAGC AACTAGGTAT CTATGCCACA GATGCCGAAG CCCAAAGAGG CCCGAAAGAC
ATGTTGGTGG TAGGGACAGC CAAAACAAAA TTTGGTGGTG ACGTAAATTG GCAAGATGCA
GATAATAATG GGGTAATAGA TGAAAGGGAC CGTGTGTTTG TGGGTAATAT TTATCCAAAA
TGGACTGGTG GGATGGCCAG TACCATGACC TATAGAAATT TCGATCTATA TGTGAGAATG
GATTATACCA CGGGGCATAC CATTTATAAC TATACCCGGG CAATGATGAT AGGGCAGTTT
GTTGGAGAAA ACGGTTTTGT TTCTGATGTC CTCCGATCCT GGCAAACCCA GGGACAGCAG
ACAGATATTC CGAGAATTTA TTGGGCTGAC CAACAGGCGC AAAATAATTT ATTCAGGGGT
AATTCAGCAT ATTACGAAGC GGGTGACTTC CTGGCTTTAA GGGAAGTCAC ACTCAGTTAT
AATTTCTCTC CGGAATTTTT GAAGAAAATA AAAATAGCGA ACCTGAGGCT AAATGCTACA
GGTAGCAATC TTCATTATTT CACCAAATTT AAAGGACTGA ATCCTGAAGA GGGGGGAGAT
GACCGGGGCA GATATCCAAT TCCCAGAAAC ATCATCTTCG GAGCAAACAT TACATTTTAA
 
Protein sequence
MKKLLLCWLP LILFPFFKGY AQVPVSGTVK DTQGGVLPGV SIKLKGTTTG VTTTSSGTYS 
ISVPDASAVL VFSFIGMESQ EIQVSGKRTI DVVLTEQMAA LKEVLVIGYG SQSRETVTTS
VTKLDNKVLE NVPYANLTSA MQGTLSGVRV QSTSGQPGDA SRVVIRGGTS INNPNGAAPL
YIVDGVIKSN INDINSQDIE SMQVLKDAAA TAIYGARGSN GVVILVTKSG KSGIARINYN
YDLTISDLGK GYDMVSARDY IYFQRLGIGA RGTADPSQLT KLGLASSAGT GNDLTNNTAF
TPQYLSDANR YKLNEGWESM PDPIDPTKTI IFKNTDFQDV VYRTALSNNH TLSGSGGTDK
ATFSASLGYQ SNEGIAIFTD YKRLSFNLNG DFKVNEKLKI FGRVMYSNSS GRTVTDAGSN
VSNVFARSAT IPATTKYKFE DGTLAPGLNS SLGNLEYFFN TQDLKNSLEN LTMVTGAHFD
ILPGLSFDPQ ISLYKITSDG RFFQKAYLNG PGQLVNSRNA TGSYAKQIQE QADAVFTYKK
NLKDAHHLEA KIGFSAFWRT TAGLNASGRG ASTDLIPTLN ASAVPVSVGG DETNQLILGY
FSRINYDYKE KYLLSLNARY DGASNLGTTH KWGLFPGVSV GWNIHKEDFW TALPERLFTL
KLRGSYGVNG NISGLGPYQS QGQYSVGAQY NGIAAVQNTT LANADLRWEQ SKTFDVGFDL
GVLNNRINIL FDYYRRRTDN LLTNFSLPQS TGFASVLTNL GSLQNKGIEL ELSAKILPEK
SDFQWILSLN ASRVKNKILK LPNNGIENNR IGGVYVWDSS RNDYAWLGGL QEGGEIGDLY
AYKQLGIYAT DAEAQRGPKD MLVVGTAKTK FGGDVNWQDA DNNGVIDERD RVFVGNIYPK
WTGGMASTMT YRNFDLYVRM DYTTGHTIYN YTRAMMIGQF VGENGFVSDV LRSWQTQGQQ
TDIPRIYWAD QQAQNNLFRG NSAYYEAGDF LALREVTLSY NFSPEFLKKI KIANLRLNAT
GSNLHYFTKF KGLNPEEGGD DRGRYPIPRN IIFGANITF