Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3579 |
Symbol | |
ID | 8254701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4257656 |
End bp | 4260835 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937231 |
Product | TonB-dependent receptor |
Protein accession | YP_003093832 |
Protein GI | 255533460 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.157085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.780186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTTTACTCTG TTGGTTGCCA TTAATTCTCT TTCCTTTCTT TAAAGGATAC GCTCAGGTTC CGGTTTCCGG AACTGTTAAA GATACACAGG GAGGTGTTCT GCCTGGTGTA AGTATCAAGC TAAAGGGTAC TACCACTGGA GTAACTACAA CTTCGTCTGG TACTTATAGT ATTAGTGTAC CTGATGCCAG TGCAGTGCTG GTATTTTCAT TTATCGGCAT GGAAAGCCAG GAAATCCAGG TGTCTGGAAA AAGGACAATT GACGTTGTAC TTACTGAGCA GATGGCGGCA CTAAAGGAAG TATTGGTAAT TGGTTACGGT TCTCAATCTC GAGAGACAGT TACCACGTCG GTGACAAAGC TGGATAATAA GGTGCTTGAA AACGTGCCTT ATGCCAATTT AACATCTGCC ATGCAGGGAA CCCTGTCAGG TGTTCGGGTG CAAAGTACCT CGGGGCAACC CGGAGATGCT TCCCGTGTGG TAATCAGAGG CGGAACCTCT ATTAATAATC CGAATGGTGC CGCACCACTT TATATTGTAG ATGGGGTAAT CAAATCAAAT ATAAATGATA TCAATTCACA GGATATAGAA TCTATGCAGG TGCTGAAAGA TGCGGCGGCT ACTGCTATTT ATGGGGCCAG AGGTTCAAAT GGAGTAGTGA TCCTGGTAAC TAAATCAGGT AAATCCGGTA TAGCCAGGAT TAATTATAAT TATGATCTTA CCATTTCTGA TCTTGGGAAA GGCTACGATA TGGTATCTGC CAGAGATTAT ATTTATTTTC AAAGGTTAGG AATTGGGGCA AGAGGTACTG CCGATCCCAG CCAGTTGACA AAACTTGGTC TTGCCAGTAG TGCGGGTACC GGTAATGACC TTACCAATAA TACGGCATTT ACACCACAGT ATTTATCTGA TGCGAACCGA TATAAACTCA ATGAGGGTTG GGAAAGTATG CCTGATCCGA TAGACCCTAC TAAAACTATT ATTTTTAAAA ATACAGATTT TCAGGATGTT GTTTATCGGA CGGCGCTTTC TAACAACCAT ACCTTGTCGG GCTCGGGAGG GACTGATAAA GCTACTTTCA GTGCGAGCCT TGGATACCAG TCTAATGAGG GGATTGCCAT TTTTACGGAT TATAAACGAC TTTCATTTAA TTTAAATGGT GATTTTAAGG TAAACGAAAA GCTTAAGATC TTTGGCAGGG TGATGTACTC CAATTCCTCC GGAAGGACAG TTACGGATGC CGGAAGTAAT GTGAGTAATG TATTTGCCAG ATCTGCGACT ATACCGGCAA CCACTAAATA TAAATTTGAG GATGGAACAC TAGCCCCTGG TTTAAATTCA AGTTTAGGTA ACCTGGAGTA TTTTTTTAAT ACACAGGACT TGAAAAACAG CCTGGAAAAT CTGACGATGG TTACAGGTGC ACATTTTGAT ATCTTGCCGG GGCTAAGTTT TGACCCGCAA ATCTCTTTAT ATAAAATTAC CTCGGACGGA CGTTTTTTTC AAAAGGCTTA CCTTAACGGA CCAGGGCAAC TGGTGAATTC AAGAAACGCC ACCGGTAGTT ATGCCAAACA GATTCAGGAA CAGGCGGATG CGGTTTTTAC TTATAAAAAA AACCTTAAAG ATGCACACCA TTTAGAAGCG AAAATTGGTT TTTCTGCTTT CTGGAGAACG ACTGCGGGAT TAAATGCAAG CGGAAGGGGG GCATCTACGG ATCTGATCCC AACTTTAAAT GCTTCTGCCG TACCCGTATC GGTTGGTGGT GATGAAACTA ACCAATTGAT TTTGGGCTAC TTTTCTAGAA TTAATTATGA TTATAAAGAG AAATACCTGC TTTCACTAAA CGCCAGGTAC GACGGGGCCT CTAACCTGGG GACTACGCAC AAATGGGGTC TTTTTCCGGG GGTATCTGTA GGTTGGAATA TTCATAAAGA AGATTTTTGG ACTGCATTGC CTGAACGTTT ATTCACGCTG AAACTACGTG GAAGTTATGG CGTAAATGGA AACATCAGTG GTTTGGGCCC TTACCAGTCG CAAGGGCAGT ATAGTGTAGG TGCTCAATAC AATGGAATAG CCGCAGTTCA AAATACAACG CTGGCTAATG CAGATTTGCG TTGGGAGCAA TCAAAGACCT TCGATGTGGG ATTTGATCTT GGGGTATTAA ATAACAGAAT TAATATATTG TTTGATTATT ACAGACGCAG GACGGATAAC CTGCTGACCA ATTTCAGTTT GCCACAATCT ACGGGCTTTG CCAGTGTGTT AACCAATTTA GGTAGCCTCC AGAACAAAGG TATAGAACTA GAGCTGAGTG CGAAGATTCT TCCTGAAAAG TCTGATTTTC AATGGATTTT GTCTTTAAAT GCGTCCAGGG TTAAGAATAA AATACTTAAA CTTCCAAACA ATGGTATAGA GAACAATCGC ATTGGGGGTG TATATGTTTG GGATTCATCC AGAAACGATT ACGCTTGGTT AGGAGGGTTG CAGGAGGGCG GAGAGATTGG TGATTTATAT GCTTATAAGC AACTAGGTAT CTATGCCACA GATGCCGAAG CCCAAAGAGG CCCGAAAGAC ATGTTGGTGG TAGGGACAGC CAAAACAAAA TTTGGTGGTG ACGTAAATTG GCAAGATGCA GATAATAATG GGGTAATAGA TGAAAGGGAC CGTGTGTTTG TGGGTAATAT TTATCCAAAA TGGACTGGTG GGATGGCCAG TACCATGACC TATAGAAATT TCGATCTATA TGTGAGAATG GATTATACCA CGGGGCATAC CATTTATAAC TATACCCGGG CAATGATGAT AGGGCAGTTT GTTGGAGAAA ACGGTTTTGT TTCTGATGTC CTCCGATCCT GGCAAACCCA GGGACAGCAG ACAGATATTC CGAGAATTTA TTGGGCTGAC CAACAGGCGC AAAATAATTT ATTCAGGGGT AATTCAGCAT ATTACGAAGC GGGTGACTTC CTGGCTTTAA GGGAAGTCAC ACTCAGTTAT AATTTCTCTC CGGAATTTTT GAAGAAAATA AAAATAGCGA ACCTGAGGCT AAATGCTACA GGTAGCAATC TTCATTATTT CACCAAATTT AAAGGACTGA ATCCTGAAGA GGGGGGAGAT GACCGGGGCA GATATCCAAT TCCCAGAAAC ATCATCTTCG GAGCAAACAT TACATTTTAA
|
Protein sequence | MKKLLLCWLP LILFPFFKGY AQVPVSGTVK DTQGGVLPGV SIKLKGTTTG VTTTSSGTYS ISVPDASAVL VFSFIGMESQ EIQVSGKRTI DVVLTEQMAA LKEVLVIGYG SQSRETVTTS VTKLDNKVLE NVPYANLTSA MQGTLSGVRV QSTSGQPGDA SRVVIRGGTS INNPNGAAPL YIVDGVIKSN INDINSQDIE SMQVLKDAAA TAIYGARGSN GVVILVTKSG KSGIARINYN YDLTISDLGK GYDMVSARDY IYFQRLGIGA RGTADPSQLT KLGLASSAGT GNDLTNNTAF TPQYLSDANR YKLNEGWESM PDPIDPTKTI IFKNTDFQDV VYRTALSNNH TLSGSGGTDK ATFSASLGYQ SNEGIAIFTD YKRLSFNLNG DFKVNEKLKI FGRVMYSNSS GRTVTDAGSN VSNVFARSAT IPATTKYKFE DGTLAPGLNS SLGNLEYFFN TQDLKNSLEN LTMVTGAHFD ILPGLSFDPQ ISLYKITSDG RFFQKAYLNG PGQLVNSRNA TGSYAKQIQE QADAVFTYKK NLKDAHHLEA KIGFSAFWRT TAGLNASGRG ASTDLIPTLN ASAVPVSVGG DETNQLILGY FSRINYDYKE KYLLSLNARY DGASNLGTTH KWGLFPGVSV GWNIHKEDFW TALPERLFTL KLRGSYGVNG NISGLGPYQS QGQYSVGAQY NGIAAVQNTT LANADLRWEQ SKTFDVGFDL GVLNNRINIL FDYYRRRTDN LLTNFSLPQS TGFASVLTNL GSLQNKGIEL ELSAKILPEK SDFQWILSLN ASRVKNKILK LPNNGIENNR IGGVYVWDSS RNDYAWLGGL QEGGEIGDLY AYKQLGIYAT DAEAQRGPKD MLVVGTAKTK FGGDVNWQDA DNNGVIDERD RVFVGNIYPK WTGGMASTMT YRNFDLYVRM DYTTGHTIYN YTRAMMIGQF VGENGFVSDV LRSWQTQGQQ TDIPRIYWAD QQAQNNLFRG NSAYYEAGDF LALREVTLSY NFSPEFLKKI KIANLRLNAT GSNLHYFTKF KGLNPEEGGD DRGRYPIPRN IIFGANITF
|
| |