Gene Phep_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3200 
Symbol 
ID8254319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3805555 
End bp3811494 
Gene Length5940 bp 
Protein Length1979 aa 
Translation table11 
GC content42% 
IMG OID644936853 
ProductTonB-dependent receptor plug 
Protein accessionYP_003093457 
Protein GI255533085 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.257633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCATC TTTTACGCTC AATTCTGATT GTTGCGCTGG CCTTTAACTT TAGCTTCGCA 
CAGAAAAGAT TAACTGTCAG CTCCCAATCT GGCCCCTATA CTTTTATTTA CAAACTGACG
GATCAGGAAG CTTTTAATAT AGCTTCTAAA ACGAAAAGCA TTATAAACGA TAGCTTTTTT
CATACGCTTG TTGATACTTT TTACAATGCG AACAATAAAC CCTACACCCG GAAGCTTCCT
TATGGTAATT ACCTTTATGT AAGGGCAGTT AAAAACGAAC TGTTTTACCG GCTTGGACCG
GAAAATAATG TAAATCTCCA GTTCATCAAT AACAGAAAAG ATTTTCAGTT TAGCCTTACT
GATTTGAAGG GCAACCTAAT TAAAGATGCT ACGGTTAGGG TAGGAAGGGG CAGGAGCATT
AAATTTAATG AGCAGGCGGG TTTGTATAGC ACCGGATCGG TGGCAAAACT TGCGGTAATT
ACTGTTAAAT ATAATGGCGT AAGCAATTAT TTTACTTACG ATGAAGAAGA AAAAGAAAGA
CCATACCGTT CGTACAATAA TCCTTCTTTC TTTAAAAAAC TCTTTAACCC TAAACGATAT
AGCCGTAACC AGTATGGTAA GGTAGAAAAA CCAAAATATA CAGGTTATAT GGTGTTTAAC
AAACCTATGT ATAAACCTTT GGATTCGGTT AAGTTTAAAA CCTACCTGGT TACAGCCACA
GGCAAAAGCA TTCAGAATAA ACCTTTAAGG GTTATACTTG ATAAAGATTT TAAAGGAGAG
GGAATACTAC TGACCACTTT AAAGCCTTAT CGCGATGGAG GATATACGTA CAGTTTTGCA
CTGACTGACA GTTTGCAACT TAAACTGGAC AGAAACTACC GTATCGTTTT GAAAGAACAG
CAGGGAAAGG AATGGAAAAT GGTTTACCAG GGCAATTTCC GTTATGAAGA TTATGAACTG
AAGTCGGTTA ATTTTATGGT GAGGAGCGAC AGAGAAGAAC ATAGTCCGGG AAGTCCGGTT
ACCCTGTTTA TGAAAGCCAG GGATGAAAAT GAGCTTGCTG TTCCGGATGG TCGGGTAGAA
GTGGTAGTGA GGACCAATAA TGCTTCAGTT TTTTATGACC GAAAGGTCTT TGTAAAGGAT
ACTTTGTGGA AAACCAATGT GGTGCTTGAC CCTGTTGGTG AAACCAGGTT GATGTTGCCG
GACAGTATTT TTCCAAAGGC GGACCTTAGC TTTTCTGCCC ATTTTACCTT TTTAAACAGT
AACAATGAAC GGCGTACGGC CAGTAAATCA TTAAAATGTT TGGTTAAGGA CCGCAGCATT
AAATTTGAAT TTCAAAAGGA CAGCCTGCGT CTGGATTATC TGGTTAAGGG CAAATCGGTC
AGGCAAGAGG CGGTCATTTC TATGGATTAT CCTTATGGTG AGGCTACCGA TTCTATAAAA
ATAATGCTTC CGGCAACTAT AAAAATGAAT TACAGGGGCA GCGACTATGC TGTTAAAATG
GCTGATGGAT ACCGGCAGAC CATTTATCTG GAGCATTTGA AACCCGCAAT CAGTGTGAGT
GCTGTGCAAA GCAAAGATTC TCTAGCTGTA TCTGTAAGCA ATGAACATAA GGTGCCTTTC
TGGTACAGCA TTTTTTCAGG TAATAAGGTA TTGTTAAAAG GTTATACCAC CCGTTTGGAT
ACCCTGATCA AACACAGTTC GGTAAAGGCA GCGCATGTAA AATTGAATTA CATCTGGGAT
GAAGAAGAGC AAAGTACCGA AACCAGCGCT TTTTACAGAC CTCATATTTT AAATGTTAAA
CTTTTGGCCC CAGAAGTGGT GTATCCAGGC CAAACGGTAA ACATGCAGGT CAAAGTTACC
GACCAGAACA ACCTACCCGT TGCGGCAACA GACGTTACGG CCTACGCCTA TACCTCAAAG
TTTAAGAATG GCTACAGGGT GAACCTGCCC TACTTTGGAA AAAGTTTTTT TGCCCGTAAG
TTAAACCAGA AATTTGCGGC TGAAGACCTG AGTTTATCGG GCCAGCTGCC CCTTGATTGG
GAAAAATGGG GGCGGGAGCT GAACCTAGAC ACGATTGAAT ATTATAAGTT TAGTCGGACG
AAAGACCTGT ATGCCATAAT GGAAAATGGT GATGGCTCTG ATAGTGCTAT TGTGGCCCCC
TTTGTAATCA GAAACGGGAA CATTGAGCCG GTGCATGTGC TGTACATTGA CGATCGTCCG
GTATATTTTA GTCAGGCCAG TCAATTGCAG CGTTATGCAT TCAGCGTTAC TCCGGGCAGG
CATAATATCC GTATGCGTAC AGCCTGGTAC ACGGTTAGTC TGGACGATTT TGAACTGCCG
AAAGGTAAAA AGACAATTTT AAGTGTGGCT GCCGATTTGC AAAATACCAA GGCAAAGGTA
AGCACGGCCC CTTATACCAT GACCAGTCAG GAAGCTGCAT TTCTGACCGA TTACATGCTG
CGTATTGAGA ATAATTTTGG TGGTGAAAAA GCGATCCTTA CTGCAGGTAA TACCGAACAT
TTGCTGAACC TTCCGGATGT TTCTGCAAAT GCAGGCATGC AGCAGCGCTG GCAGGAAGGT
CTGACACCGG TAAATCAGGG GAATACTTTA CTCGTGGGCC CATTTGCCGA AAATCTGCTC
GAGTTTAAAT CGGGCGGACT TAATCAGAAT TTTTTAAAGG AACCGGGTTA TACTTATACC
TTTTTACCCG GATTGCTCAA ACAAAAATCG TACACCAGCC CTTATGGTTT TAATACCTCC
CTGAATACAA CTACAGGAAC CACTGATTAT AAGGAATATC CCTTAAAAAA AGGTGAGATA
GACAGCCTCT GGAATGCTTA CCTTGACCTG CGCAGCAGTA CCATACCACT ATTTAATAAC
GGCTACAGTT ATGGAAATAA TAAAGATTTT GGCCGTTTGG TAATGAAACT GGATACCAGT
ATTTCAAAAG CGATGCCTTA CTTAAAAAAT ATCATCATTT ATAAATCTGA TCAGCCCGAT
TTTTTACAAA TTTATCCTGG CAATACAGGC TATTTTAATG CACTGGAAAG CGGTAAATAC
CGCATCATTT ATCTGTTTAA GGACAACCGT TATTTTGTTG CAGAAGGGGT GGTCATCCAT
GCCGGCGGTA TCAATTATTT TGAGTGGAAG GACATGAAAA TACTTAAGGC GGATGACCTG
AGCAAAAAAA TAGACCAGCA CATAAAATCT GTAAACATTG GCCGGAACAG TAAAAGACCC
GAAAATGTAC AGCAAAAAAT ACTGGAAGAT ATCAATGGGC AATATTTTGA TGAGTCGATC
CTGACCGGTA AAATGAGCGG CAGGGTAATT GATGCCGGTG ATAAGCGACC AGTTCAGGGG
GCGGTAGTGA AAATAAAAGG GGCAAATTAT ATGACCCGTA CAGACGTTGA AGGCCGCTTT
GTACTTAAAA GCCCTAAAAA AGGAAAAGTG CTGGTCTCTA TGATAGGTTA TGATACAAGA
GAGGTCAACG TTGTGAATAC TGATGTAGGC GACATTAAAC TGGATGCAGC CACTTCTGCT
TTGGAAGAAG TTGTAGTGGT AGGTTATGGG GTACAGAAAA AACAGAATTT AACGGGCGCA
GTCAGCACAC TAAGGGAGGT TGCTTTATCG GGTGCCCTGG CCGGCAGACT TTCGGGTAGG
GTGGCTGGTG TCGCAGTTCC GGGTGCGGAG CAGCAGGTTA TGATCAGGGG TACAAGCTCA
TTGCCTGCCG ACAAAAAGCC AATGATCCTG ATAGATGGTT TACCATTTGA TGGTGATGTG
AACAGTCTTG ATCCGTCGAC AATTGAGGCC ATGAATGTAC TTAAGGATGC CTCTGCGACA
GCCATTTATG GTGCCCGGGC TGCAAATGGA GTTATCATTA TCAAAACCAA AGGTGGCCTT
ACCGTTGCAA ATGCGGCTGG CGAACTGGTG CAGCAGCAGC AAACCATGCG TACCAATTTT
TCCGATTATG CCATCTGGCA GCCTCAATTG TTTACAGATG CTGAGGGAAA AGCAAGTTTT
ACCGTTAAAT TCCCGGACGA CATTACGAAC TGGACCACCA AACTGATTGC CATGAACGGC
CGTAAACAAG GTGGCATTGC CGAAACAGCT ATCAAATCTT TTAAAGTTTT AAGTGCCAAT
TTTGTTTCGC CATTATTTGC CCTTGCGGGC GACAGCATCA ATGTGATTGG TAAACTGATG
AATTACAGCA ATACGGATGA AAATGTAACC CGCAAATTCA GTTATAATGG TGCAGAACTG
TTGAACAGCA GGCTTACCTT TAGAAATGCA AAGATTGATA CGGTAGCTAT TGTAGCCAAA
GGCGCAGGTT TAAAAGTACA GGCAGACCAC ATCAATGTAG CCGACAGCCT GACTTTCGAG
TATATAATGA AACAGGAGAA TGGCTATTTT GACGGAGAGA TCCGTAAAAT CCCTTTGTTC
CAGACCGGCG TTACCGAAAC CAAAGGTTTC TTTAATGCAC TGACCCGCGA TACTTCCATT
ACTTATAAAT TTGATGCTAC CTTGGGTAAA GTTACTTTAA GGGCCGAGGC TTCCTTATTC
CCGACACTGC TGGATGAGAT GGAGAAGCTG CGCAGGTACG AATACCTGTG CAATGAACAG
ATGGCCTCTA AACTGAAAGC CCTGTTACTT GAAAAGTCAG TAAGAAAATA TCTTGGCGAA
GATTTTAAAG AAGAAAAGAA CATCAAATAC CTGATTAAAA AGCTGCAGGA CAACAAAAGT
CCTGAAGGTA CCTGGGGTTG GTGGCAAAAC AGCGGTGAAG AACTCTGGAT TAGCTTGCAT
GTTGCAGAAA GTTTGCTTGA AGCACAAAAA CAGGGCTATG CCGTTGTGCT GGATAAAAAC
AAGCTTTATC GTTACCTGCT TAATAAAATG GCGGTCGGTA TTCATTTTGA TCAGGTTTAT
GGGGTAAAAC TGCTCCATCT TTTAAATGAA AAACATGATT TGAAGGACTG GATAACAACT
ATTGAAAAAG AAAGGGCTGC ATTAGAAGCA AAGCATTCAA AAGAAAGAAA GGCCAATTCA
GATATTCCGC CTTTGGGTAA ACAATCCTTA AATGAAAAGC TGCAACTGAT GCAATTGAAA
CAACTGGCTG GTATGGCCGT AGATATAAAA TGGCTGTTGG GTTTAAAAAA TGAGACCATA
TTTGGCAACA GTTACTGGGG AGAAGAGGGG AATCAATTTT GGGACAACAG CATTCAGAAT
ACCCTGCTGG CTTACCAGAT CCTGAAAGCC AATGGTGGAT ATAGGGACGA ACTGGACCGT
ATCCAGCGTT ACTTTCTGGA GCAGAGAAGG GACGGGCAAT GGCGCAATAC TTATGAGTCT
TCTTTAATCC TGGAAGCCAT ATTGCCGGAA CTGATGAGAA GCCCCGGTAA AAAGCTGGAA
CCTGCTTCCA TTGTCTTTAA CCAAACAGAA ACTATTACCA GTTTTCCATT CAGCAGGGTA
GTGGCACCCG AGGCTTTAAC CCTGGGGAAA AAGGGTGATG CACCAGTTTA TTTCACGGCA
TATCAGCAGT TCAACAATCC GCAACCCAAA AAGGCAAGTA AAGATTTTGG TGTGAAGTCA
TGGTTTGAGC AGCAGGGAGC TGAAGTTGGC AAACTAAAAG CAGGAACGCT GGCTACATTG
CAGGTTGAAG TTGAGGTACG TGCCGACGCA GATTATGTAA TGATAGAGAT ACCAATCCCG
GCAGGATGTT CTTATGAGCA AAAAATGCAA AGTTTTTGGG GTGTTGAAAC GCATCGGGAA
TACTTTAAGC ACAAAACATC CGTTTTCTGT ACCAAGCTGA AAAAAGGGAA ATATCGTTTT
GCAGTACAAC TGATGCCCAG GTATTCGGGC AATTATAATT TAAACCCCGC TAAGGCAGAA
ATGATGTATT TCCCGGTATT TTATGGAAGG GAAGGCATGA AACGGATTAT TGTGAAGTAG
 
Protein sequence
MKHLLRSILI VALAFNFSFA QKRLTVSSQS GPYTFIYKLT DQEAFNIASK TKSIINDSFF 
HTLVDTFYNA NNKPYTRKLP YGNYLYVRAV KNELFYRLGP ENNVNLQFIN NRKDFQFSLT
DLKGNLIKDA TVRVGRGRSI KFNEQAGLYS TGSVAKLAVI TVKYNGVSNY FTYDEEEKER
PYRSYNNPSF FKKLFNPKRY SRNQYGKVEK PKYTGYMVFN KPMYKPLDSV KFKTYLVTAT
GKSIQNKPLR VILDKDFKGE GILLTTLKPY RDGGYTYSFA LTDSLQLKLD RNYRIVLKEQ
QGKEWKMVYQ GNFRYEDYEL KSVNFMVRSD REEHSPGSPV TLFMKARDEN ELAVPDGRVE
VVVRTNNASV FYDRKVFVKD TLWKTNVVLD PVGETRLMLP DSIFPKADLS FSAHFTFLNS
NNERRTASKS LKCLVKDRSI KFEFQKDSLR LDYLVKGKSV RQEAVISMDY PYGEATDSIK
IMLPATIKMN YRGSDYAVKM ADGYRQTIYL EHLKPAISVS AVQSKDSLAV SVSNEHKVPF
WYSIFSGNKV LLKGYTTRLD TLIKHSSVKA AHVKLNYIWD EEEQSTETSA FYRPHILNVK
LLAPEVVYPG QTVNMQVKVT DQNNLPVAAT DVTAYAYTSK FKNGYRVNLP YFGKSFFARK
LNQKFAAEDL SLSGQLPLDW EKWGRELNLD TIEYYKFSRT KDLYAIMENG DGSDSAIVAP
FVIRNGNIEP VHVLYIDDRP VYFSQASQLQ RYAFSVTPGR HNIRMRTAWY TVSLDDFELP
KGKKTILSVA ADLQNTKAKV STAPYTMTSQ EAAFLTDYML RIENNFGGEK AILTAGNTEH
LLNLPDVSAN AGMQQRWQEG LTPVNQGNTL LVGPFAENLL EFKSGGLNQN FLKEPGYTYT
FLPGLLKQKS YTSPYGFNTS LNTTTGTTDY KEYPLKKGEI DSLWNAYLDL RSSTIPLFNN
GYSYGNNKDF GRLVMKLDTS ISKAMPYLKN IIIYKSDQPD FLQIYPGNTG YFNALESGKY
RIIYLFKDNR YFVAEGVVIH AGGINYFEWK DMKILKADDL SKKIDQHIKS VNIGRNSKRP
ENVQQKILED INGQYFDESI LTGKMSGRVI DAGDKRPVQG AVVKIKGANY MTRTDVEGRF
VLKSPKKGKV LVSMIGYDTR EVNVVNTDVG DIKLDAATSA LEEVVVVGYG VQKKQNLTGA
VSTLREVALS GALAGRLSGR VAGVAVPGAE QQVMIRGTSS LPADKKPMIL IDGLPFDGDV
NSLDPSTIEA MNVLKDASAT AIYGARAANG VIIIKTKGGL TVANAAGELV QQQQTMRTNF
SDYAIWQPQL FTDAEGKASF TVKFPDDITN WTTKLIAMNG RKQGGIAETA IKSFKVLSAN
FVSPLFALAG DSINVIGKLM NYSNTDENVT RKFSYNGAEL LNSRLTFRNA KIDTVAIVAK
GAGLKVQADH INVADSLTFE YIMKQENGYF DGEIRKIPLF QTGVTETKGF FNALTRDTSI
TYKFDATLGK VTLRAEASLF PTLLDEMEKL RRYEYLCNEQ MASKLKALLL EKSVRKYLGE
DFKEEKNIKY LIKKLQDNKS PEGTWGWWQN SGEELWISLH VAESLLEAQK QGYAVVLDKN
KLYRYLLNKM AVGIHFDQVY GVKLLHLLNE KHDLKDWITT IEKERAALEA KHSKERKANS
DIPPLGKQSL NEKLQLMQLK QLAGMAVDIK WLLGLKNETI FGNSYWGEEG NQFWDNSIQN
TLLAYQILKA NGGYRDELDR IQRYFLEQRR DGQWRNTYES SLILEAILPE LMRSPGKKLE
PASIVFNQTE TITSFPFSRV VAPEALTLGK KGDAPVYFTA YQQFNNPQPK KASKDFGVKS
WFEQQGAEVG KLKAGTLATL QVEVEVRADA DYVMIEIPIP AGCSYEQKMQ SFWGVETHRE
YFKHKTSVFC TKLKKGKYRF AVQLMPRYSG NYNLNPAKAE MMYFPVFYGR EGMKRIIVK