Gene Phep_0787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0787 
Symbol 
ID8251876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp930364 
End bp933450 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content43% 
IMG OID644934437 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091071 
Protein GI255530699 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGTG CCTTTAGTGT TCTTGCCCAG CAAACACGAA AGGTTGAAGG GAAGGTGACA 
GACCAGACCT CGGGCGATCC TTTAATTGGT GTCAGTGTGC TGGTTAAGGG AACTAAAACA
GGCGCAACTA CGGATAGGGA TGGACGTTAT GCCATCCAGG TTCCGTCGCA GGGAAACAGC
ACACTTGTTT TTAGTTATAT CGGCTATCTT CAAAGGGAGA TGAGCGTAGG CGATAAAGGT
GTGGTTAATC TAAGCCTTGC CGAAGATAGT AAAGTGCTGA ACGATGTAGT GGTAATTGGT
TATGGTACGG TCGCAAAAAG AGATCTTACT GGTGCTGTAG GCAGCGTAAA TATGAAAGAC
CTGCAAAAGG CACCTGTTAA ATCTTTTGAT GAAGCTTTGG CGGGCCGGGT TGCAGGGGTG
CAGGTGGCAT CAAACGACGG GCAGCCTGGT AATTCATTTA ACATTGTAGT ACGTGGACAA
AACTCCATCA CTCAGGACAA TTCGCCATTA TACGTGGTAG ATGGTTTTCC TTTGGAGGTA
TCTAACAACA ATGCCATTAA TCCTGCAGAT ATTGAATCTA TTGAAGTATT GAAGGATGCA
TCAGCAACAG CAATTTATGG AGCAAGAGGT GCGAATGGTG TGATCCTGAT CACAACAAAG
GGCGGAAAAA TAGGTGCGCC TGTTATTTCG TATACCGGTA CTGTGGGCTT TCAGCAAAAC
ACAAAAAGAA TGGATGTGAT GAGCCCATAC GAATTTGTAC GGCTGCAGGA AGAGATTGAC
CCGATCAATA CCCCGCTTCT TTATTATAAA GACGGAAAAA CACTCGACTC TTATAAAGAC
ATTAAAGGAA TCGACTGGCA GGACCAGGTA TTCAGGACGG CCCCTTCAAC AGAGCATAAC
CTGTCTTTAA CTGGTGGAAC CGAAAAAACA CAATATGTTA TATCAGGGTC TATCAATTCC
CAGGACGGAG TGATCATCAA TTCAGGCTTT AACCGTTACC AGGGCAGAAT GGCCATTACT
CAAAGAGTTA ATGATAAACT GAAGGTTTTC GCGAGTATCG ATTACAGCAA TATCAAGGCC
AGCGGTACTA TTCCTACTTC CGGAACCAAT TCGGCTACCA ACAACCTGTT GTATAGTGTA
TGGGGCTACA GACCTGTTAT GGGATCGGAC GGTAACCTGC TGGACCAGCT GTTTGATCCT
GATATTGATG GGTTGAATGA TTACAGGATA AATCCGGTAT TGTCGTCAAA CAACGAATTG
CGGAATGCAA CAACCAATAA TTTACGCATA AACTCTTATG CACAATATGC TTTCAATAAA
AACCTGACCC TGAAGGTTTC GGGTGGTATT TCCAATAACC TGCGCAGAAA TGATGTTTTT
TATAACAGCC AAACTTATTA TGGTGGACCA AGCAGTACCA ATAAAGTAAA CGGATCTATC
ATTTATACAC AAAATACCAG CTGGCTAAAT GAAAACATCT TAACTTTTGC CAAAAGATTT
AATAAGGTTC ACAACCTGAA CATAGTAGCA GGTATTACCA TGCAGGAGGA TAAATATTCG
CGATATGGTT TGGCTGCTAT ACAGTTACCG AATGAGTCGG CCGGTTTGGA TGGTTTATCT
CAGGGTACAC CGCTGCCGGT TACAGCAGAA AGCTCGAACA GCAAGCTGCT CTCTTTTTTA
GGCCGGGTTA CCTACGATTA TAAGTCTAAA TATTTATTAA GTGCTTCTTT CAGGGCCGAT
GGTTCTTCTA AGTTTTTACC AGGTAAGCGC ACCAGTTATT TTCCTTCGGG ATCAATTGCC
TGGAGAATGA GCAACGAAGA CTTTATGAAA GCCCTTCCAT TTGTAAATGA TGCCAAACTG
AGGGTGGGCT ATGGCGTAAC CGGTAATAAC AGGGTGGGCG ATTTCTCTTA CCTTTCTGTA
TTGGGATTCC CTATTGGTGG GTCTTATGGT TTTAACAACA CGGTGAGCAT AGGTGCTATT
CCTTTAACTT ATGGCAACCC TGACCTGAGA TGGGAAAGTA CAGCCCAGGC CAATATTGGG
TATGACCTGA GCCTGTTTAA GGACAGGATA GGTTTTACTG CCGACGTTTA CCGTAAAAGC
ACTTACGATT TATTGCTGAA TGCTGATTTG CCTTATACTA CCGGTTATGA AACAGCATTT
AAAAATATTG GTAAAGTAAG AAACGAAGGC CTTGAATTTA CCTTAAACAC AAGGAACATC
GACAATAAAG ATTTTAAATG GTCATCCAGT TTTAACATCA GCTTTAACCG TTCAAAGGTA
ATGGCCCTGA ATGCGGGACA ATCGTTTAAA ACCACACCGA TCAGCTGGGA GAATTCCTAT
AATGCCACAC CATTGTATAT TGCAAATGTT GGCCAGCCAA TTGCCCAGTT TTATGGCTAT
GAGTTTGATG GTGTATACCA GTACAGTGAT TTTAATGAAA ACAGTCCCGG TGTATTTACA
CTGAAAGATG ATGTGCCAAA CAACGGGAAT GCGAGAAACA GCATTAAACC TGGCGACATT
AAATACAAGG ATCTGGATGG GAACCTGGTT GTAGATGCTA AAGACCGCAA AGTAATTGGT
CGCGGCCAGC CCATTCATGT AGGTGGTTTT ACCAATAACT TTACCTATAA AAACTTTGAC
CTGAGCGTAT TTTTGCAATG GTCTTATGGC AACGATATCT ACAACGCCAA CAGGATGTTG
TTTGAAGGAA ATATGCTGGA CAAAAAGAAC CTGAACCAGT TTGCGAGCTA TGCAGACCGC
TGGACACCAG ATAACCCAAG CAATACACTT TACAGGGTTA AAGGACAAGG CCCGGCAGTA
TATTCTTCAA GGGTAATAGA AGACGGATCT TTCCTGCGGA TCAAAACAGT TTCCCTGGGA
TACAATTTTA GCGCTGATGT GCTGAAAAGG ATTAAACTGA AAAGTCTGAG GGTTTCGGCT
TCGGGGCAAA ATTTATACAC TTTTACAAAA TATACCGGAA TGGATCCTGA AGTATCTGTA
AGGAATTCGG CATTGACCCC TGGATTTGAT TACTCGGCTT ATCCAAGGGC AAGGGCAGTT
GTTTTTAGCC TGAATACTTC ATTTTAA
 
Protein sequence
MICAFSVLAQ QTRKVEGKVT DQTSGDPLIG VSVLVKGTKT GATTDRDGRY AIQVPSQGNS 
TLVFSYIGYL QREMSVGDKG VVNLSLAEDS KVLNDVVVIG YGTVAKRDLT GAVGSVNMKD
LQKAPVKSFD EALAGRVAGV QVASNDGQPG NSFNIVVRGQ NSITQDNSPL YVVDGFPLEV
SNNNAINPAD IESIEVLKDA SATAIYGARG ANGVILITTK GGKIGAPVIS YTGTVGFQQN
TKRMDVMSPY EFVRLQEEID PINTPLLYYK DGKTLDSYKD IKGIDWQDQV FRTAPSTEHN
LSLTGGTEKT QYVISGSINS QDGVIINSGF NRYQGRMAIT QRVNDKLKVF ASIDYSNIKA
SGTIPTSGTN SATNNLLYSV WGYRPVMGSD GNLLDQLFDP DIDGLNDYRI NPVLSSNNEL
RNATTNNLRI NSYAQYAFNK NLTLKVSGGI SNNLRRNDVF YNSQTYYGGP SSTNKVNGSI
IYTQNTSWLN ENILTFAKRF NKVHNLNIVA GITMQEDKYS RYGLAAIQLP NESAGLDGLS
QGTPLPVTAE SSNSKLLSFL GRVTYDYKSK YLLSASFRAD GSSKFLPGKR TSYFPSGSIA
WRMSNEDFMK ALPFVNDAKL RVGYGVTGNN RVGDFSYLSV LGFPIGGSYG FNNTVSIGAI
PLTYGNPDLR WESTAQANIG YDLSLFKDRI GFTADVYRKS TYDLLLNADL PYTTGYETAF
KNIGKVRNEG LEFTLNTRNI DNKDFKWSSS FNISFNRSKV MALNAGQSFK TTPISWENSY
NATPLYIANV GQPIAQFYGY EFDGVYQYSD FNENSPGVFT LKDDVPNNGN ARNSIKPGDI
KYKDLDGNLV VDAKDRKVIG RGQPIHVGGF TNNFTYKNFD LSVFLQWSYG NDIYNANRML
FEGNMLDKKN LNQFASYADR WTPDNPSNTL YRVKGQGPAV YSSRVIEDGS FLRIKTVSLG
YNFSADVLKR IKLKSLRVSA SGQNLYTFTK YTGMDPEVSV RNSALTPGFD YSAYPRARAV
VFSLNTSF