Gene Phep_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4131 
Symbol 
ID8255266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4992941 
End bp4996024 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content40% 
IMG OID644937796 
ProductTonB-dependent receptor plug 
Protein accessionYP_003094384 
Protein GI255534012 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.28448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.339257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA AATTACTCAT GTTATTCATG GGGACTTTTT TGTTGGTGTC ACATGCCATG 
GCACAACAAA TTACCGTTTC CGGTAAAGTA ACTTCATCTG AAGATGGAGG TATTGTCCCT
GGGGCTTCTG TTTTAATTAA AGGGACGAAG ACGGCTACTC AAACAAATTC ATCGGGTGTG
TATACCATAC AGACTAAGGC GGGGGACATC CTTGTGTTTA GCTACATCGG ATTACTCCCT
CAGGAAAGAC CTGTTGGTGG TAGCTCAATA ATCAACGTTG TTTTAAGCGC GGATTCAAAA
GGGTTGAATG AAGTTGTGGT AACTGCCTAC GGTATTGAAC GGGATTCCAA ATCACTAGGT
TATTCTACAC CTAAAGTGAG TGGAGATGAG GTTTCCCAGA CACAAAGAGA ATCCTTTTTC
AGTGGTTTAC AAGGACGTGT TCCAGGCTTA TCTATCAATC CGACCAGTGG TGATCCCGGT
GCATCATCGC AAATTGTATT GAGGGGCTTT GTATCAGTAA GTGGGGATAA CAGCCCACTG
ATTGTAGTTG ATGGGTTGCC TATTGACAAT TCAATTATCA ACCAGACCAA TGATCTGATA
GGAGGGGCAC CCAACCGTAA CTCGGATTAC TCCAATCGTG CTGGTGACAT TAACCCAGCA
GATATTGAAA GTTATACCAT ATTAAAAGGG CCAGAAGCAA CAGCATTATA TGGTAACCTT
GGAGCTAGTG GGGCTATTCT GATTACGACT AAAAAAGCAA AAGCAGGAAA AGGAAGTATT
AACTACAGTA CAAACTTTAA CGTTTCTAGT GTAGTTAATA TGCCAGAAGT ACAAACCAAA
TACAATCAGG GACTGAATGG AATATATGCT TCCAATACAA CAGTTTATGG TGGTCCGGTA
TATCCTGAAG GAACCAAGCT TTATAATAAT TTTGATGCGT TCTTTCAGAA TGCAATTTCG
CAGCAGCAAA ACCTTTCTTT TGAAGGAGGT ACTGAAAAAT ATACCTATCG TTGGTCCAAT
CAATATGCCA CGTTTAATGG TACGGTGCCC AATACAAACC TGGATAAGTT TTCTTCACGT
TTAACTGCGG AAGGAGAGAT TGCTCCCTGG TTAAAATTAA CAACGTTTTT TAACTATATC
AATAGTAAGA ATGTGAAGCC AACTAAAGGT GTTTCTGGCT ACCTGTCTAC TTTACTGCGT
TTTCCTCCAA GATATGACAT CAATTACTGG CAGGATGAAC TAGGTAACAG GGTGTTGCGT
GTGGCTGATA TTTATAGTGA ATTTGACAAC CCTTTTTGGA CAGCCTATAA GAATACGTCA
ACGGATGAGA CAAACCGTTT TATGATGAAC AATACTTTTC GTATCAGGCC AACTAAGTGG
TTGAATATTA ATGTAACGAT GGCCGCAGAT GTCTCTAACA CAGCTGGATT GCAGGCTTTT
AACGGACAGT CTTATGCAGG TTCAGGTTCT GCTGATGATC CTGCTTTGGG AAGAATCACA
ACTTATGACC GGAAAACCAG GATACTGAAT GGCTCTGTTG TTGCCTCAGC AAATCATAAA
ATCGGGAACT TCAGTACCAC ATTTGTACTA GGTGGGAATA TAGGTGATAA TTATATCAAT
ACGAACTCAA TATATGGGGA AAAAATGTAT GATCCTAACT TCTATAGTAT CAACAATACA
TTGCCTACTA CACAGAGAGC ACGTAATTCT ATCAATAATT ATAGAACTGT TGGTGCATTT
GCCCAGGCGG TTTTAGGTTA CAATTCGCTT GTTTATTTAA CACTTTCAGG AAGGGTTGAT
GGTGCTTCAC GTTTGATGCC AAATGACCCA TATTTTGCCT ACCCTTCTGC CAGTTTTGCT
TTCAATTTTA CTGATCTTAA GTATTTCAAG GAAATTGACT GGATAACAGG TGGTAAGCTT
AGGGCTTCGG TAGGTATAAC AGGTAAAGAG CCCTGGAGAA CTTATGCTGT TTTAACCAAT
TTAACACCAA GAACATCGAG TGGAGGTGGC TTTTCTTATG ATTATAACGG AGGAAACCGT
AAGCTTAAGC CCGAAACAAC AATTAATTGG GAAACCGGGT TTGACTTGAA AATGTTTAAA
GACAGACTGA GTTTAGATTT TACCTACTAT CGTTTATTGA GTAAGGATCA GATCATTCAG
CCACGAATCA GCTACGCAAC GGGTTATGTC TTACGGATGC TGAATGGAGG TGAGGTACGG
AATCAGGGGG TTGAAATTCA GGTGATGGGT ACACCTATCC AAAGAAAAGA TTTTGGCTGG
GATGCGACAT TCAACTTTGC GCTGAACAGG GGTAAGGTAA TTTCTATTGC TGATGAACTG
CCGGAATTGT ATGATTCGGA TACTTGGGTA CTTGGTGGCT TGAGGTCTGC GGTATTTCCT
GGGGCAAGTA TGACTGCCAT TGGAGGTATA CGTTTTGACA GGAACAATAA TGGGGATATT
TTGATCAATC CGGCTACAGG TCTTCCGTAT ACAACTGGTG AAAACTATGA AGTGATTGGT
GATCGTCAGC CAAAATTTAC ATTTGGAATA ACAAATAATA TCAGGTTAAA AAGCTTTAAT
CTTTCCTTTT TGTGGGATTT CCGTATTGGG GGAGATATTG TAAATGGTAC CGAATACGTA
AATTATACAC GTGGTATAAG CACCAAGACT CTTGATAGAG AAGAACCACG AGTAGTAAAG
GGCGTGTTAA AAGACGGCTT GGAAAACACA AACAACCCAA CGCCAAATGC AATTGCTGTT
ACCCCGTATC TGAATTCACT ATATTATACT ACGAATGTTT CCGCAGAGAT GTTTGTTGAA
AAGAACATCA ATACAATTCG TTTAAGGGAC ATTAGCTTAA GCTATGTTAT TCCAAAAACA
GTTTTTAAGC GGTTGCCTTT TCTGCAAAGT GCAAGTGTGT TTGTAACGCT AACGGATGTG
GTGTTGTTTA CCAACTATTC AGGAATGGAT CCTGAAAGTA ATTCAAACAA TGCCTCTCTC
GGTGGAGCAG GTGGGATGGG AATAGACTAT TATAATATGG GTCGCCCTTT AACAGCAAAC
TTTGGTTTGA AATTGAAACT TTAA
 
Protein sequence
MKKKLLMLFM GTFLLVSHAM AQQITVSGKV TSSEDGGIVP GASVLIKGTK TATQTNSSGV 
YTIQTKAGDI LVFSYIGLLP QERPVGGSSI INVVLSADSK GLNEVVVTAY GIERDSKSLG
YSTPKVSGDE VSQTQRESFF SGLQGRVPGL SINPTSGDPG ASSQIVLRGF VSVSGDNSPL
IVVDGLPIDN SIINQTNDLI GGAPNRNSDY SNRAGDINPA DIESYTILKG PEATALYGNL
GASGAILITT KKAKAGKGSI NYSTNFNVSS VVNMPEVQTK YNQGLNGIYA SNTTVYGGPV
YPEGTKLYNN FDAFFQNAIS QQQNLSFEGG TEKYTYRWSN QYATFNGTVP NTNLDKFSSR
LTAEGEIAPW LKLTTFFNYI NSKNVKPTKG VSGYLSTLLR FPPRYDINYW QDELGNRVLR
VADIYSEFDN PFWTAYKNTS TDETNRFMMN NTFRIRPTKW LNINVTMAAD VSNTAGLQAF
NGQSYAGSGS ADDPALGRIT TYDRKTRILN GSVVASANHK IGNFSTTFVL GGNIGDNYIN
TNSIYGEKMY DPNFYSINNT LPTTQRARNS INNYRTVGAF AQAVLGYNSL VYLTLSGRVD
GASRLMPNDP YFAYPSASFA FNFTDLKYFK EIDWITGGKL RASVGITGKE PWRTYAVLTN
LTPRTSSGGG FSYDYNGGNR KLKPETTINW ETGFDLKMFK DRLSLDFTYY RLLSKDQIIQ
PRISYATGYV LRMLNGGEVR NQGVEIQVMG TPIQRKDFGW DATFNFALNR GKVISIADEL
PELYDSDTWV LGGLRSAVFP GASMTAIGGI RFDRNNNGDI LINPATGLPY TTGENYEVIG
DRQPKFTFGI TNNIRLKSFN LSFLWDFRIG GDIVNGTEYV NYTRGISTKT LDREEPRVVK
GVLKDGLENT NNPTPNAIAV TPYLNSLYYT TNVSAEMFVE KNINTIRLRD ISLSYVIPKT
VFKRLPFLQS ASVFVTLTDV VLFTNYSGMD PESNSNNASL GGAGGMGIDY YNMGRPLTAN
FGLKLKL