Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4131 |
Symbol | |
ID | 8255266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4992941 |
End bp | 4996024 |
Gene Length | 3084 bp |
Protein Length | 1027 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644937796 |
Product | TonB-dependent receptor plug |
Protein accession | YP_003094384 |
Protein GI | 255534012 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.28448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.339257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA AATTACTCAT GTTATTCATG GGGACTTTTT TGTTGGTGTC ACATGCCATG GCACAACAAA TTACCGTTTC CGGTAAAGTA ACTTCATCTG AAGATGGAGG TATTGTCCCT GGGGCTTCTG TTTTAATTAA AGGGACGAAG ACGGCTACTC AAACAAATTC ATCGGGTGTG TATACCATAC AGACTAAGGC GGGGGACATC CTTGTGTTTA GCTACATCGG ATTACTCCCT CAGGAAAGAC CTGTTGGTGG TAGCTCAATA ATCAACGTTG TTTTAAGCGC GGATTCAAAA GGGTTGAATG AAGTTGTGGT AACTGCCTAC GGTATTGAAC GGGATTCCAA ATCACTAGGT TATTCTACAC CTAAAGTGAG TGGAGATGAG GTTTCCCAGA CACAAAGAGA ATCCTTTTTC AGTGGTTTAC AAGGACGTGT TCCAGGCTTA TCTATCAATC CGACCAGTGG TGATCCCGGT GCATCATCGC AAATTGTATT GAGGGGCTTT GTATCAGTAA GTGGGGATAA CAGCCCACTG ATTGTAGTTG ATGGGTTGCC TATTGACAAT TCAATTATCA ACCAGACCAA TGATCTGATA GGAGGGGCAC CCAACCGTAA CTCGGATTAC TCCAATCGTG CTGGTGACAT TAACCCAGCA GATATTGAAA GTTATACCAT ATTAAAAGGG CCAGAAGCAA CAGCATTATA TGGTAACCTT GGAGCTAGTG GGGCTATTCT GATTACGACT AAAAAAGCAA AAGCAGGAAA AGGAAGTATT AACTACAGTA CAAACTTTAA CGTTTCTAGT GTAGTTAATA TGCCAGAAGT ACAAACCAAA TACAATCAGG GACTGAATGG AATATATGCT TCCAATACAA CAGTTTATGG TGGTCCGGTA TATCCTGAAG GAACCAAGCT TTATAATAAT TTTGATGCGT TCTTTCAGAA TGCAATTTCG CAGCAGCAAA ACCTTTCTTT TGAAGGAGGT ACTGAAAAAT ATACCTATCG TTGGTCCAAT CAATATGCCA CGTTTAATGG TACGGTGCCC AATACAAACC TGGATAAGTT TTCTTCACGT TTAACTGCGG AAGGAGAGAT TGCTCCCTGG TTAAAATTAA CAACGTTTTT TAACTATATC AATAGTAAGA ATGTGAAGCC AACTAAAGGT GTTTCTGGCT ACCTGTCTAC TTTACTGCGT TTTCCTCCAA GATATGACAT CAATTACTGG CAGGATGAAC TAGGTAACAG GGTGTTGCGT GTGGCTGATA TTTATAGTGA ATTTGACAAC CCTTTTTGGA CAGCCTATAA GAATACGTCA ACGGATGAGA CAAACCGTTT TATGATGAAC AATACTTTTC GTATCAGGCC AACTAAGTGG TTGAATATTA ATGTAACGAT GGCCGCAGAT GTCTCTAACA CAGCTGGATT GCAGGCTTTT AACGGACAGT CTTATGCAGG TTCAGGTTCT GCTGATGATC CTGCTTTGGG AAGAATCACA ACTTATGACC GGAAAACCAG GATACTGAAT GGCTCTGTTG TTGCCTCAGC AAATCATAAA ATCGGGAACT TCAGTACCAC ATTTGTACTA GGTGGGAATA TAGGTGATAA TTATATCAAT ACGAACTCAA TATATGGGGA AAAAATGTAT GATCCTAACT TCTATAGTAT CAACAATACA TTGCCTACTA CACAGAGAGC ACGTAATTCT ATCAATAATT ATAGAACTGT TGGTGCATTT GCCCAGGCGG TTTTAGGTTA CAATTCGCTT GTTTATTTAA CACTTTCAGG AAGGGTTGAT GGTGCTTCAC GTTTGATGCC AAATGACCCA TATTTTGCCT ACCCTTCTGC CAGTTTTGCT TTCAATTTTA CTGATCTTAA GTATTTCAAG GAAATTGACT GGATAACAGG TGGTAAGCTT AGGGCTTCGG TAGGTATAAC AGGTAAAGAG CCCTGGAGAA CTTATGCTGT TTTAACCAAT TTAACACCAA GAACATCGAG TGGAGGTGGC TTTTCTTATG ATTATAACGG AGGAAACCGT AAGCTTAAGC CCGAAACAAC AATTAATTGG GAAACCGGGT TTGACTTGAA AATGTTTAAA GACAGACTGA GTTTAGATTT TACCTACTAT CGTTTATTGA GTAAGGATCA GATCATTCAG CCACGAATCA GCTACGCAAC GGGTTATGTC TTACGGATGC TGAATGGAGG TGAGGTACGG AATCAGGGGG TTGAAATTCA GGTGATGGGT ACACCTATCC AAAGAAAAGA TTTTGGCTGG GATGCGACAT TCAACTTTGC GCTGAACAGG GGTAAGGTAA TTTCTATTGC TGATGAACTG CCGGAATTGT ATGATTCGGA TACTTGGGTA CTTGGTGGCT TGAGGTCTGC GGTATTTCCT GGGGCAAGTA TGACTGCCAT TGGAGGTATA CGTTTTGACA GGAACAATAA TGGGGATATT TTGATCAATC CGGCTACAGG TCTTCCGTAT ACAACTGGTG AAAACTATGA AGTGATTGGT GATCGTCAGC CAAAATTTAC ATTTGGAATA ACAAATAATA TCAGGTTAAA AAGCTTTAAT CTTTCCTTTT TGTGGGATTT CCGTATTGGG GGAGATATTG TAAATGGTAC CGAATACGTA AATTATACAC GTGGTATAAG CACCAAGACT CTTGATAGAG AAGAACCACG AGTAGTAAAG GGCGTGTTAA AAGACGGCTT GGAAAACACA AACAACCCAA CGCCAAATGC AATTGCTGTT ACCCCGTATC TGAATTCACT ATATTATACT ACGAATGTTT CCGCAGAGAT GTTTGTTGAA AAGAACATCA ATACAATTCG TTTAAGGGAC ATTAGCTTAA GCTATGTTAT TCCAAAAACA GTTTTTAAGC GGTTGCCTTT TCTGCAAAGT GCAAGTGTGT TTGTAACGCT AACGGATGTG GTGTTGTTTA CCAACTATTC AGGAATGGAT CCTGAAAGTA ATTCAAACAA TGCCTCTCTC GGTGGAGCAG GTGGGATGGG AATAGACTAT TATAATATGG GTCGCCCTTT AACAGCAAAC TTTGGTTTGA AATTGAAACT TTAA
|
Protein sequence | MKKKLLMLFM GTFLLVSHAM AQQITVSGKV TSSEDGGIVP GASVLIKGTK TATQTNSSGV YTIQTKAGDI LVFSYIGLLP QERPVGGSSI INVVLSADSK GLNEVVVTAY GIERDSKSLG YSTPKVSGDE VSQTQRESFF SGLQGRVPGL SINPTSGDPG ASSQIVLRGF VSVSGDNSPL IVVDGLPIDN SIINQTNDLI GGAPNRNSDY SNRAGDINPA DIESYTILKG PEATALYGNL GASGAILITT KKAKAGKGSI NYSTNFNVSS VVNMPEVQTK YNQGLNGIYA SNTTVYGGPV YPEGTKLYNN FDAFFQNAIS QQQNLSFEGG TEKYTYRWSN QYATFNGTVP NTNLDKFSSR LTAEGEIAPW LKLTTFFNYI NSKNVKPTKG VSGYLSTLLR FPPRYDINYW QDELGNRVLR VADIYSEFDN PFWTAYKNTS TDETNRFMMN NTFRIRPTKW LNINVTMAAD VSNTAGLQAF NGQSYAGSGS ADDPALGRIT TYDRKTRILN GSVVASANHK IGNFSTTFVL GGNIGDNYIN TNSIYGEKMY DPNFYSINNT LPTTQRARNS INNYRTVGAF AQAVLGYNSL VYLTLSGRVD GASRLMPNDP YFAYPSASFA FNFTDLKYFK EIDWITGGKL RASVGITGKE PWRTYAVLTN LTPRTSSGGG FSYDYNGGNR KLKPETTINW ETGFDLKMFK DRLSLDFTYY RLLSKDQIIQ PRISYATGYV LRMLNGGEVR NQGVEIQVMG TPIQRKDFGW DATFNFALNR GKVISIADEL PELYDSDTWV LGGLRSAVFP GASMTAIGGI RFDRNNNGDI LINPATGLPY TTGENYEVIG DRQPKFTFGI TNNIRLKSFN LSFLWDFRIG GDIVNGTEYV NYTRGISTKT LDREEPRVVK GVLKDGLENT NNPTPNAIAV TPYLNSLYYT TNVSAEMFVE KNINTIRLRD ISLSYVIPKT VFKRLPFLQS ASVFVTLTDV VLFTNYSGMD PESNSNNASL GGAGGMGIDY YNMGRPLTAN FGLKLKL
|
| |