Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0501 |
Symbol | |
ID | 8251588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 599429 |
End bp | 602689 |
Gene Length | 3261 bp |
Protein Length | 1086 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644934151 |
Product | TonB-dependent receptor plug |
Protein accession | YP_003090787 |
Protein GI | 255530415 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00157557 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAA AAACTACAAA ATTTAGTCGA AAGGACTCAA TCAGGTGGTT AGGTACAGTT CTATCTATGG CTGTAGTAAT CGTGCCGCCT TTTATTCCCC TTTCACTTAG TGCAAAGGCT GTGGATATTG GTAAAGTGGC GTCCGTTACC TATGGTCTTA ATGAACGCAG GTCCTTGTTT CAGGAGCGTG TCATAACCGG CCGGGTCAGC GACGTTAATG AGGCTGGAAT AATCGGAGCG GGTATAAGGG TTAAAGGGAC AAAAATTGCT ACTGTTTCGG ATTTGAATGG AAACTTTTCT ATCACAATTC CGAACAACGA TGCCATACTG GAGTTTACCT CTATTGGCTA TGCCTCAAAA GAAGTGGCTG TTAAAGGCTT AAAGGAAGTT CGTGTCACCT TGCAAGAGTC AACATCTACG ATGGACGAGG TTGTCATCAC CTCTTTTGGT ACACAGAAAA AGGAAAGTGT GGTTAGTGCT ATTTCAACAG TTCGACCTTC TACCATGAGA AATTCGTCAA GTAACCTGAC GACCGCTTTG GCAGGAAGGG TTGGTGGTGT GATTGCCTAT CAACGGTCTG GAGAACCAGG ACTGGATAAC GCCGAATTTT ACATCCGTGG CGTTACGACT TTTAGTACAT CGGGTAAACG GGATCCGCTG ATCCTGATTG ATGGGGTAGA AATGGCAACA AACGACCTTG CCCGGTTGAA CGTAGACGAC ATCGAATCCT TTTCCGTACT AAAGGATGCC AGTTCTGCGG CACTGTATGG CGCACGTGGA GCAAACGGCG TGATTCTCGT GACAACAAAG GTAGGAGCGG TAGACAAACT GGCGATCAGT GTAAGAGCTG AACAGTCCAA CTCTTATAAT TCTAAGTTAG CCCAACTGGC CGATCCTATT ACCTACATGA AACTTCACAA CGAAGCGGTG CGAACGCGCA ACGCCCTGGT AGACTTGCCC TATTCTTCTT CGAAAATCAG GGAAACAGAG CTGGGCACCG ACCCGCTTCG ATATCCATCC GTAAACTGGT ATGACTATTT AATAGACGAT AAGGCGATCA ACCGCCGTCT TAACCTGAAC CTCAACGGTG GGGGACAATC TGTACAATAT TACCTGGCTT CCAACTTTCA GAATGACAAG GGAATACTAA AACAAAGCGA AGAAAATCTG GTCGAGAACA ACATCAATAT CAATCGGCTT CAGATACGCT CGAATGTGAC CATTAAGTTT GCGCCAACCA CTACTGGTGT TGTGCGTGCA TACGGATCAT TTGATGATCG GACTGGCCCT TATATTCCTA ATATGGTAGA CGAGGATAAT AAGACAGTAT CGGGCGGCGC TGCAGTATTC CGCGCTGCCC GAAATGCGTC GCCGGTACGG TTCCTCCCAT TTTATCCGGC AGACGCAGCC AACGAATATA CTAACCATAT CCTTTTCGGG ATGAATCCGG AAATGAGTTT TTCCAACCCC TGGGCGCAAG TGGTGAGCTC ATTCCAGGAA TCAAAAGAGT CCATGATGCT GTTGCAAATG GAAATGGATC ATAAATTTAC CGGGAATCTG GAGGGTTTAA ACGTAAGAGG AGCATTCAAC GCGATGCGAA AAGCGTATTA TGCGCAAACC CGTGGCTATG TTCCATTTTT TTACAGTCTT GCCAATACCA TAGACGGTTC CTACCAATTG ACACCGCTCA ATCCAGACAG TGGTACTGAG TATCTGAACT TTGTAAGTCA GGGCAGGACG GTCAATGCGT CCCAGTACGG TGAACTTCGC CTAACGTACA ACAAAATCTT TAATAAAAAG CATGATCTCA ATGCCACATT GGTTGGTACG ATCAGGAATG AAACTGGTAC CATTCAATAT GATGCGAGAG TGTCTGATGA CCTTCAGGCC TCTCTTGCAC GAAGGAACAT CTCTTCAGCG GGCAGGTTAT CTTATAATTA TGATACCCGC TACGTTCTGG AGCTAAACTT TGGATATAAC GGTACCGAAC GTTTCGCCGA GAAGAACCGT TGGGGATTTT TTCCTACTGC CGGAGTTGGG TGGATGATCA GCAACGAACC GTTTATGAAA GGTGTGAAGG ATGTGATATC TAAATTACAA TTACGTGCTA CGTATGGTAA GGTTGGAAAC GATCAGATAG GGTCTTTGTA TGATCGGTTT TTCTACCTGT CTCAGATCGA TATGAACGGG ACCGGATATT GGTTTGGTTT GAACAGAAAC TACCGTTCGG GTATTTCAAT CAATCGGTAT GCCAACGACC TGATCACCTG GGAAGTTGCA AAAAAGTTGA ATATAGGTTT AAATATTGGA TTGTTCAACG ATCTGACGCT TATCGCTGAC TTTTTTCAGG AAACCCGAAG TAATATTCTT CAGGACCGGG TAGATATACC AACTACTATG GGTCTTCGGG GGATCCCTCA GGCAAATGTT GGAGTGGCAC AGGGAAGGGG ATTTGATTTG GAGCTAACCT ACAACAAAAT GTTCAATAAC GGTTTATCGT TAATCGTAAA TGGCAATTTC ACTTATGCAG CCAGCACAGT TAAGAAGTGG GAAGAACCTG ATTATAGTGA TGTTCCCTGG CGTACCCGCG TTGGACAGAA GATCAATCAG AAGATAGGTT ATATCGCCGA GCGACTGTTC ATTGATGAGG AAGAAGTAAA CAACTCGCCC AGACAATTAT TTGGAGAATA CGGTGCAGGA GATATTAAAT ACAAGGACAT CAACAATGAC GGTCAGATCA ATACGGACGA TATGGTAGCC ATTGGTTACC CGACTGTTCC TGAGATCATT TACGGAAACT CGATCTCGTT AGCTTATAAA GCCTTCGACA TCAACTTCTT TATTCAGGGG TCTGCCAGAT CCTCGTTCTT TATAAACCCC GCTGGTATCT CTCCCTTCCT CAACCAGGGA CAAAAAGCGC TGATGCAGAC GATCGCAGAC GATCACTGGT CTGAAACGAA CAGAAATATA GAAGCATTCT GGCCCCGCCT ATCTGAGTAC ACCATCTCAA ATAACAATCA GACGAGTACG CATTGGCTGC GAAACGGAAC CTTTATAAGA CTAAAACAGG CGGAAATAGG TTATACTTTA CCCAATCGCC TGACTAAAAG AGCCCGCATG AGTATGATGC GGGTGTACCT TAGCGGAACC AATTTGTTTT ACCTTTCAAA GTTTAAGATG TGGGATCCGG AAATGGGCGG GCTAGGCCTG GGATATCCGG TTCAGCGCGT GTTTAACTTA GGTTTGAATG TTAAATTTTA A
|
Protein sequence | MTKKTTKFSR KDSIRWLGTV LSMAVVIVPP FIPLSLSAKA VDIGKVASVT YGLNERRSLF QERVITGRVS DVNEAGIIGA GIRVKGTKIA TVSDLNGNFS ITIPNNDAIL EFTSIGYASK EVAVKGLKEV RVTLQESTST MDEVVITSFG TQKKESVVSA ISTVRPSTMR NSSSNLTTAL AGRVGGVIAY QRSGEPGLDN AEFYIRGVTT FSTSGKRDPL ILIDGVEMAT NDLARLNVDD IESFSVLKDA SSAALYGARG ANGVILVTTK VGAVDKLAIS VRAEQSNSYN SKLAQLADPI TYMKLHNEAV RTRNALVDLP YSSSKIRETE LGTDPLRYPS VNWYDYLIDD KAINRRLNLN LNGGGQSVQY YLASNFQNDK GILKQSEENL VENNININRL QIRSNVTIKF APTTTGVVRA YGSFDDRTGP YIPNMVDEDN KTVSGGAAVF RAARNASPVR FLPFYPADAA NEYTNHILFG MNPEMSFSNP WAQVVSSFQE SKESMMLLQM EMDHKFTGNL EGLNVRGAFN AMRKAYYAQT RGYVPFFYSL ANTIDGSYQL TPLNPDSGTE YLNFVSQGRT VNASQYGELR LTYNKIFNKK HDLNATLVGT IRNETGTIQY DARVSDDLQA SLARRNISSA GRLSYNYDTR YVLELNFGYN GTERFAEKNR WGFFPTAGVG WMISNEPFMK GVKDVISKLQ LRATYGKVGN DQIGSLYDRF FYLSQIDMNG TGYWFGLNRN YRSGISINRY ANDLITWEVA KKLNIGLNIG LFNDLTLIAD FFQETRSNIL QDRVDIPTTM GLRGIPQANV GVAQGRGFDL ELTYNKMFNN GLSLIVNGNF TYAASTVKKW EEPDYSDVPW RTRVGQKINQ KIGYIAERLF IDEEEVNNSP RQLFGEYGAG DIKYKDINND GQINTDDMVA IGYPTVPEII YGNSISLAYK AFDINFFIQG SARSSFFINP AGISPFLNQG QKALMQTIAD DHWSETNRNI EAFWPRLSEY TISNNNQTST HWLRNGTFIR LKQAEIGYTL PNRLTKRARM SMMRVYLSGT NLFYLSKFKM WDPEMGGLGL GYPVQRVFNL GLNVKF
|
| |