Gene Phep_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0501 
Symbol 
ID8251588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp599429 
End bp602689 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content45% 
IMG OID644934151 
ProductTonB-dependent receptor plug 
Protein accessionYP_003090787 
Protein GI255530415 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00157557 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA AAACTACAAA ATTTAGTCGA AAGGACTCAA TCAGGTGGTT AGGTACAGTT 
CTATCTATGG CTGTAGTAAT CGTGCCGCCT TTTATTCCCC TTTCACTTAG TGCAAAGGCT
GTGGATATTG GTAAAGTGGC GTCCGTTACC TATGGTCTTA ATGAACGCAG GTCCTTGTTT
CAGGAGCGTG TCATAACCGG CCGGGTCAGC GACGTTAATG AGGCTGGAAT AATCGGAGCG
GGTATAAGGG TTAAAGGGAC AAAAATTGCT ACTGTTTCGG ATTTGAATGG AAACTTTTCT
ATCACAATTC CGAACAACGA TGCCATACTG GAGTTTACCT CTATTGGCTA TGCCTCAAAA
GAAGTGGCTG TTAAAGGCTT AAAGGAAGTT CGTGTCACCT TGCAAGAGTC AACATCTACG
ATGGACGAGG TTGTCATCAC CTCTTTTGGT ACACAGAAAA AGGAAAGTGT GGTTAGTGCT
ATTTCAACAG TTCGACCTTC TACCATGAGA AATTCGTCAA GTAACCTGAC GACCGCTTTG
GCAGGAAGGG TTGGTGGTGT GATTGCCTAT CAACGGTCTG GAGAACCAGG ACTGGATAAC
GCCGAATTTT ACATCCGTGG CGTTACGACT TTTAGTACAT CGGGTAAACG GGATCCGCTG
ATCCTGATTG ATGGGGTAGA AATGGCAACA AACGACCTTG CCCGGTTGAA CGTAGACGAC
ATCGAATCCT TTTCCGTACT AAAGGATGCC AGTTCTGCGG CACTGTATGG CGCACGTGGA
GCAAACGGCG TGATTCTCGT GACAACAAAG GTAGGAGCGG TAGACAAACT GGCGATCAGT
GTAAGAGCTG AACAGTCCAA CTCTTATAAT TCTAAGTTAG CCCAACTGGC CGATCCTATT
ACCTACATGA AACTTCACAA CGAAGCGGTG CGAACGCGCA ACGCCCTGGT AGACTTGCCC
TATTCTTCTT CGAAAATCAG GGAAACAGAG CTGGGCACCG ACCCGCTTCG ATATCCATCC
GTAAACTGGT ATGACTATTT AATAGACGAT AAGGCGATCA ACCGCCGTCT TAACCTGAAC
CTCAACGGTG GGGGACAATC TGTACAATAT TACCTGGCTT CCAACTTTCA GAATGACAAG
GGAATACTAA AACAAAGCGA AGAAAATCTG GTCGAGAACA ACATCAATAT CAATCGGCTT
CAGATACGCT CGAATGTGAC CATTAAGTTT GCGCCAACCA CTACTGGTGT TGTGCGTGCA
TACGGATCAT TTGATGATCG GACTGGCCCT TATATTCCTA ATATGGTAGA CGAGGATAAT
AAGACAGTAT CGGGCGGCGC TGCAGTATTC CGCGCTGCCC GAAATGCGTC GCCGGTACGG
TTCCTCCCAT TTTATCCGGC AGACGCAGCC AACGAATATA CTAACCATAT CCTTTTCGGG
ATGAATCCGG AAATGAGTTT TTCCAACCCC TGGGCGCAAG TGGTGAGCTC ATTCCAGGAA
TCAAAAGAGT CCATGATGCT GTTGCAAATG GAAATGGATC ATAAATTTAC CGGGAATCTG
GAGGGTTTAA ACGTAAGAGG AGCATTCAAC GCGATGCGAA AAGCGTATTA TGCGCAAACC
CGTGGCTATG TTCCATTTTT TTACAGTCTT GCCAATACCA TAGACGGTTC CTACCAATTG
ACACCGCTCA ATCCAGACAG TGGTACTGAG TATCTGAACT TTGTAAGTCA GGGCAGGACG
GTCAATGCGT CCCAGTACGG TGAACTTCGC CTAACGTACA ACAAAATCTT TAATAAAAAG
CATGATCTCA ATGCCACATT GGTTGGTACG ATCAGGAATG AAACTGGTAC CATTCAATAT
GATGCGAGAG TGTCTGATGA CCTTCAGGCC TCTCTTGCAC GAAGGAACAT CTCTTCAGCG
GGCAGGTTAT CTTATAATTA TGATACCCGC TACGTTCTGG AGCTAAACTT TGGATATAAC
GGTACCGAAC GTTTCGCCGA GAAGAACCGT TGGGGATTTT TTCCTACTGC CGGAGTTGGG
TGGATGATCA GCAACGAACC GTTTATGAAA GGTGTGAAGG ATGTGATATC TAAATTACAA
TTACGTGCTA CGTATGGTAA GGTTGGAAAC GATCAGATAG GGTCTTTGTA TGATCGGTTT
TTCTACCTGT CTCAGATCGA TATGAACGGG ACCGGATATT GGTTTGGTTT GAACAGAAAC
TACCGTTCGG GTATTTCAAT CAATCGGTAT GCCAACGACC TGATCACCTG GGAAGTTGCA
AAAAAGTTGA ATATAGGTTT AAATATTGGA TTGTTCAACG ATCTGACGCT TATCGCTGAC
TTTTTTCAGG AAACCCGAAG TAATATTCTT CAGGACCGGG TAGATATACC AACTACTATG
GGTCTTCGGG GGATCCCTCA GGCAAATGTT GGAGTGGCAC AGGGAAGGGG ATTTGATTTG
GAGCTAACCT ACAACAAAAT GTTCAATAAC GGTTTATCGT TAATCGTAAA TGGCAATTTC
ACTTATGCAG CCAGCACAGT TAAGAAGTGG GAAGAACCTG ATTATAGTGA TGTTCCCTGG
CGTACCCGCG TTGGACAGAA GATCAATCAG AAGATAGGTT ATATCGCCGA GCGACTGTTC
ATTGATGAGG AAGAAGTAAA CAACTCGCCC AGACAATTAT TTGGAGAATA CGGTGCAGGA
GATATTAAAT ACAAGGACAT CAACAATGAC GGTCAGATCA ATACGGACGA TATGGTAGCC
ATTGGTTACC CGACTGTTCC TGAGATCATT TACGGAAACT CGATCTCGTT AGCTTATAAA
GCCTTCGACA TCAACTTCTT TATTCAGGGG TCTGCCAGAT CCTCGTTCTT TATAAACCCC
GCTGGTATCT CTCCCTTCCT CAACCAGGGA CAAAAAGCGC TGATGCAGAC GATCGCAGAC
GATCACTGGT CTGAAACGAA CAGAAATATA GAAGCATTCT GGCCCCGCCT ATCTGAGTAC
ACCATCTCAA ATAACAATCA GACGAGTACG CATTGGCTGC GAAACGGAAC CTTTATAAGA
CTAAAACAGG CGGAAATAGG TTATACTTTA CCCAATCGCC TGACTAAAAG AGCCCGCATG
AGTATGATGC GGGTGTACCT TAGCGGAACC AATTTGTTTT ACCTTTCAAA GTTTAAGATG
TGGGATCCGG AAATGGGCGG GCTAGGCCTG GGATATCCGG TTCAGCGCGT GTTTAACTTA
GGTTTGAATG TTAAATTTTA A
 
Protein sequence
MTKKTTKFSR KDSIRWLGTV LSMAVVIVPP FIPLSLSAKA VDIGKVASVT YGLNERRSLF 
QERVITGRVS DVNEAGIIGA GIRVKGTKIA TVSDLNGNFS ITIPNNDAIL EFTSIGYASK
EVAVKGLKEV RVTLQESTST MDEVVITSFG TQKKESVVSA ISTVRPSTMR NSSSNLTTAL
AGRVGGVIAY QRSGEPGLDN AEFYIRGVTT FSTSGKRDPL ILIDGVEMAT NDLARLNVDD
IESFSVLKDA SSAALYGARG ANGVILVTTK VGAVDKLAIS VRAEQSNSYN SKLAQLADPI
TYMKLHNEAV RTRNALVDLP YSSSKIRETE LGTDPLRYPS VNWYDYLIDD KAINRRLNLN
LNGGGQSVQY YLASNFQNDK GILKQSEENL VENNININRL QIRSNVTIKF APTTTGVVRA
YGSFDDRTGP YIPNMVDEDN KTVSGGAAVF RAARNASPVR FLPFYPADAA NEYTNHILFG
MNPEMSFSNP WAQVVSSFQE SKESMMLLQM EMDHKFTGNL EGLNVRGAFN AMRKAYYAQT
RGYVPFFYSL ANTIDGSYQL TPLNPDSGTE YLNFVSQGRT VNASQYGELR LTYNKIFNKK
HDLNATLVGT IRNETGTIQY DARVSDDLQA SLARRNISSA GRLSYNYDTR YVLELNFGYN
GTERFAEKNR WGFFPTAGVG WMISNEPFMK GVKDVISKLQ LRATYGKVGN DQIGSLYDRF
FYLSQIDMNG TGYWFGLNRN YRSGISINRY ANDLITWEVA KKLNIGLNIG LFNDLTLIAD
FFQETRSNIL QDRVDIPTTM GLRGIPQANV GVAQGRGFDL ELTYNKMFNN GLSLIVNGNF
TYAASTVKKW EEPDYSDVPW RTRVGQKINQ KIGYIAERLF IDEEEVNNSP RQLFGEYGAG
DIKYKDINND GQINTDDMVA IGYPTVPEII YGNSISLAYK AFDINFFIQG SARSSFFINP
AGISPFLNQG QKALMQTIAD DHWSETNRNI EAFWPRLSEY TISNNNQTST HWLRNGTFIR
LKQAEIGYTL PNRLTKRARM SMMRVYLSGT NLFYLSKFKM WDPEMGGLGL GYPVQRVFNL
GLNVKF