Gene Phep_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1300 
Symbol 
ID8252400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1545488 
End bp1548577 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content45% 
IMG OID644934954 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091577 
Protein GI255531205 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.298752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TAATACAAAT GATCATCCTG CTCAACTGTT GTTTGCTGTC ATTTATTGCA 
ACTGCACAAA CAACAATTCG TGGTACGGTT AAAGATAATG CAGGTGGTTT ACCGGGGGTG
AGCATACAGG AAAAAAATGG AAAGGGCAAC GGGACCAGCA CCAATGAAAC AGGCGAGTTC
CGGATTACCC TGAAAGGGAA TTCCAATATT TTAGAGGTTA CAGCCATAGG CTACCTGAAA
CAGGAAGTGA ATGTGGCAGG CAAAACCACC GTAAACATTA CACTGAAAGA AGACACCAAG
GGCCTGGAAG AGGTGGTGGT AATTGGTTTT GGAACAGCCA AGAAAATAAC AAATACAGGT
GCCATCAGTA CCATTTCGGC TGCTGATGTG CGCACTACAC CCACGCCAAA TATCCAGAAT
ACGCTGGCAG GCCGTGCGCC GGGTTTTATT TCGCAGCAGC GCTCTGGCCA GCCCGGAAAA
ACCGGCGCCG ATTTTTACAT CCGGGGGGTC AACTCACTTT CGGGCGAATC GCAGAAACCT
TTGATCATCG TCGATGATGT AGAATATACT TACGATCAGG TGGCACAGTT AGATGTGAAC
GAAATTGAAA CCTTTACCAT CCTGAAGGAT GCCTCTACTA CTTCTGTATA TGGAATCAAG
GGCGCCAACG GTGTATTGGT GATCACTACA AGACGTGGCA AGATAGGTAA GCCAAAAGTA
TTTTTCAATA CGGAATCTGG TTTGCAGTCT GCCGTACACA AACCAAATTT CCTGGACGCC
TATACCGTCG CCAGCTTAAA AAACGAGGCC ATCAGGAATG ACGACAACGG CACACCGCCG
GAGTTTACTG ATGCAGACCT GGAGCACTGG CGTTTAAAAG ACGACCCTTA TGGACATCCG
GATGTAGACT GGTACAATGC CGTGTTTAGA AACAATGCTT ACCAGGTCCG CAATACCGTA
GACATTTCCG GAGGCAGTGA AAAGATCAAA TACTTTGTAT CCGCCGGACA GGTATTTCAG
AACGGGGCCC TCAGAAACTT CAGCAAAGGT ACGTATGAAG CCCCCGATAA CAATTATGCT
TATCAGCGCT ATACCTTCCG GTCCAACCTG GATATGCAGG CCACCAAAGA CCTGGCCTTA
CGTTTAGACG TAACCGGACG CACTGGTACC ATTACAGAAC CACATATTGC CACCTCGCCT
TTAAGCACCG TATATAGCTT TCAGCGCCTT CCGCCTTATG CAGAGCCCTT ACTTAACCCC
GACGGGAGTT ACCCCTGGGC ATTTCGCTCA AGGAGTTCCT TCTATGAAAC CAGTCTGATT
GGCCGCTTGG CTTTACAGGG CTACGATAAA ACCTACAGAA ATGAGTTTAA TGTACTCGTG
AGTGCCGATC ATAAGCTCGA TTTTATTACC CAGGGTTTAT CTGTACAGGC CAGAATTGCC
TATTCAGGTG ATGTGAGTTA TGACAGAAAA CTTTACCGCA ACAACATCCC GGCCTTTTAT
TATAACCCGG TAAACAATAC CTACACCATT CACAGCAATA ACCTGTATCG TTTGGAGCCC
TTAACCCTGG AGAGTTCTGC GGAAAATGCC ATTACCAGGA AAACGCTGAA TACCCTGGCC
AAGATTAATT ATAACCGGTC TTTCGGAAAC CACAATATTG GGGGGCTGGT GCTTTATAAT
ATAAATGATG TTACAAAAGG CAGTTACAAT ACAGATACCA ATGTTAAACT GCTACAGGAG
TATGCACCGG TAAGTTCCAA CGGATTTTCA TACAGGGCCA GTTATGATTA TAAGCAGCGT
TACCTGGTCG ATTTTAGCGG GGCATATAAC GGGACCAGCC AGTTTGTTGG AAAAAAAACC
AAAGGTTTTT TCCCGGCAGT ATCTGCAGGC TGGAACATTG CTGAAGAACC TTTTATGAAA
AATAACTTTA AATTCATCGA TCTCTTAAAA TTAAGAGGTT CCTGGGGAAT GACCGGGTCC
GACATTACAA AGGGCAACAA CTATAAAACA GAACAGATTT ATGGAACGGG TCCAAACTAT
AATTTTGGTG AAAGTTCTAA TACTTTTACC AGTATACAGG AAGGTAGTTT AGGAAATCTG
GACATTACCT GGGAGAAATC CAAAAAAACA GATATCGGTT TGGATGCACA ATTTTTTGGC
GGAAAGCTAA GCTTAACGGC AGATTATTTC TATGATTACC GTTACGACCA GTTGTATATC
AAAGAAGACG TGTTGAAGAT TATCGGTGTC GACCTGCCCT ATACCAATTC GGCCATTACC
GAAAACAAAG GTTTTGACGG GCAGCTTGGG TTTCGTCATA AAACCGGTAA CCTGAATTAT
TCGGCCAGCT TTACCTTCTC CAGGGCAAGA AACAAAGTGG TGTACCAGGG AGAGGCTGCA
CCAAGATATG CATACCTGGC AAAAACTGGC CTTCCAATAG GCCAGGGTTT TGGCTATAAT
GCCCTGGGCT TTTTCCAGAC ACAGGAAGAA GTTGATAATT ATGCGCATGT TGCAAATGCC
AAACCTGGAG ATATCAAATA TGAAGATGCC AATAATGACG GGTTAATTGA TCAGGAAGAT
TACCGGGCCA TTGGTAAACC TAATTTGCCA CAAACCGTAT TGGGTACAGC CTTAGGTGTA
GAATACAAGG GCTTCAGCTT AAATGTATTT TTTCAGGGCA GTTTCGATTA CAGTTATCGG
ATCGCAACGG CAGGGGTTAT CCCTTTCCAG GGCAACCTGC AAAAATCTGC CCTGGGCAGG
TGGACACCAG AAACCGCGGC AACAGCAACG TTTCCACGCT TAAGCAATGA TCTTGCCGGG
CCAAGCAGTC CTTCAAATGC CTCTTCTTTC TGGATGGTCG ATGCCCATTA CATCCGGTTA
AAATCGGTCG ATATCGGATA TATGCTGCCC AAACAATGGA CCAGTAAAGT CAAGATCAGT
TCGGCCAGGA TTTATGTAAG CGGTTATGAC CTGTACACCT GGGCCAATTT TGATCTGTAT
TCACAGGATC CGGAAATCGC CAGCGGTGGC AGTGCCGGAA CCTATCCTGT GCAAAAAGTA
ATTAACCTGG GCCTGCAAGT TGGATTTTAA
 
Protein sequence
MKKIIQMIIL LNCCLLSFIA TAQTTIRGTV KDNAGGLPGV SIQEKNGKGN GTSTNETGEF 
RITLKGNSNI LEVTAIGYLK QEVNVAGKTT VNITLKEDTK GLEEVVVIGF GTAKKITNTG
AISTISAADV RTTPTPNIQN TLAGRAPGFI SQQRSGQPGK TGADFYIRGV NSLSGESQKP
LIIVDDVEYT YDQVAQLDVN EIETFTILKD ASTTSVYGIK GANGVLVITT RRGKIGKPKV
FFNTESGLQS AVHKPNFLDA YTVASLKNEA IRNDDNGTPP EFTDADLEHW RLKDDPYGHP
DVDWYNAVFR NNAYQVRNTV DISGGSEKIK YFVSAGQVFQ NGALRNFSKG TYEAPDNNYA
YQRYTFRSNL DMQATKDLAL RLDVTGRTGT ITEPHIATSP LSTVYSFQRL PPYAEPLLNP
DGSYPWAFRS RSSFYETSLI GRLALQGYDK TYRNEFNVLV SADHKLDFIT QGLSVQARIA
YSGDVSYDRK LYRNNIPAFY YNPVNNTYTI HSNNLYRLEP LTLESSAENA ITRKTLNTLA
KINYNRSFGN HNIGGLVLYN INDVTKGSYN TDTNVKLLQE YAPVSSNGFS YRASYDYKQR
YLVDFSGAYN GTSQFVGKKT KGFFPAVSAG WNIAEEPFMK NNFKFIDLLK LRGSWGMTGS
DITKGNNYKT EQIYGTGPNY NFGESSNTFT SIQEGSLGNL DITWEKSKKT DIGLDAQFFG
GKLSLTADYF YDYRYDQLYI KEDVLKIIGV DLPYTNSAIT ENKGFDGQLG FRHKTGNLNY
SASFTFSRAR NKVVYQGEAA PRYAYLAKTG LPIGQGFGYN ALGFFQTQEE VDNYAHVANA
KPGDIKYEDA NNDGLIDQED YRAIGKPNLP QTVLGTALGV EYKGFSLNVF FQGSFDYSYR
IATAGVIPFQ GNLQKSALGR WTPETAATAT FPRLSNDLAG PSSPSNASSF WMVDAHYIRL
KSVDIGYMLP KQWTSKVKIS SARIYVSGYD LYTWANFDLY SQDPEIASGG SAGTYPVQKV
INLGLQVGF