Gene Phep_4101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4101 
Symbol 
ID8255235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4944735 
End bp4947941 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content43% 
IMG OID644937765 
ProductTonB-dependent receptor plug 
Protein accessionYP_003094354 
Protein GI255533982 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TACCTATTCT TCTTGCCTGT ATCCTGATGT GCGGTGCCGT TTCGGTAAAG 
GCCTTCAATA CAAAAAAGCA GGGAGTTTTT AATTACACCA CCAACTGTAT TAACGAAAGT
AGTCAAACCG TCCAGCGGGT CTTAACTGGT ACAGTTTACG ACGAAACAGG TCTGTCCATG
CCCGGAGTAA CTATAAAAGT ACAGGGTAAA GAACGGGGAG CAGTAACCAG CGGAGATGGA
AAATTTACCA TACAGGTCAA TGATGATGCC GAGGTTCTGG TATTTTCTTT TGTGGGGTAC
ATCACACAAA GAATTACAGT CGGCAGTCAG AAAACACTCG ACGTGAAATT ACTACCGGAT
CCTAAAAATG CACTGGAAGA GGTTGCTGTT GTAGCTTTCG GCACCCAGAA AAAAGAAAGC
GTAATCGGAT CCATCACTAC TATAAAACCG GGCGACCTGA AAATTCCTTC AAGTAACCTT
ACCCAGTCGC TTGCAGGTAG GGTGGCCGGT GTAATAGCCT ATCAAAGGAG TGGCGAACCC
GGAGCGGACA ATGCCGATTT CTTCGTACGC GGAATCACTA CCTTTGGTAC CAATACCAAA
CCCTTGATCC TAATCGACGG GATCGAGCTT ACTACAACTG ATCTGGCCCG GTTGCAAACC
GATGATATTG CCACTTTCTC TATCATGAAA GATGCTACAG CAACGGCGCT ATATGGTGCG
CGAGGAGCCA ATGGGGTAAT CCTGGTAACC ACTAAACAGG GGGTTGCCGG TAAGCCAAAA
GTGTCACTAA GAGTAGAAAA CTCTGCAACG GCACCCACCA GCAATGTTGA ACTGGCAGAC
CCGGTAACCT ATATGAAACT TGCAAATGAG GCAACTGCAA CAAGAAATCC TTTACTGGCA
TTGCCCTACC TTGAAGATAA AATAGAAAAT ACCGCTAAAG GCATCAACCC ATTGGTTTAT
CCGGCCAACG ACTGGCGCAA AATGTTGTTT AAAGACTATG CCACCAATCA GCGTTATAAT
TTAAATGTAA GTGGAGGCGG GCCGATAACG AGGTATTATG TTGCAGGGTC TTATACTAAA
GATAACGGTA TATTAAATGT TGATAATAAG AACAACTTTA ACAACAACAT TGACCTGAAA
AGTTATTCAC TGCGTGCCAA TGTAAATATA GACCTGACCA AGTCTACCGA GTTGATTGTT
CGTTTAAGCG GTAATTTTGA TGATTATACA GGTCCGATTG ATGGAGGTAC GGAGATGTAC
AGGAAGGTTA TGCGTTCCAA TCCCGTACTT TTCCCTGCTT ATTTCCCCAT AGATGAGGAA
CATAAGTTTG TTAAACACAT TATGTTCGGA AATTATGATT TGGGCAATAA GTACATTAAT
CCTTATGCCG ACATGGTAAG GGGATATAAA GATGAAAGCA GGTCCCAGAT GCTGGCCCAG
TTTGAATTAA AACAGGGATT GGATGTCATT ACCAAGGGCT TATCAGCAAG GGCGATGGTT
AACTTAACCA GAACCTCCAA ATTTAACTAC AACAGGGCTT ATAATCCTTT CTATTATCAG
GTTGGTGCTT ATGATCCCAT CTCTAACCAA TACTCAATAG CCCAGATCAA TACCAATGGG
ACAGAATACC TGGGCTTTGA CCAGGGGCCT AAAGAGCTAA CCTCTACTTT CTATTTTGAA
TCCACCGTTA ACTATAACAG AACTTTTGGC GCTAAACATG ACGTTGGCGG CTTATTGGTT
ATGATTGCCA GGCAAAGTCT TAATGCCAAT GCCGGAGATC TTTTACAATC ACTCCCATCC
AGAAATATGG GCGTATCAGG TCGTGCAACT TATGCTTACG ACAAACGGTA TTTTGCAGAA
TTTAACTTTG GCTACAATGG AACGGAACGC TTTGCAGAAG CCAACAGGTT CGGTTTTTTC
CCGTCGGCAG GTGTAGCCTG GAGTGCTTCC AACGAGAAGT TTTTTGAACC GCTTAAAAAC
ATCGTTACCA ATTTACGTTT CCGTTATACC TACGGATTGG TGGGGAACGA CCAGATCGGC
GATGTCAAAG ACCGCTTCTT CTACTTGTCT AATGTATTAA TAGGGCAAAC AAACGTCCGC
CGTGCAGTAT TCGGCCGTGA TCTTACCGAA TTTAAAGATG GAGTATTGGT AACGAGATAC
GCCAACCCAT ACATCACCTG GGAAAGGGCC ACCAAACAAA ATATGGCCAT GGAATTAAGC
CTTTTCGGTA AATCGAACCT GGTAGCAGAG TATTTCACAG AAAAAAGGGA CAATATCCTG
ATGTCGAGGG CCTCCATTCC AAATACCATG GGTTTATCTG CCGATACCAA GTCAAATCTG
GGTGAAGCAA GCGGCAGGGG AGTAGATATC TCTCTCGACT TTCAGCAGGC CTGGTCTAAA
GACCTTTGGC TATCCGTACG CGGTAATTTT ACCTATGCAA CCAGTAAATA CAGGGTATAT
GAAGAACCTG ATTATGCAGA ACCATGGCGG TCAAGGGTTG GAAATTCTTT ACAGCAGACC
TATGGTTATA TCGCAGAACG TTTGTTTGTA GATGATCAGG AAGCACTCAA CTCTCCAAAA
CAGGAATTTG GTGTGTATGG TGGCGGAGAT ATTAAGTATA CGGACGTAAA CCGTGATGGA
AAAATTAATG AGGCAGACAT GGTGCCAATT GGTAACCCCA CAGTTCCAGA AGTGGTGTAC
GGGTTTGGGT TCTCTTTAAG CTACAAGAAA TTTGACATTT CAGCATTTGC ACAAGGGGCA
GCCAACCAGT CATTCTGGAT TGATCCTGCA GCTACATCTC CTTTTGTACC CTATTACTAT
CCAAATACTT TAGAGTCAAC ATCAGGTCGC ATCTTTACCA ATCAGTTGTT AAAGGCTTAC
GCAGATAGCC ACTGGTCGGA AGAAAACAGG GATGTATATG CTACTTTACC AAGGTTAAGC
AATACACCAA ATGCAAATAA TAACCAGCCT AGTACCTGGT TTATGCGCAA TGCAGCCTTT
TTAAGGTTAA AACAAGTCGA AATAGGCTAT ACCTTCTCTA AAAAACTTGT AGAGCGCATC
AAGGCCACAA ACTTCAGGAT CTATGTGAGT GGAACCAATT TGCTGATGTT GAGCAAATTT
AAGATATGGG ATGCTGAAAT GGCTGGTAAT GGCCTTGGCT ATCCTTTGCA AAAGGTATTT
AACGCAGGTC TTAATTTAAC CTTTTAA
 
Protein sequence
MKKIPILLAC ILMCGAVSVK AFNTKKQGVF NYTTNCINES SQTVQRVLTG TVYDETGLSM 
PGVTIKVQGK ERGAVTSGDG KFTIQVNDDA EVLVFSFVGY ITQRITVGSQ KTLDVKLLPD
PKNALEEVAV VAFGTQKKES VIGSITTIKP GDLKIPSSNL TQSLAGRVAG VIAYQRSGEP
GADNADFFVR GITTFGTNTK PLILIDGIEL TTTDLARLQT DDIATFSIMK DATATALYGA
RGANGVILVT TKQGVAGKPK VSLRVENSAT APTSNVELAD PVTYMKLANE ATATRNPLLA
LPYLEDKIEN TAKGINPLVY PANDWRKMLF KDYATNQRYN LNVSGGGPIT RYYVAGSYTK
DNGILNVDNK NNFNNNIDLK SYSLRANVNI DLTKSTELIV RLSGNFDDYT GPIDGGTEMY
RKVMRSNPVL FPAYFPIDEE HKFVKHIMFG NYDLGNKYIN PYADMVRGYK DESRSQMLAQ
FELKQGLDVI TKGLSARAMV NLTRTSKFNY NRAYNPFYYQ VGAYDPISNQ YSIAQINTNG
TEYLGFDQGP KELTSTFYFE STVNYNRTFG AKHDVGGLLV MIARQSLNAN AGDLLQSLPS
RNMGVSGRAT YAYDKRYFAE FNFGYNGTER FAEANRFGFF PSAGVAWSAS NEKFFEPLKN
IVTNLRFRYT YGLVGNDQIG DVKDRFFYLS NVLIGQTNVR RAVFGRDLTE FKDGVLVTRY
ANPYITWERA TKQNMAMELS LFGKSNLVAE YFTEKRDNIL MSRASIPNTM GLSADTKSNL
GEASGRGVDI SLDFQQAWSK DLWLSVRGNF TYATSKYRVY EEPDYAEPWR SRVGNSLQQT
YGYIAERLFV DDQEALNSPK QEFGVYGGGD IKYTDVNRDG KINEADMVPI GNPTVPEVVY
GFGFSLSYKK FDISAFAQGA ANQSFWIDPA ATSPFVPYYY PNTLESTSGR IFTNQLLKAY
ADSHWSEENR DVYATLPRLS NTPNANNNQP STWFMRNAAF LRLKQVEIGY TFSKKLVERI
KATNFRIYVS GTNLLMLSKF KIWDAEMAGN GLGYPLQKVF NAGLNLTF