Gene Phep_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4006 
Symbol 
ID8255140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4838145 
End bp4841207 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content44% 
IMG OID644937670 
ProductTonB-dependent receptor 
Protein accessionYP_003094259 
Protein GI255533887 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.629585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TTTACCATGT ATGGCTCATT TTGCTGATCA TAGCATTTTC AGGCCAAAAG 
CTTTATGCCC AAACCAAGGT TTCCGGAACG GTTAGGGATG CAAGTGGAAT AGGTTTACCT
GGCGTAAGTG TGGTCCAGGA AAACACCCAG AACGGAACGG TAACTGACCA ACAGGGCCGG
TATACACTGA GCCTAAAAGA GGGGGCAGCC CAAACACTTA CGTTTAATTA TGTTGGCTTT
TTAAAGCAGA CGATCCCTGT AAATGGAAGT TCAAGTGTGA ATGTTACACT GAAAGAGAAC
AATGAATCGC TGAATGAAGT GGTGGTGCTG GGCTATACTT CACAGAAAAA ATCAAACCTG
ACGGGTGCTG TAACTTCAGT GCACATGCCC GACCTGGAAG ACAGGAGGGT TGCGGATGTG
GCCCAGGTGT TACAGGGACA AGTTGCCGGG GTACAGATTA CCCAAAGCAC CGGAGCGCCG
GGCGACCCGA TCAGCATCCT GATCCGTGGC CAGGGTACAT TTGGAGATAA CAGTCCGCTT
TTTATCGTTG ACGGCAACCC GACTCAGGAT ATTTCCTTTC TGAATCCGGC CGATATGCAA
TCGGTAACTG TACTGAAGGA TGCTTCTGCG GCTGCAATTT ATGGCTCAAG GGCTTCGGCC
GGGGTAATTG TGATCACTAC CAAAATGGGC AGTGCGGGAT TATCTACCAT TGACATCAAC
TACTATAATG GCATTCAAAA GGTAGCCAAT TTGCCTAAAA TGTTAAATAC CACCCAGTAC
ATGAATAAAA TGGAAGAGTC CTGGAACAAT TCCGGATATG CTGGTACCAA TCCCTACACA
GCAGATAAAA ACCGGACTGA TCTGGCCAAC ACCAATTGGC TGGACGAATT GTTTGAGACC
GGCCGTTCGC AAAATGTACA GCTGAGCGCA AGTGGGGGAA GTGAGAAGAT CAAATATTTA
ATTTCAGGAG CCTATTATGG GCAGGATGGG ATTGTGGTTT ATAAAAACGA TAAATATCAG
CGGGTTAATT TCCGTACCAA TGTTACCGGT AATCTGTCGG ACCGTTTTAC TGTTGGGGCT
AACCTTCAGC TTTCTTACGC CAAACAGGAT AAAATGTCGT CTAAAGGAGA TGAACCCGGT
GTGATCCGCC ATGCATTTAT CCGCCCGCCG GTAATCCCAG TATACAAAGA TCCGAGTGAC
CCTACTTATT CTGCTGCTGA CCCTTTTACC GATTTACCTT TTTATAAAGT CGATGGCACT
TACCAGAGCA GATATGAGTA CAGCAGTAAT CCTATAGCCC TTGCTTATTT TACCAACGAC
AAAAGGTCGT TGTTTAAGAC CTTTGGAAAT GTATATGCAG AATATGCACT GCTTAGCAAC
AAGGAGCTAA AATTCAGGAC CAATGTGGGC CTTGACCTTA ATTTTACCCA CAATAAAGCC
TTTAACCAGA ACTTTGGTGA TGATGATGGC GGCGGTGCTG CGGAAGATAA AGGTTTGGGC
AGGAAAAACA GGCCAAATTC TTTAAATGAG GACCGTGGCC AGGAAAGTAC CATTACCTGG
AACAATACCC TGAATTATGA AAAAACGATC GGGAAGCACC TGATCAATGC CATGGTGGGA
AGTGAGTACA TCACCAATTA CTCATCATCT ATTGGTGCAA CGAGGAACAG GTTTGATTAT
ACCGCTCCGG AATTTCAGTT TATTGATTAC GGTAATACTT TGACCAATTT GTGGAATGGA
GGAAATGGTG CAGAATGGAC TTTGTTTTCT TTATTTAGCT CTGCTACCTA CGTATATGAT
TCCAAATATA TGATCACGGG TAATTTCAGA GCAGATGCTT CGTCGAGATT TGGACCCAAT
AACCACTGGG GTTATTTCCC CTCTGTATCT GCGGGCTGGA AAATTTCGCA AGAGGATTTC
ATGAAAGATG TGCGCTGGAT CTCTGATTTG AAATTAAGGG CAAGTGTAGG TACGCTCGGA
AATCAGAACA TCGGGAATTA TACTTATTTA ACACTATATA CCAAGGTAGG GGATGAGACA
AAACTGCTTC GTTATGGTAA TCCAGACCTG AAATGGGAGA GTACCACCCA AACCAATATT
GGTTTGGACA TGGGGATGCT CCAAAACAAA ATTTATTTAA GTGTTGATTA TTTTAAAAAG
AAAACAAGCG GAATTTTGCT GCCCCTTTCT TTACCACACT TAGTGGGAGA CGTACAACCT
ACTATTGTGA ACGCTGCAGA AGTGAAAAAC TCGGGACTTG AGGTTTCTTT AAGTTACCGC
AACAACGATG GCGTGTTTAA ATATGGTGTG AACGGAAACA TTGGTACATT GAAAAACCAG
GTGGTAAAGC TACACCCGAA TCTGCCCAAC ATGATAGGAC AGGTAACCAA GACAGAACCC
GGTCATCCGA TCAATTCTCT GTTTGGTTTT GTAATGGAAG GTATTTATCA GAACCAGGCT
GAAATAAACA GTCATTTATC GGGTACGCTC AACCCTTCCG AACTTCCAGG TGACATTAGG
TTTAAAGACC TTAACGGGGA TGGGGTGATC AACGATTCAG ACCGGGATTA TATCGGGAAC
CCAAACCCTA AACTTTCTTA CGGACTAAAC CTTTCAGCTG GTTACAAGGG TTTTGACCTT
TCGGCATTGT TCCAGGGCGT TCAGGGTGTA GATCGTTATA ACGACCTGAA AAAGATTATT
GATTATGATT CCCGGCCTTT TAACCATTCT GTGAGGGTGC TGGACAGCTG GCACGGTGAA
GGAACCAGTA ACAGCATACC GCGGTCTACC TTTACTGACA ATGGTAGCAG TAAAACATCC
AGTATTTTTG TGGAAGACGC TTCTTACCTG CGCTTAAAAA ACCTGGAAAT AGGGTACTCC
TTTAAGTCGC TGTTAACAAA AACGAAACTG GGTGTCCAGA ATATCCGTTT ATATGTTTCT
GCGCAAAACC TGTTTACGGT TACAAATTAT ACAGGGCTGG ACCCGGAATC AACAGATGTG
ATAGATATGG GTACTTATCC ACAATCCAAA GCCTTTCTGT TTGGTGTAAA CGTTAAATTT
TAA
 
Protein sequence
MKKIYHVWLI LLIIAFSGQK LYAQTKVSGT VRDASGIGLP GVSVVQENTQ NGTVTDQQGR 
YTLSLKEGAA QTLTFNYVGF LKQTIPVNGS SSVNVTLKEN NESLNEVVVL GYTSQKKSNL
TGAVTSVHMP DLEDRRVADV AQVLQGQVAG VQITQSTGAP GDPISILIRG QGTFGDNSPL
FIVDGNPTQD ISFLNPADMQ SVTVLKDASA AAIYGSRASA GVIVITTKMG SAGLSTIDIN
YYNGIQKVAN LPKMLNTTQY MNKMEESWNN SGYAGTNPYT ADKNRTDLAN TNWLDELFET
GRSQNVQLSA SGGSEKIKYL ISGAYYGQDG IVVYKNDKYQ RVNFRTNVTG NLSDRFTVGA
NLQLSYAKQD KMSSKGDEPG VIRHAFIRPP VIPVYKDPSD PTYSAADPFT DLPFYKVDGT
YQSRYEYSSN PIALAYFTND KRSLFKTFGN VYAEYALLSN KELKFRTNVG LDLNFTHNKA
FNQNFGDDDG GGAAEDKGLG RKNRPNSLNE DRGQESTITW NNTLNYEKTI GKHLINAMVG
SEYITNYSSS IGATRNRFDY TAPEFQFIDY GNTLTNLWNG GNGAEWTLFS LFSSATYVYD
SKYMITGNFR ADASSRFGPN NHWGYFPSVS AGWKISQEDF MKDVRWISDL KLRASVGTLG
NQNIGNYTYL TLYTKVGDET KLLRYGNPDL KWESTTQTNI GLDMGMLQNK IYLSVDYFKK
KTSGILLPLS LPHLVGDVQP TIVNAAEVKN SGLEVSLSYR NNDGVFKYGV NGNIGTLKNQ
VVKLHPNLPN MIGQVTKTEP GHPINSLFGF VMEGIYQNQA EINSHLSGTL NPSELPGDIR
FKDLNGDGVI NDSDRDYIGN PNPKLSYGLN LSAGYKGFDL SALFQGVQGV DRYNDLKKII
DYDSRPFNHS VRVLDSWHGE GTSNSIPRST FTDNGSSKTS SIFVEDASYL RLKNLEIGYS
FKSLLTKTKL GVQNIRLYVS AQNLFTVTNY TGLDPESTDV IDMGTYPQSK AFLFGVNVKF