Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4006 |
Symbol | |
ID | 8255140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4838145 |
End bp | 4841207 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937670 |
Product | TonB-dependent receptor |
Protein accession | YP_003094259 |
Protein GI | 255533887 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.629585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTTACCATGT ATGGCTCATT TTGCTGATCA TAGCATTTTC AGGCCAAAAG CTTTATGCCC AAACCAAGGT TTCCGGAACG GTTAGGGATG CAAGTGGAAT AGGTTTACCT GGCGTAAGTG TGGTCCAGGA AAACACCCAG AACGGAACGG TAACTGACCA ACAGGGCCGG TATACACTGA GCCTAAAAGA GGGGGCAGCC CAAACACTTA CGTTTAATTA TGTTGGCTTT TTAAAGCAGA CGATCCCTGT AAATGGAAGT TCAAGTGTGA ATGTTACACT GAAAGAGAAC AATGAATCGC TGAATGAAGT GGTGGTGCTG GGCTATACTT CACAGAAAAA ATCAAACCTG ACGGGTGCTG TAACTTCAGT GCACATGCCC GACCTGGAAG ACAGGAGGGT TGCGGATGTG GCCCAGGTGT TACAGGGACA AGTTGCCGGG GTACAGATTA CCCAAAGCAC CGGAGCGCCG GGCGACCCGA TCAGCATCCT GATCCGTGGC CAGGGTACAT TTGGAGATAA CAGTCCGCTT TTTATCGTTG ACGGCAACCC GACTCAGGAT ATTTCCTTTC TGAATCCGGC CGATATGCAA TCGGTAACTG TACTGAAGGA TGCTTCTGCG GCTGCAATTT ATGGCTCAAG GGCTTCGGCC GGGGTAATTG TGATCACTAC CAAAATGGGC AGTGCGGGAT TATCTACCAT TGACATCAAC TACTATAATG GCATTCAAAA GGTAGCCAAT TTGCCTAAAA TGTTAAATAC CACCCAGTAC ATGAATAAAA TGGAAGAGTC CTGGAACAAT TCCGGATATG CTGGTACCAA TCCCTACACA GCAGATAAAA ACCGGACTGA TCTGGCCAAC ACCAATTGGC TGGACGAATT GTTTGAGACC GGCCGTTCGC AAAATGTACA GCTGAGCGCA AGTGGGGGAA GTGAGAAGAT CAAATATTTA ATTTCAGGAG CCTATTATGG GCAGGATGGG ATTGTGGTTT ATAAAAACGA TAAATATCAG CGGGTTAATT TCCGTACCAA TGTTACCGGT AATCTGTCGG ACCGTTTTAC TGTTGGGGCT AACCTTCAGC TTTCTTACGC CAAACAGGAT AAAATGTCGT CTAAAGGAGA TGAACCCGGT GTGATCCGCC ATGCATTTAT CCGCCCGCCG GTAATCCCAG TATACAAAGA TCCGAGTGAC CCTACTTATT CTGCTGCTGA CCCTTTTACC GATTTACCTT TTTATAAAGT CGATGGCACT TACCAGAGCA GATATGAGTA CAGCAGTAAT CCTATAGCCC TTGCTTATTT TACCAACGAC AAAAGGTCGT TGTTTAAGAC CTTTGGAAAT GTATATGCAG AATATGCACT GCTTAGCAAC AAGGAGCTAA AATTCAGGAC CAATGTGGGC CTTGACCTTA ATTTTACCCA CAATAAAGCC TTTAACCAGA ACTTTGGTGA TGATGATGGC GGCGGTGCTG CGGAAGATAA AGGTTTGGGC AGGAAAAACA GGCCAAATTC TTTAAATGAG GACCGTGGCC AGGAAAGTAC CATTACCTGG AACAATACCC TGAATTATGA AAAAACGATC GGGAAGCACC TGATCAATGC CATGGTGGGA AGTGAGTACA TCACCAATTA CTCATCATCT ATTGGTGCAA CGAGGAACAG GTTTGATTAT ACCGCTCCGG AATTTCAGTT TATTGATTAC GGTAATACTT TGACCAATTT GTGGAATGGA GGAAATGGTG CAGAATGGAC TTTGTTTTCT TTATTTAGCT CTGCTACCTA CGTATATGAT TCCAAATATA TGATCACGGG TAATTTCAGA GCAGATGCTT CGTCGAGATT TGGACCCAAT AACCACTGGG GTTATTTCCC CTCTGTATCT GCGGGCTGGA AAATTTCGCA AGAGGATTTC ATGAAAGATG TGCGCTGGAT CTCTGATTTG AAATTAAGGG CAAGTGTAGG TACGCTCGGA AATCAGAACA TCGGGAATTA TACTTATTTA ACACTATATA CCAAGGTAGG GGATGAGACA AAACTGCTTC GTTATGGTAA TCCAGACCTG AAATGGGAGA GTACCACCCA AACCAATATT GGTTTGGACA TGGGGATGCT CCAAAACAAA ATTTATTTAA GTGTTGATTA TTTTAAAAAG AAAACAAGCG GAATTTTGCT GCCCCTTTCT TTACCACACT TAGTGGGAGA CGTACAACCT ACTATTGTGA ACGCTGCAGA AGTGAAAAAC TCGGGACTTG AGGTTTCTTT AAGTTACCGC AACAACGATG GCGTGTTTAA ATATGGTGTG AACGGAAACA TTGGTACATT GAAAAACCAG GTGGTAAAGC TACACCCGAA TCTGCCCAAC ATGATAGGAC AGGTAACCAA GACAGAACCC GGTCATCCGA TCAATTCTCT GTTTGGTTTT GTAATGGAAG GTATTTATCA GAACCAGGCT GAAATAAACA GTCATTTATC GGGTACGCTC AACCCTTCCG AACTTCCAGG TGACATTAGG TTTAAAGACC TTAACGGGGA TGGGGTGATC AACGATTCAG ACCGGGATTA TATCGGGAAC CCAAACCCTA AACTTTCTTA CGGACTAAAC CTTTCAGCTG GTTACAAGGG TTTTGACCTT TCGGCATTGT TCCAGGGCGT TCAGGGTGTA GATCGTTATA ACGACCTGAA AAAGATTATT GATTATGATT CCCGGCCTTT TAACCATTCT GTGAGGGTGC TGGACAGCTG GCACGGTGAA GGAACCAGTA ACAGCATACC GCGGTCTACC TTTACTGACA ATGGTAGCAG TAAAACATCC AGTATTTTTG TGGAAGACGC TTCTTACCTG CGCTTAAAAA ACCTGGAAAT AGGGTACTCC TTTAAGTCGC TGTTAACAAA AACGAAACTG GGTGTCCAGA ATATCCGTTT ATATGTTTCT GCGCAAAACC TGTTTACGGT TACAAATTAT ACAGGGCTGG ACCCGGAATC AACAGATGTG ATAGATATGG GTACTTATCC ACAATCCAAA GCCTTTCTGT TTGGTGTAAA CGTTAAATTT TAA
|
Protein sequence | MKKIYHVWLI LLIIAFSGQK LYAQTKVSGT VRDASGIGLP GVSVVQENTQ NGTVTDQQGR YTLSLKEGAA QTLTFNYVGF LKQTIPVNGS SSVNVTLKEN NESLNEVVVL GYTSQKKSNL TGAVTSVHMP DLEDRRVADV AQVLQGQVAG VQITQSTGAP GDPISILIRG QGTFGDNSPL FIVDGNPTQD ISFLNPADMQ SVTVLKDASA AAIYGSRASA GVIVITTKMG SAGLSTIDIN YYNGIQKVAN LPKMLNTTQY MNKMEESWNN SGYAGTNPYT ADKNRTDLAN TNWLDELFET GRSQNVQLSA SGGSEKIKYL ISGAYYGQDG IVVYKNDKYQ RVNFRTNVTG NLSDRFTVGA NLQLSYAKQD KMSSKGDEPG VIRHAFIRPP VIPVYKDPSD PTYSAADPFT DLPFYKVDGT YQSRYEYSSN PIALAYFTND KRSLFKTFGN VYAEYALLSN KELKFRTNVG LDLNFTHNKA FNQNFGDDDG GGAAEDKGLG RKNRPNSLNE DRGQESTITW NNTLNYEKTI GKHLINAMVG SEYITNYSSS IGATRNRFDY TAPEFQFIDY GNTLTNLWNG GNGAEWTLFS LFSSATYVYD SKYMITGNFR ADASSRFGPN NHWGYFPSVS AGWKISQEDF MKDVRWISDL KLRASVGTLG NQNIGNYTYL TLYTKVGDET KLLRYGNPDL KWESTTQTNI GLDMGMLQNK IYLSVDYFKK KTSGILLPLS LPHLVGDVQP TIVNAAEVKN SGLEVSLSYR NNDGVFKYGV NGNIGTLKNQ VVKLHPNLPN MIGQVTKTEP GHPINSLFGF VMEGIYQNQA EINSHLSGTL NPSELPGDIR FKDLNGDGVI NDSDRDYIGN PNPKLSYGLN LSAGYKGFDL SALFQGVQGV DRYNDLKKII DYDSRPFNHS VRVLDSWHGE GTSNSIPRST FTDNGSSKTS SIFVEDASYL RLKNLEIGYS FKSLLTKTKL GVQNIRLYVS AQNLFTVTNY TGLDPESTDV IDMGTYPQSK AFLFGVNVKF
|
| |