Gene Phep_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1036 
Symbol 
ID8252130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1209422 
End bp1212712 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content40% 
IMG OID644934689 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091318 
Protein GI255530946 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTAA AAAAGGGGTT TGTCATTTTT ATTTTATTGG TACTCAGTGT GGCCGTAAAG 
GCCCAGAAAC TCAATTATAC ACAAAAGAAT GTTTCCCTGG TAAGGCTTTT TAAGGAGATC
AAACAGCAGA CCGGTTTTAG TGTGGCCTGG AATGAAAAGG AATTCAATGT GAACCAGCGC
ATAGACATCA GCTACAAGGA TGCCGATGTT AAGAAAGTAA TGGATGACAT TTCGGCCCGA
CTCCAGCTTA GTTATACCAT TATGGGTAAG GCAATTATTG TAAAAGATAA AAGACCTGCA
GCAGGTATTG ACCAAACCAC AAATCCTAAA CCCGCGAATA ATCCTATCCC TATTGTTCAG
GAAAGGGAAT TTTTCCTGCA ACAGGTTGAA ATTGTAAGTA CAGGTTATCA AAACATCCCT
AAAGAACGCG CAACAGGAAG TTTTGCACTC GTAGACAGTG CGCAACTAAA CCGGAGGGTT
AGTCCTGATA TATTTAGCAG GCTGGAAGGT ATAACCAGTG GTTTGCTGTT CAATAAAAAT
ACAGTAAACA GCAATTCTGG TAATTTGGAT TTATCTATAA GGGGCAGAAG TACAATTTTT
GCAAATGATC AGCCCCTGAT TATCCTGGAT AATTTTCCTT TTAATGGTGA TTTCAATTCC
ATCAACCCGA ATGATCTGGC CAATATTACG GTATTGAAGG ATGCAGCAGC AGCTTCTATC
TGGGGCGTAA GGGCTGGAAA TGGTGTAATC GTGATCAGTA CCAAACGGGG AAAGACCGGA
CAAGCATTAA ACATTTCTTT AAATACCAAC ATTACTGTTG CAGGCAAACC GGATGTTTTT
TACAATCCGA ACCACCTGTC TTCATCGGAT TTCATCGATA TTGAAACTTT TCTGTTTAAC
AATGGAAAGT ACGATGCTGC ATTGACAGAC CAGGTTAATT ATCCGGTTGT TTCGCCGGTA
GTGCAAATCC TGAACAAACA AAGGCAGGGA CAGTCTGCCG CCGAAACGGA AAAACAATTA
AATGCCCTGC GTGGAAATGA CATCCGTAAT GAAGAGTTGA AATACTTTTA CCGCAAACCG
GTTTCCCAGC AATATTTTTT GAATGCAAGT GGCGGAACAG CTAGATCGAG TCACTATTTT
TCACTGGGCT ATGACAAGAC ACTTTCCAGC CTGGTTAACA ATGACAATGA CCGGATAACC
ATCAATAGTC AAAATACCTT CAGGCCAATA AAGAACCTGG AAATTGAAGC TGGTTTTAAC
TATACAAGGA TTGCATCAAG AGTGGACAGT ACCATACGAG AAACCTCTGA TGTCAATTTT
ACACCTTATT ATCAGTTCAG GGATGCCAAT GGCAATCCAA CTGTATTTGA TAAGAACTTC
AGCGCTGATT ATAAACAACA GGCGTTGACA AAGGGATTTC TGGATTGGTC TTATGTACCT
TTAACCGAAC TGGGAAAATC TCCATTTATA TCAAAAAATA ATGATGTTCG TGTAAATGGT
GCTTTGAAAT ATACCATAAT ACCCGGCTTA AGTGCCGCAC TAAAATACCA GTACCAGTTG
CTTGACAACA AAACCGAACG TTACAATAGT TTAGAGACTT ACCAGAGCAG GAACCTGATC
AATCAATATT CTGTTTTGAC TTCAGACAGG GTAAGCGGGT ATCATATTCC ATTGGGCGGA
ATTCTTTATA GTGCAAATGG CAAAGCTGTT TCCAATAATT TTCGGGCTCA GCTGGCTTAT
CAGAGAGATC TACAAAATTC TGCAGTTTCG GCTATATTGG GTTATGAATT GTCAGAATTT
TCTTCTGATA TCAGTGACCA TTTTGATTAT GGTTATGATC AGAAAACAGG TACATCCATA
CCTGTTGACT CTACAAGTAC CTTTAATTTA AACCCTTCAG GTACGGGTAA AATCAATACC
GGAGTTGCGC CTTTTGGAAA GCTGGACAGG ATCAGGTCTG TGTTTGCCAA TCTAGCTTAC
AGTTTTAACA ACAAATACGT ATTATCGGCT AGTGCCAGGA TTGATGGATC GAATTATTTT
GGGGTAAAAA CCAATCAGAA AAATGTCCCG CTCTGGTCGG CCGGTGCTTT ATGGAACGTA
GACAGGGAAA CTTTTTATCA GCTAAACTGG TTACCCATTT TAAAATTAAG GGCTTCTTAT
GGTTACAATG GAAACCTGGA TAAATCGAAT ACCGGAATTA CTACTTTCCG GTACAATGGA
ATAGGCGCGC TTTACACCGG GCTGCCCAAC GCGGGAATCC TTAATATTGG GAATCCCGAG
CTGCGCTGGG AGAAAATTGG TATTGCCAAT TTCGGGATTG ATTTTGGCCT GAAAGATCAG
ATTGTTACAG GAAAACTGGA ATACTATTTT AAAAACGGAA CGGATATTTT AGGCGATAAG
GCTTTTGCTT CGAGTACAGG GATCAAAACC TTAAGAGGAA ATTATTCGAA AATGAAGGCC
AATGGCATGG ATGTTTCTTT AATTTCACAA AATTTAAGAG GTGAACTGAA ATGGACTACT
AACTTTTTGT TTTCATTTGT ACACGATAAA GTGACTTCTT ACGATGTGAT AGAGCCAAGA
AGTGCTTATT ATGTGGGGAC TTATAGCACC AAACCTGTAT TGGGAAGGCC AGTATATGGA
ATTTATAGCT ACAAATGGGC GGGGCTTGAT CCGGAAAATG GTGATCCGCG TGGTTATTTA
AATGATGAGG TCAGCAAAAA TTACAGTACA ATTGTCAATA CCACTTCTGT AAATGATCTG
GAATACAACG GGCCGGCAAG ACCAACTGTA TTCGGTGGCT TAAACAATAT TTTTTCTTAT
CGTAAGTTTA CTTTGGGTTT TAACATCAGC TATAAACTAG GCTATTATTT CAGAAAACCT
AGTATAAATT ATACAAACTT ATATACTGGC AATATGGGAT TCTTTATGAA TAGGGATTAT
GAGAACCGCT GGCAAAAAAC CGGGGATGAA CTGATTACCA ATGTACCATC TATGGCCAGT
TATGCAACAG ACAGTTTCAG GGATATTTTT TATAACAATT CTTCTGTTAC AGTTGCCAAA
GGAGATCATA TCCGTTTGCA GGATGTGAGC TTGAGCTATG ACCTTGATCA AACCAACTGG
AAAAGGATAC CTTTTAAAAA AGTTCAGCTT TACTGTTATG CCAGTAATCT GGGAATGATC
TGGAAGGCAA ATGATTTCGG ATTAGACCCG GATGTGATTC CGCTTATTAA GGAAAGATTG
TCCAATCCCC TATCCAAAAG TTTTGCTTTT GGATTAAAGG CAAATTTTTA A
 
Protein sequence
MHLKKGFVIF ILLVLSVAVK AQKLNYTQKN VSLVRLFKEI KQQTGFSVAW NEKEFNVNQR 
IDISYKDADV KKVMDDISAR LQLSYTIMGK AIIVKDKRPA AGIDQTTNPK PANNPIPIVQ
EREFFLQQVE IVSTGYQNIP KERATGSFAL VDSAQLNRRV SPDIFSRLEG ITSGLLFNKN
TVNSNSGNLD LSIRGRSTIF ANDQPLIILD NFPFNGDFNS INPNDLANIT VLKDAAAASI
WGVRAGNGVI VISTKRGKTG QALNISLNTN ITVAGKPDVF YNPNHLSSSD FIDIETFLFN
NGKYDAALTD QVNYPVVSPV VQILNKQRQG QSAAETEKQL NALRGNDIRN EELKYFYRKP
VSQQYFLNAS GGTARSSHYF SLGYDKTLSS LVNNDNDRIT INSQNTFRPI KNLEIEAGFN
YTRIASRVDS TIRETSDVNF TPYYQFRDAN GNPTVFDKNF SADYKQQALT KGFLDWSYVP
LTELGKSPFI SKNNDVRVNG ALKYTIIPGL SAALKYQYQL LDNKTERYNS LETYQSRNLI
NQYSVLTSDR VSGYHIPLGG ILYSANGKAV SNNFRAQLAY QRDLQNSAVS AILGYELSEF
SSDISDHFDY GYDQKTGTSI PVDSTSTFNL NPSGTGKINT GVAPFGKLDR IRSVFANLAY
SFNNKYVLSA SARIDGSNYF GVKTNQKNVP LWSAGALWNV DRETFYQLNW LPILKLRASY
GYNGNLDKSN TGITTFRYNG IGALYTGLPN AGILNIGNPE LRWEKIGIAN FGIDFGLKDQ
IVTGKLEYYF KNGTDILGDK AFASSTGIKT LRGNYSKMKA NGMDVSLISQ NLRGELKWTT
NFLFSFVHDK VTSYDVIEPR SAYYVGTYST KPVLGRPVYG IYSYKWAGLD PENGDPRGYL
NDEVSKNYST IVNTTSVNDL EYNGPARPTV FGGLNNIFSY RKFTLGFNIS YKLGYYFRKP
SINYTNLYTG NMGFFMNRDY ENRWQKTGDE LITNVPSMAS YATDSFRDIF YNNSSVTVAK
GDHIRLQDVS LSYDLDQTNW KRIPFKKVQL YCYASNLGMI WKANDFGLDP DVIPLIKERL
SNPLSKSFAF GLKANF