Gene Phep_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1015 
Symbol 
ID8252109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1186264 
End bp1189245 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content42% 
IMG OID644934669 
ProductTonB-dependent receptor plug 
Protein accessionYP_003091298 
Protein GI255530926 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.045736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTG GATTATTACC TAAAATTCTA CTTATTGTCA AATATGGCTA TGTTTTGCTG 
CTTATTTATG GCAGTTTGGC TATAATTGGA CACACCGAAG CTTTCTGTCA GAAGCATGAA
GGCATTACCA TCAATGGGAA AGTTTCCGAT GAAAAGGGTG AGACTTTACC CGGAGTAACG
GTAAAAGTAA AAGGAACAGC CGTTGGCACA ACAACAGATG CGGATGGAAA TTATACCGTT
AAGGCCCCAA AAGGGGCCAT GCTTACTTTC AAAAGCATTG GCTACCTACT GAAAGAGATT
GAAGTGAAAG ACCAGGTCGG GATCAATGTG GTTTTACAAA CTGATACTAA AACATTAGAG
GAACTCATCG TAGTTGGTTA CGGAACCCAG AAGAAAAGTG ATGTAACAGG TGCTGTAGCA
TCCATCTCTA AAGAAAGATT AGACAACATG GTAAGAACAG ATGTGGTACA GCTGATACAG
GGGGCAGCAG CCGGACTAAA TGTGTCTACT ACTGCAGCCG GGGCAGATCC TGAAAGTGGT
GCTGTGCTGT TGATCCGTGG CCGTAAATCT ATCAGTGCCA GTAACGATCC ATTGATCATT
CTTGACGGGA TCCCTTATAA TGGCATCTTA TCTGATATCA ATTCCAATGA TATTGAAAGC
CTGGATATTT TAAAAGATGC TTCTGCAGTT GCAATTTATG GTTCCAGGGC TTCAAATGGT
GTAATTCTGA TCCAAACGAA ACAAGGTACC AAAGGAAAGG CAGCCATTAA ATATGATTTT
TTGTACGGGT TACAGAATGT GGCCAATTTC CCACACCTGA TGAACGGTCA GGAGTATTAC
GATTTTAAAA AGGGAGTTAC CAATCCTGAT GATGATACCG AAGCTGCAAT TACCCCATCA
GAACTGGAAG TATACAATTC AGGTTCCTAC AATTCATTTA CCTGGAAAGA CCTGATCCTG
AGAAGGGGGA ACAGTCAGCA GCATAATCTA TCTGTATCTG GCGGTGCGGA AAAAACGACT
TACAATGTAA GTATGTCTTA CCTGGGTACA AAGGGTATTG TCATTAATGA CCAGTACAAA
CGCATCAATA CCAGGATAAA TGTTACTTCA AATATCAAAA GCTGGTTAAC ACTGGGCAGC
AGCAGTATGG CTGGGTATAT CAACAACAGC GGAGCCAAGC CCTCGTTTAT CGACCTGTTT
AATAAATCGC CTCTGGCGGT CCCTTTCAAC CCTGACGGTT CTGTAAACAT TACACCGATT
GCCGATGACC CAAGGAAGAT CAATCCGATT GAAAACCTTT TGTATGACGA TTTAAAAAGG
AAGTATTCCG TTTCCAGCAA TAACTACTTA AATGTTAACC TTCCTTTTGT AAAGGGACTT
TCTTACCGGC TGAATACAGG TGTGCAGTAT GAATCAGCTG AGAAAAACTG GTACCAGGGC
ACCAATACAG GTAAAAGCGG AGCCCTGAAA GGGGAGAGTG AAACCAATTT GGGGGTTAAA
TATTCTTATA CCATTGAAAG TATCTTTTCC TACAAAAGAG ATATCGGAAA ACACAATATT
TTCCTTACCG GCCTGTTAAG TGTTGAAGAA AAGGAAAATA AGAACAGCAT TTTAAACGGA
CAGGGTTTTG CCAACGACTT TTTATCTTAT TACGGCATTA CACAGGCCAG CAAAATTGTT
CCTTCCTATA ACTATTTTAA AACTAACCTC TTGTCGCAGA TGTTCAGGGC AAATTATGCT
TACGACAACA GGTACCTGTT CACGTTTACC GTGCGCAGGG ATGGCTTCTC GGGTTTTGGT
GCAAACAGAA AATACGGAGT TTTCCCTTCA GTGGCCCTGG GCTGGAATAT TGCCAATGAA
AAATTCCTCA GTGGCGTGAA AGAAACGCTT AGTACATTGA AACTGCGTGC ATCTTACGGC
CTTAGCGGAA ACCAGGCCAT TAGTCCTTAT CAAACCCTTT CGCAGCTTAC TGAAGGTGAT
TATATTGACG GAACAGTACC TGCACCCGGT TACATTCCTT CCACGCTGGG AACTGCCGGC
CTGGGCTGGG AATCTACACG GGCATTTAAT ATTGGTTTAG ATTTTGGCCT GTTTAATTCC
AGAATTACCG GTGATGTTAA TGTGTATAAA AATAAAACGA ATGATTTGCT CCTTAAACGT
GCTATTTCTG CAGTACACGG GGTAAACAGT GTATTTCAGA ATATTGGTAA AACCATCAAT
GAAGGGATAG AGGTCAGCAT TAACAGCAGG AACATTACCA AATCTAAATT CACCTGGGAC
TCAAACATTA ACTTCTCATT CATCAATACA GAAATATTGG ATCTATACGG GGACGGGAAA
AATGACATTG CCAATAACTG GTTTTTAGGA GAGCAGATCA AGGTAAATTA CGATTACAGA
TTTATTGGGG TCTGGCAGGA AGAAGATACG GAGCTGGCAG CTAAATATGG TGCAAAACCC
GGATATGCAA GGTATGAGGA TCTAAATAAT AATGGAGTTT ATGATCCTGA CGACCGTCAG
CTCATCGGTT CGCCAGAGCC TAACTTTACC TGGGGCCTTA CCAATAATTT CAAGTATAAA
AACTTCGGTT TATCGGTATT TATGTACGGT AAGATGGGCA CACTGAAGGC CAACCCCTAT
AAAGACAGAA ATTATTTAAT TGCCAGGACC TACTGGACAC CGGAGAACAG AAATAATGAG
TTCTGGGCCA ATTCCAGTCA GGCTAACCGT TATCTGGGAA AAGGGATAAC ACCTAGTGTC
TACGACAATG CAGATTTTAT CAGAATTAAA GATATCACGC TAAGTTATTC CTTTGCACCA
AAATTACTTT CAACAGCAGG CCTGAACAGG CTTAATGTAT TTTTTAGTGG GAAAAACCTC
TTCACCATCA CAAAATGGGG CGCGCTCGAT CCGGAGCTGG ATGCGCAACG GGCTATCCCT
TTACAGCGCG AATATATCAT GGGTTTAAAT CTTAGTTTCT AA
 
Protein sequence
MKFGLLPKIL LIVKYGYVLL LIYGSLAIIG HTEAFCQKHE GITINGKVSD EKGETLPGVT 
VKVKGTAVGT TTDADGNYTV KAPKGAMLTF KSIGYLLKEI EVKDQVGINV VLQTDTKTLE
ELIVVGYGTQ KKSDVTGAVA SISKERLDNM VRTDVVQLIQ GAAAGLNVST TAAGADPESG
AVLLIRGRKS ISASNDPLII LDGIPYNGIL SDINSNDIES LDILKDASAV AIYGSRASNG
VILIQTKQGT KGKAAIKYDF LYGLQNVANF PHLMNGQEYY DFKKGVTNPD DDTEAAITPS
ELEVYNSGSY NSFTWKDLIL RRGNSQQHNL SVSGGAEKTT YNVSMSYLGT KGIVINDQYK
RINTRINVTS NIKSWLTLGS SSMAGYINNS GAKPSFIDLF NKSPLAVPFN PDGSVNITPI
ADDPRKINPI ENLLYDDLKR KYSVSSNNYL NVNLPFVKGL SYRLNTGVQY ESAEKNWYQG
TNTGKSGALK GESETNLGVK YSYTIESIFS YKRDIGKHNI FLTGLLSVEE KENKNSILNG
QGFANDFLSY YGITQASKIV PSYNYFKTNL LSQMFRANYA YDNRYLFTFT VRRDGFSGFG
ANRKYGVFPS VALGWNIANE KFLSGVKETL STLKLRASYG LSGNQAISPY QTLSQLTEGD
YIDGTVPAPG YIPSTLGTAG LGWESTRAFN IGLDFGLFNS RITGDVNVYK NKTNDLLLKR
AISAVHGVNS VFQNIGKTIN EGIEVSINSR NITKSKFTWD SNINFSFINT EILDLYGDGK
NDIANNWFLG EQIKVNYDYR FIGVWQEEDT ELAAKYGAKP GYARYEDLNN NGVYDPDDRQ
LIGSPEPNFT WGLTNNFKYK NFGLSVFMYG KMGTLKANPY KDRNYLIART YWTPENRNNE
FWANSSQANR YLGKGITPSV YDNADFIRIK DITLSYSFAP KLLSTAGLNR LNVFFSGKNL
FTITKWGALD PELDAQRAIP LQREYIMGLN LSF