Gene Phep_0441 details


Gene Information

Locus tag: Phep_0441
Symbol:
ID: 8251526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 520152
End bp: 523340
Gene Length: 3189 bp
Protein Length: 1062 aa
Translation table: 11
GC content: 44%
IMG OID: 644934089
Product: TonB-dependent receptor plug
Protein accession: YP_003090727
Protein GI: 255530355
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
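
The coordinates and lengths in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 520152
end_bp = 523340

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 3189 1062
```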


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTTAA CTATTTTCTT ATTAATCATG GGCTGCCTGG CTGTACAAGC TTCAGGATAT 
GCCCAAAAAG TAAACTTATC TGTACAGAAT GCAGGTATAG AAAGCGTATG CATAGCTATA
AAAAATCAGA CCGGTTATTT TTTTCTGTAC GACGCTGATG TGCTTAAAAA ATCGGGTAAT
GTTAGTCTGG AACTCAAAAA TGCCGACCTG ACCGAGGCCT TAAGTAAAAT GACTTCAGGA
AAAGGGCTGG GCTATAAAAT CATAGACCAG ACCGTGATCA TCTCTAAGCT CAAAGTATCC
CAGGAAGAGA TCGTCATTAA AGGAACGGTA AAATCCAAAG AAAGCTCGGG ACGGGCTGAT
ATGCCGGTGC CGGGAGTAAT GGTGACCATA AAGGGCACTA AAAAAGCTAC AGTGACTAAT
GGCGACGGTA ATTACACCAT ACAGGCGCCG GCGAATGCAA CCCTGGTATT TTCTATGATT
GGTTACGGCA GCAGGGAAAT CGCTGTCAAT GGCAATAAAA CCATTAATGT ACTGATACAG
GAAACCGCCA GTGAACTCCA GGAAGTTATT GTGACTGCTT ACGGTACATC AGAGAAAAAA
GAAAATCAGA TCGGCAGTGC ATTTCAGGTA ACCAGCGAAC AACTGGAGCG CAGGCCGGCA
AAAAGGATAG ATGAGCTGCT GGAAGGTATA GTACCAGGCT TACAATATGA GGTGCAAGGC
GGGGGTACCA GCAGCCCCAG ACCAAGATAT CAGACCCGGG TTCGCGGCGA AGCCTCTTTT
GGGGCATCAA ATGAGCCACT TTGGGTACTG GACGGAATTC CTATAAATAC CGGTAATGAA
ACCAACATGA CGCTTGGCGT AAATACAAGT ATCAGTCCGT TAACCTATAT CAACCCTGCC
GATATTGAAT CGGTTGTGGT ATTAAAAGAT GCAACTGCGA CTTCCATTTA CGGAGCAGAT
GGATCAAACG GGGTAATATT GATTACCACT AAAAAGGGTA AAGCCGGAGA AAGAAAGATT
AACTATGCCT TCAGAACAGG GATAAACCTG ATCAATAACA ATCGTTTCCA TGTACTTAAC
GGCAATGAAT ACAGGGAACT TTATGCAGAA TCTTATAGAA ACAATACGGC TTTAAACCAG
GCAGAAATGC CCGATCTGGG CAGCAACAAT ACCGATTGGT ATGATGTATT TTTCCGTACG
GGAGTAAATA CACAACATGA CCTTTCTTTT TCGGGCGGCT ACAAAAACAC CAGGTATTAT
GTATCGGGTG CCTATTACAA TGAGCGCCCT ATCATGATCA AAAACAGGAC CCAAAGGTTT
TCAAGCAGGA TTAACCTGGA TCAGAAAGTG AATAAATCCA TTGATTTATT CCTGAGGGTG
GGCGCGTCCT ATAACATCAA CAATATGTTT AGTCCTGGCA CTAATTATTA TACAAACAGG
CCTACGGATA GCCCGTTCCG CCCTGACGGC ACGCCTATTC TGGCATTTTA TAATAAATTA
CTTGAAGCAG AACACAACGA CAACAGGCAA AAAGCAAATG TGCTTCAGGG CAATGCCGGC
GGAACAATAA ATATCCTGCC AGGGTTTACG TTTACCAGCA CAAATGGGAT TGACTATCAA
TCCATCAAAG AAGATTTTTA CAGCTCCCAG TTTGCGTACA GCGACAGGGG CGAAGGTATG
GCCGCGTTTT CAAAAACCAG GAACTTCGAC TGGAACAGTC AGCAGCGCCT TAATTTCGAC
AAAACCTTCG GTAAACATGA CGTATCGGCA TTGCTGGGTG CAGAAGCCAG AAGCCAGGAC
AGGGAATCCA ATGCCATCAT CGGCAATGGT TTTGAAAATG ATGACATCAG GGATGCCAAT
AAAGCCACAA AAATCCGGTA TACCACTTCG GGCGACGAAA AGACGGCATT GTCTTATTAC
GGACAGCTCA GGTATACATT TGATGGCAGG TACAGCGTGT TGGGCAGTTT CCGTAACGAT
GGTAATTCTG ACTTCGGGTC TGACGTGAAG TGGGCCAAAT TCAGTTCAAT TGGCACGGCA
TGGACCATCA GCAATGAAAG CTTCTGGAAA ATAAGACAAG TTGATTTTGC CAAACTTAAA
ATGAGTTATG GTACCAATGG AAACTCCAGA ATCGGGGCCT ACAGGTCTAA AGGTATTTAC
AGTACCAATG CCGACAATGC TACCTATAAC GGTGAGTTGG GTGCAAAAAT GATCAGTGGC
GAAAATCCGG TTTTATCATG GGAAACCACT TATATCATCA ACGGAGGGCT TAGCCTGGGC
TTGTTCAAAA GAATCTCACT GGAGATAGAA GGCTATAGAA ATGTAACCGA AGGCTTACTG
AATGATGTAG ACGTTTCACG TACCACTGGT TTTACCTATA TCTTACAGAA TGTAGGATCA
GTAAGAAACA AGGGAATAGA GCTGACCTTA AATACACAAA ATATCCTTAA AAAAGATTTT
ACCTGGAATA CCAGATTCAA CCTGGCCAGG AACACCAATC GGATCCTGAA GTTGTACAAC
AACAATACCA AGGTACTGGA TAAAGTGATC CGTCAGGTTG GAGAGGACGT GAATGTTTTT
TACCTGGTAC GATGGGCCGG TGTAGACCCG GCCGACGGGG GACCGCTCTG GTATGATACA
CGTGGTAACC TTACCAAAAC ATTTGATCTG GCCAACAGGG TTACTGTAGG TTCCTCTACC
CCTGATTTTT TTGGGGGGAT GACCAACTCC TTCCAGTATA AGCAATTTAC GCTTTCTGCC
CTGATGATCT ATAACGTGGG TGGTTATGCC TTTAGCGACC TGCAGCGGGA TTCAGAATCG
GATGGCCGTA ACCTCAAAGA CGACAACCAG TCGACCAACC TTTTAGACAG GTGGAGAGAG
CCCGGAGACC TGAGCAACAT TCCCAAAACC GTACTTAACG AAAATGCGAA CAATGCACGT
AACTCTACAC GGTTCCTGCA TAAAAAAACC AGTCTGCGTT TGCAAAACGT AAGTGTAAAC
TACAACTTTT CAGAAGCATT CCTGAAGCGG ATCAGTTTAA GTCGTGCGAA TGTTTATTTG
CAGGCCGACA ATGTAGGTTT CTGGACACCC TATACCACTT CCTCAAACCG CAACGATTAT
AAAAACTCTT TCAATCCATA TCCACAGCCA CTTGTACTTT CATTTGGTTT AAATGTTAGC
TTAAAATAG
 
Protein sequence
MRLTIFLLIM GCLAVQASGY AQKVNLSVQN AGIESVCIAI KNQTGYFFLY DADVLKKSGN 
VSLELKNADL TEALSKMTSG KGLGYKIIDQ TVIISKLKVS QEEIVIKGTV KSKESSGRAD
MPVPGVMVTI KGTKKATVTN GDGNYTIQAP ANATLVFSMI GYGSREIAVN GNKTINVLIQ
ETASELQEVI VTAYGTSEKK ENQIGSAFQV TSEQLERRPA KRIDELLEGI VPGLQYEVQG
GGTSSPRPRY QTRVRGEASF GASNEPLWVL DGIPINTGNE TNMTLGVNTS ISPLTYINPA
DIESVVVLKD ATATSIYGAD GSNGVILITT KKGKAGERKI NYAFRTGINL INNNRFHVLN
GNEYRELYAE SYRNNTALNQ AEMPDLGSNN TDWYDVFFRT GVNTQHDLSF SGGYKNTRYY
VSGAYYNERP IMIKNRTQRF SSRINLDQKV NKSIDLFLRV GASYNINNMF SPGTNYYTNR
PTDSPFRPDG TPILAFYNKL LEAEHNDNRQ KANVLQGNAG GTINILPGFT FTSTNGIDYQ
SIKEDFYSSQ FAYSDRGEGM AAFSKTRNFD WNSQQRLNFD KTFGKHDVSA LLGAEARSQD
RESNAIIGNG FENDDIRDAN KATKIRYTTS GDEKTALSYY GQLRYTFDGR YSVLGSFRND
GNSDFGSDVK WAKFSSIGTA WTISNESFWK IRQVDFAKLK MSYGTNGNSR IGAYRSKGIY
STNADNATYN GELGAKMISG ENPVLSWETT YIINGGLSLG LFKRISLEIE GYRNVTEGLL
NDVDVSRTTG FTYILQNVGS VRNKGIELTL NTQNILKKDF TWNTRFNLAR NTNRILKLYN
NNTKVLDKVI RQVGEDVNVF YLVRWAGVDP ADGGPLWYDT RGNLTKTFDL ANRVTVGSST
PDFFGGMTNS FQYKQFTLSA LMIYNVGGYA FSDLQRDSES DGRNLKDDNQ STNLLDRWRE
PGDLSNIPKT VLNENANNAR NSTRFLHKKT SLRLQNVSVN YNFSEAFLKR ISLSRANVYL
QADNVGFWTP YTTSSNRNDY KNSFNPYPQP LVLSFGLNVS LK
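
To illustrate how the protein sequence relates to the gene sequence, the following minimal sketch translates the first 60 bases and the last 9 bases of the gene. It assumes the standard codon assignments, which translation table 11 shares for amino acids (the two tables differ in permitted start codons). The `translate` helper and `CODON_TABLE` are hypothetical names written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and its final nine bases, copied from above.
first_60 = "ATGAGGTTAA CTATTTTCTT ATTAATCATG GGCTGCCTGG CTGTACAAGC TTCAGGATAT"
last_9 = "TTAAAATAG"

print(translate(first_60))  # MRLTIFLLIMGCLAVQASGY
print(translate(last_9))    # LK
```

The two outputs match the first 20 residues and the final 2 residues of the protein sequence listed above.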