Gene Phep_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1983 
Symbol 
ID8253087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2288134 
End bp2291439 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content38% 
IMG OID644935634 
ProductTonB-dependent receptor plug 
Protein accessionYP_003092253 
Protein GI255531881 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.445269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000345601 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTTTT ATAACCAAAT TGGATGCGGC CCAAGTGACC GCTATCTGAT TCAAATCCTG 
CGGATCATGA AATTTACCAC GTTCATCATC GCAATAACCC TAGTCCACGT AAGTGCCGCC
GGTAAGGCCC AGATCAGCCT GGACAAAAAA AATGTTCCGA TCAAGGAGGT TTTCAAAAGT
ATAACCGCCC AAACCGGTTA TGATATATTA TATAGCGATC GTACATTAAA CGATACTGAA
AAAATTAGTA TAACAGCGAA CAATGAGCCC CTGAAAACAG TGCTTGACAG GTGCTTAAAA
GGACAGCTTC TGGAATATGA ACTGGGGAAT AAGACCGTTA TTATAAAAAA ACGCAAGGCC
TTACTTCTCG ACAAGGTTAT TGATTTTTTT ACCAGAATTG ACGTACGTGG TCGGGTGGTA
GATGAAAACA ATCAGCCCCT AGTGGGGGCG GTTATAAAAA TAAAAGACGG CTTGTCCACT
ACCTCCACCA ATTCTAATGG AGAGTTTCTG CTAAAAAAAG TTGCCGAGGA TGCTACCCTG
ACTATCTCTT TTTTAGGATA TGAGACGCGG GAAATAAAGG CTGCTGAAAA TATCTCCATT
ATCAAATTAA CACCGAGTAC CGATAAATTA GAAGAGGTGG AAATCAATGC TGGGTACTAT
ACTGTTACAG ATAAAGAAAG GACTGGATCT ATTTCCAGAA TTTCATCAAA GGAAATTGAA
AAGCAACCTA TAAACAATGT GTTACAGGTT ATGCAGGCTA CTGTACCAGG CTTACAGGTT
ATACAAAACA CAGGTGTTCC AGGGGGCGGT TTTTCTGTTA GAATCAGGGG CCAAAATAGC
TTAACTCAAG GAAATGAGCC CTTTTACATC ATAGATGGCG TTCCATTCAC TGCAACAAGT
TTAGCACCAC CTGTTGGGGT AATAACACCA AATGCAAGTC CTTTGGCCAA TATCAACCCT
GCAGATATCG AAAACATTGA AGTGCTGAAA GATGCTGATG CCACAGCTAT TTATGGTTCC
AGAGGTGCTA ATGGTGTAAT TCTGATCACC ACGAAAAGAG GAAAAGCAGG AAAATCAAGT
GTATCTTTTT CTGCTAATCA AGGTATATCC AAAGTTGGAA GAAAATTGAA ACTTATGAAT
ACGCAGCAGT ATATAGAAAT GCGTAAAGAA GCAAAAGGGA ATGATAACCT GGCAATTTCT
ACTACAGATT ATGATATCAA TGGTACCTGG GATCAAAATA GGTACACAGA CTGGCAAGAT
GAGTTGATTG GCGGTTCAGC TCCTACAACA AACATATTGG CCTCACTTTC AGGAGGAGCA
GGTAATATTA CTTATTTAAT TGGTGGTAAT TTTTACAGTG AAGGAACTGT GTATCCCGGT
GACCAAACCT ATAAGAGAAG TTCAGGTAAT TTTAGCTTAC AATATACTTC GGACAATCAA
AAGTTCGATT CATCTTTCGA TGTAAATTAT AGCCAGATAA ATAGTAATCT TTTTCTTAAT
GATCTTACTC CTTTTATCAG ATTGCCACCA CATTACCCTT CCTTATTAAC TGATGAAGGA
ATTATAAATT GGGGAGATAA CACAATGGGG TCGAATCCAT TGGCACAGCT TCAAAAGCCA
TATGAAGCAA AAACCAACAA TCTTATTGCC AGCGCAGCGC TGAATTACAA AATAATTCCT
GATTTAAAAT TAAAAGCTCG TTTTGGGTAT ACAATTATGG ATAGGAAAGA GTTTAACAGT
CAGCCATTAT CTACTTATAA TCCCGCTAAT AGTCCCGGGC CTGAACAGCG AATTTCAAAA
TTTAGCAACA ACTCGACAAA TAATTGGACA TTTGAATCTC AGGTTGATTA TACAAGAAAA
ATTGGTAATG GTAAATTCAA TGCACTTTTG GGTGTTACAT TTCAACAGGG TGTACTAGAC
GGGCAAATTG TAGAAGGATC TGGTTACAAC AGCGATGTTT TAATGCGGAA TATAGCGGCT
GCATCTGTTT ATAGTGCAAG TGCTTCCTAT TCCAAATACC GGTATATGGC ATTGTTTGGC
CGTATTAACT ACAACTTGAA GGATAAGTAT ATCATTAATT TGACTGGACG TAGAGATGGC
AGCAGCCGTT TTGGGTCTAA CAATCGTTTT GCAAATTTCG GTGCAGCAGG GATAGCCTGG
ATCGTTAGTG AGGAGGATTT TATTAAGGAG CATCTTCCCT TTGTAAGTTT TGCTAAACTT
CGGGGGAGTT TTGGTATTAC AGGAAATGAT CAAATATCAG CTTATGGTTA TTTGGAATTA
TGGAAACCTC TTTCGGGTAG TTATCAGGGA GTTACAACTA TGTTCCCAGG AAATATTGCA
AATCCTGATT ATGCATGGGA AGTTAATAAA AAAGCTGAAG CTGCATTTGA CGTTGGTCTT
TTAAATAATA GGATTAATTT ATCCCTATCA TACTATTCAA ACCGATCTTC AAATCAACTG
GTTTCAGTTC AACTTCCATA CATAACAGGG TTTTCTGGTA TAACCGACAA TCTGGACGCT
ACAATCGGGA ATACGGGTTG GGAATTTGAA TTCATGACAA AAAATATATC GACCAAATCA
TTTCAGTGGT CTACATCTTT TAATTTTACG ATACCAAAAA ACAAATTAAT CGAGTTTCCT
AACCTAGAAA AAAGCACGTA TGCTAATCAA TATGTGATCG GTGAGCCCCT TGCCATTCAA
AAGTTGTATA AAACCAGTGT GAATGCCCAA ACCGGATTAT ATGCTGCTGA AGATTTTGAC
AAAAATGGTC TTATAGACAT TAAAGATCGT TATGTTGTCA CTTTTACTGG CAGGAAATAT
TATGGAGGGC TGCAAAATTC ATTTACATAT AAAGGTTTTG CCTTGGATGT CCTGTGTCAA
TTTGTAAAAC AAGCCGGAGC TGGGTATTTA AATGGCTTTT CAACTGCTGG TAGCTTTGCA
ACAGGCATTC CTACACGAAA CCAACCTGAT TTTGTACTTA ACAGATGGAG AACCCAGGAT
GATCCTGCTC CTTATCAAAA ATACAGCACA CTTACAGCCG CCAGTAACAG TCAGCTGGAT
GCAACTTCAA GAGGAAGTTA TGCTATTGAT AACGCGTCAT TTATCAGATT AAAAAACATT
TCCCTGTCTT ATAACTTGTC AGAAAAACTA ATTCAAAAAA TAAAATTGAA CAGTGCTAAA
GTTTTTTTTC AGGGGCAAAA TCTATTTACA ATCACACCGT ACAAGGGTTT AGATCCTGAA
ACATCAAGTA TAAATAACTT ACCAACGCTT CAGGTTTTTA CGGTTGGTAT GCAGCTAACA
TTCTAA
 
Protein sequence
MNFYNQIGCG PSDRYLIQIL RIMKFTTFII AITLVHVSAA GKAQISLDKK NVPIKEVFKS 
ITAQTGYDIL YSDRTLNDTE KISITANNEP LKTVLDRCLK GQLLEYELGN KTVIIKKRKA
LLLDKVIDFF TRIDVRGRVV DENNQPLVGA VIKIKDGLST TSTNSNGEFL LKKVAEDATL
TISFLGYETR EIKAAENISI IKLTPSTDKL EEVEINAGYY TVTDKERTGS ISRISSKEIE
KQPINNVLQV MQATVPGLQV IQNTGVPGGG FSVRIRGQNS LTQGNEPFYI IDGVPFTATS
LAPPVGVITP NASPLANINP ADIENIEVLK DADATAIYGS RGANGVILIT TKRGKAGKSS
VSFSANQGIS KVGRKLKLMN TQQYIEMRKE AKGNDNLAIS TTDYDINGTW DQNRYTDWQD
ELIGGSAPTT NILASLSGGA GNITYLIGGN FYSEGTVYPG DQTYKRSSGN FSLQYTSDNQ
KFDSSFDVNY SQINSNLFLN DLTPFIRLPP HYPSLLTDEG IINWGDNTMG SNPLAQLQKP
YEAKTNNLIA SAALNYKIIP DLKLKARFGY TIMDRKEFNS QPLSTYNPAN SPGPEQRISK
FSNNSTNNWT FESQVDYTRK IGNGKFNALL GVTFQQGVLD GQIVEGSGYN SDVLMRNIAA
ASVYSASASY SKYRYMALFG RINYNLKDKY IINLTGRRDG SSRFGSNNRF ANFGAAGIAW
IVSEEDFIKE HLPFVSFAKL RGSFGITGND QISAYGYLEL WKPLSGSYQG VTTMFPGNIA
NPDYAWEVNK KAEAAFDVGL LNNRINLSLS YYSNRSSNQL VSVQLPYITG FSGITDNLDA
TIGNTGWEFE FMTKNISTKS FQWSTSFNFT IPKNKLIEFP NLEKSTYANQ YVIGEPLAIQ
KLYKTSVNAQ TGLYAAEDFD KNGLIDIKDR YVVTFTGRKY YGGLQNSFTY KGFALDVLCQ
FVKQAGAGYL NGFSTAGSFA TGIPTRNQPD FVLNRWRTQD DPAPYQKYST LTAASNSQLD
ATSRGSYAID NASFIRLKNI SLSYNLSEKL IQKIKLNSAK VFFQGQNLFT ITPYKGLDPE
TSSINNLPTL QVFTVGMQLT F