Gene Phep_2600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2600 
Symbol 
ID8253707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3039116 
End bp3042271 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content43% 
IMG OID644936250 
ProductTonB-dependent receptor plug 
Protein accessionYP_003092866 
Protein GI255532494 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.267837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTTA AACCACTAAG GCTTTGTGGG CTTATTACTT TCAGGGTAAT AATTTCCTGG 
GTATTACTGA TGACTTTATC AATCCAAAGT AATGCTCAAA GCAGAAAAAT TTCAGGTAAA
GTGACCTCTG ATGACAATGA ACCGCTCTTT GGTGTCACCG TATTTGAAAA GGGTACTAAA
ACCGGGGCTA CTACTGACGC CGACGGTAAA TTTGTACTTA CTGTTTCAAA TGCAAACGCT
ACATTATCCT TCAGATATGT CGGCTACCAG TCAAGAGACA TTCCTTTAAA CAACAGAAAT
ACCATAACGC TTGCGCTTAA AGCATTGGAC AATGCCCTGA ACGAAATTGT AGTTGTTGGC
TACGGCACAA CAACCCGGAA AGACCTGACA GGTGCTGTTG GCCAGGTCAA AATGAACGAA
TTTGAGAAAG CCCCAGTAAA ATCATTCGAT GAGGCCCTTG CGGGTCGCGT AGCCGGGGTA
CAGGTTTCAG GCGCTGATGG ACAGCCTGGC GAAGTCAACA CCATTGTAAT CCGTGGTGCA
GGTTCTATCA CTCAGGACAA CTCACCCCTA TATGTCGTGG ACGGCTTTCC ACTGGAAGAT
GCAAACAACA ATTCAATAAA CCCAGCTGAT ATTGAATCTA TTGATATCCT GAAGGACGCA
TCGGCTACAG CCATATATGG CGCAAGAGGT GCAAATGGGG TCATCATCAT TACCACCAAA
AGAGGAAAAG CAGATGAGCC CGTGATCGCT TATAATGGTT ACTACGGTTT TCAGAAAAAC
ACGAAGACCA TAGACCTGAT GAATCCTTAT GAATTTGTAA AATACCAGCT GGAGCTCGAT
CAGGCAAATG CGACTACCAC TTACCTGAGC AATGGAAAAA CGCTGGAGGA TTACAGAAAT
GTGAAAGGAC TCGATTTCCA GGATCAGATT TATAGAAATG CGCCCATGCA AAGCCACGAT
ATCTCACTGA GAGGCGGATC TGATAAAACA CGTTATTCAA TTTCCGCCAA TATCATAGAC
CAGAATGGAG TAATTTTAAA CTCTGATTTC AGGCGTTATC AGGGACGCAT TACCCTGGAT
CAGAATGTAA ACCGAAAGTT GAAAGTAGGT GTAAACCTGA ACTACAGCTA TACCATTTCC
AATGGAACAA TAGTCAATGC CTCCTCCTCT ACCCAGTCGG CCAGCACCAA TCTGCTTTAT
GCAGTGTGGG GCTACCGTCC TGTTACCGGT AACGACAATA ACCTGGATGA TGAACTGTTT
GATCCTGGTT TTGATTACAA TGTAATTGCC GATTACAGGG TCAACCCCGT AATCTCTGCC
AAAAATGAGC TCCGCAGAAA CTTAACCACA GGATTAATTG CCAATGCTTA TGCCGAATAC
AAAATTATCC CTTCACTGAC CTTAAGGGTA ACCGGTGGCA TTACCAATAA TATGCTCAGA
AATGAATCCT TTTATAACTC CCTTACCCAA GCTGGCAATC CCCGTAATTC AAGTGGAGTA
AACGGATCTA TTTACTATAA CCCTGCCACA ACCTGGCTAA ATGAAAATAC GCTTACCTAT
AAAAAAATAT TTAACAAAGT ACATAACCTG ACCATACTTG GAGGTTATAC CATGCAGGGA
AACAAAACTG CAAGAAATGG TTTTAACGCA CTGAACGTTC CAAATGAGTT GTTAGGACTT
GACGGTCTGG ACGAATCACC TTCTCTGATC GGTACTTCGT TAAGCTCCAG GTGGGGGCTT
TTGTCTTACC TCGGACGTAT AAATTACAAT TACAATTCAA AATATTACCT GACAGCTTCT
TTCCGATCCG ACGGATCTTC AAAATTTGCC CCTGGAAAGC GCTGGGGCTA CTTTCCTTCC
GGGGCATTTT CATGGCGTAT GAGCGGTGAG GAATTTATGA AGAAATTAAA ATTTGTTTCC
GAAGCAAAAC TCCGTCTTAG CTACGGGGAA ACCGGTAACA ACAGGGTGGG CGACTTTCCT
TACCTCGATC AGATTACCCA ACCCAATTCT GCAGGTTATT CTTACGGCAA CGGCTCTCCG
TCAAAAGGTG CTATATTAAC TGCTTTTGGA AATGCCAGTC TGAGATGGGA AACTACAAAG
CAGGTAAACA TTGGCTACGA CCTTAGCCTG TTTAAGCAGC GCATCAACTT AACTGTAGAC
TTGTACAAAA AAACTACGCA TGACCTATTG CTGCAAGCTC TTCTACCCTA TACCACCGGA
TTGTCAAATG CCTACAAAAA TATAGGTAAA ATGCAAAACA ATGGACTTGA AATTACACTA
AACACCGTCA ATATTAATGG AAAAACATTC AGTTGGAACT CCAATTTCAA TATCAGTTTC
AACAAAAACA AAGTACTTGC ACTTGCTGAG AACCAAACTT CTCTAACCAG TCCGGTAAGT
TTCGATCAGA AATGGAATGC ACTTTCTCCT TATATTGCTG TTGTAGGTCA GGCAGTAGGC
CAATTATATG GTGCCATCTG GGATGGAGTT TATCAGTATG AAGATTTTGA TAAAAGTCCC
CAGGGTGTTT ACACACTAAA AGGCAATATC CCCAGAAATG GCAACCCTAC AGTGCAGCCC
GGGGATATCA AGTATAAAGA TATCAATGGT GATGGAAATG TTAATGCCTC AGATTTCACG
GTAATCGGAC GTGGTCTGCC TGTACATACA GGTGGCCTGT CAAATAATTT TATCTATAAA
GGTTTTGACC TGAATGTATT CCTGCAATGG TCCTACGGGA ACGACATCAT CAATGCCAAC
CGCCTGCTCT TTGAGGGGAA TGGAAAACAG TCCAGGTTCT TTAACCAATA TGCGAGTTAT
GCCAACAGGT GGCAGCCCGA TAATCCCAGC AATACTTTAT TCCGTACCGG GGGACAAGGC
CCTTTCTATT ATTCCAGCAG AGTAGTTGAA GACGGCTCAT ACCTCAGGCT AAAAACCATT
TCACTGGGCT ACAATGTACC TGTAAAAATA TATAAAAAAG CCAATCTCAA AAGTTTAAGA
ATCTATGCTT CTGCCCAGAA CATTGCTACC TGGACAAATT ACTCTGGACC AGACCCTGAA
GTCTCTGTAA GAAATTCAAC GCTTACACCG GGGTTCGATT TTTCAGCCTA TCCACGTGCA
AGTACACTCA TATTCGGATT AAACTTATCG CTCTAA
 
Protein sequence
MLLKPLRLCG LITFRVIISW VLLMTLSIQS NAQSRKISGK VTSDDNEPLF GVTVFEKGTK 
TGATTDADGK FVLTVSNANA TLSFRYVGYQ SRDIPLNNRN TITLALKALD NALNEIVVVG
YGTTTRKDLT GAVGQVKMNE FEKAPVKSFD EALAGRVAGV QVSGADGQPG EVNTIVIRGA
GSITQDNSPL YVVDGFPLED ANNNSINPAD IESIDILKDA SATAIYGARG ANGVIIITTK
RGKADEPVIA YNGYYGFQKN TKTIDLMNPY EFVKYQLELD QANATTTYLS NGKTLEDYRN
VKGLDFQDQI YRNAPMQSHD ISLRGGSDKT RYSISANIID QNGVILNSDF RRYQGRITLD
QNVNRKLKVG VNLNYSYTIS NGTIVNASSS TQSASTNLLY AVWGYRPVTG NDNNLDDELF
DPGFDYNVIA DYRVNPVISA KNELRRNLTT GLIANAYAEY KIIPSLTLRV TGGITNNMLR
NESFYNSLTQ AGNPRNSSGV NGSIYYNPAT TWLNENTLTY KKIFNKVHNL TILGGYTMQG
NKTARNGFNA LNVPNELLGL DGLDESPSLI GTSLSSRWGL LSYLGRINYN YNSKYYLTAS
FRSDGSSKFA PGKRWGYFPS GAFSWRMSGE EFMKKLKFVS EAKLRLSYGE TGNNRVGDFP
YLDQITQPNS AGYSYGNGSP SKGAILTAFG NASLRWETTK QVNIGYDLSL FKQRINLTVD
LYKKTTHDLL LQALLPYTTG LSNAYKNIGK MQNNGLEITL NTVNINGKTF SWNSNFNISF
NKNKVLALAE NQTSLTSPVS FDQKWNALSP YIAVVGQAVG QLYGAIWDGV YQYEDFDKSP
QGVYTLKGNI PRNGNPTVQP GDIKYKDING DGNVNASDFT VIGRGLPVHT GGLSNNFIYK
GFDLNVFLQW SYGNDIINAN RLLFEGNGKQ SRFFNQYASY ANRWQPDNPS NTLFRTGGQG
PFYYSSRVVE DGSYLRLKTI SLGYNVPVKI YKKANLKSLR IYASAQNIAT WTNYSGPDPE
VSVRNSTLTP GFDFSAYPRA STLIFGLNLS L